snark/st - Un-Hack-Bar

snark/st

Author	SHA1	Message	Date
Hiltjo Posthuma	818ec746f4	fix unicode glitch in DCS strings, patch by Tim Allen Reported on the mailinglist: " I discovered recently that if an application running inside st tries to send a DCS string, subsequent Unicode characters get messed up. For example, consider the following test-case: printf '\303\277\033P\033\\\303\277' ...where: - \303\277 is the UTF-8 encoding of U+00FF LATIN SMALL LETTER Y WITH DIAERESIS (ÿ). - \033P is ESC P, the token that begins a DCS string. - \033\\ is ESC \, a token that ends a DCS string. - \303\277 is the same ÿ character again. If I run the above command in a VTE-based terminal, or xterm, or QTerminal, or pterm (PuTTY), I get the output: ÿÿ ...which is to say, the empty DCS string is ignored. However, if I run that command inside st (as of commit `9ba7ecf`), I get: ÿÃ¿ ...where those last two characters are \303\277 interpreted as ISO8859-1 characters, instead of UTF-8. I spent some time tracing through the state machines in st.c, and so far as I can tell, this is how it works currently: - ESC P sets the "ESC_DCS" and "ESC_STR" flags, indicating that incoming bytes should be collected into the strescseq buffer, rather than being interpreted. - ESC \ sets the "ESC_STR_END" flag (when ESC is received), and then calls strhandle() (when \ is received) to interpret the collected bytes. - If the collected bytes begin with 'P' (i.e. if this was a DCS string) strhandle() sets the "ESC_DCS" flag again, confusing the state machine. If my understanding is correct, fixing the problem should be as easy as removing the line that sets ESC_DCS from strhandle(): diff --git a/st.c b/st.c index ef8abd5..b5b805a 100644 --- a/st.c +++ b/st.c @@ -1897,7 +1897,6 @@ strhandle(void) xsettitle(strescseq.args[0]); return; case 'P': /* DCS -- Device Control String / - term.mode \|= ESC_DCS; case '_': / APC -- Application Program Command / case '^': / PM -- Privacy Message */ return; I've tried the above patch and it fixes my problem, but I don't know if it introduces any others. "	2020-06-17 21:35:39 +02:00
Hiltjo Posthuma	a2a704492b	config.def.h: add an option allowwindowops, by default off (secure) Similar to the xterm AllowWindowOps option, this is an option to allow or disallow certain (non-interactive) operations that can be insecure or exploited. NOTE: xsettitle() is not guarded by this because st does not support printing the window title. Else this could be exploitable (arbitrary code execution). Similar problems have been found in the past in other terminal emulators. The sequence for base64-encoded clipboard copy is now guarded because it allows a sequence written to the terminal to manipulate the clipboard of the running user non-interactively, for example: printf '\x1b]52;0;ZWNobyBoaQ0=\a'	2020-05-30 22:06:15 +02:00
Hiltjo Posthuma	e6e2c6199f	tiny style fix	2020-05-30 22:05:17 +02:00
Hiltjo Posthuma	94b8ec0021	Partially add back in "support REP (repeat) escape sequence" Add the functionality back in for xterm compatibility, but do not expose the capability in st.info (yet). Some notes: It was reverted because it caused some issues with ncurses in some configurations, namely when using BSD padding (--enable-bsdpad, BSD_TPUTS) in ncurses it caused issues with repeating digits. A fix has been upstreamed in ncurses since snapshot 20200523. The fix is also backported to OpenBSD -current.	2020-05-30 22:04:28 +02:00
Hiltjo Posthuma	475a0a36cb	Revert "support REP (repeat) escape sequence" This reverts commit `e8392b282c`. There is currently a bug in older ncurses versions (like on OpenBSD) where a fix for a bug with REP is not backported yet. Most likely in tty/tty_update.c: Noticed while using lynx (which uses ncurses/curses). To reproduce using lynx: echo "Z0000000" \| lynx -stdin or using the program: int main(void) { WINDOW *win; win = initscr(); printw("Z0000000"); refresh(); sleep(5); return 0; } This prints "ZZZZZZZ" (incorrectly).	2020-05-16 21:06:13 +02:00
Avi Halachmi (:avih)	e8392b282c	support REP (repeat) escape sequence The sequence \e[Nb prints the last printed char N (more) times if it's printable, and it's ignored after newline or other control chars. This is Ecma-048/ANSI-X3.6 sequence and not DEC VT. It's supported by xterm, and ncurses uses it when possible, e.g. when TERM is xterm* (and with this commit also st*). xterm supports only codepoints<=255, possibly due to internal limits. We support any value/codepoint which was placed in a cell. To test: - tput rep 65 4 -> prints 'AAAA' - printf "\342\225\246\033[4b" -> prints U+2566 1+4 times.	2020-05-16 14:08:10 +02:00
Jakub Leszczak	045a0fab4f	Fix selection: selscroll	2020-05-12 15:38:17 +02:00
Jakub Leszczak	9c30066e73	Fix selection: ignore ATTR_WRAP when rectangular selection in getsel	2020-05-12 15:38:02 +02:00
Jakub Leszczak	8304d4f059	Fix selection: selclear in tputc	2020-05-12 15:37:59 +02:00
Hiltjo Posthuma	914fb825df	code-style: add fallthrough comment Patch by Steve Ward, thanks.	2020-05-09 14:43:31 +02:00
Hiltjo Posthuma	cde480c693	optimize column width calculation and utf-8 encode for ASCII In particular on OpenBSD and on glibc wcwidth() is quite expensive. On musl there is little difference.	2020-05-09 14:11:25 +02:00
Hiltjo Posthuma	8211e36d28	fix for incorrect (partial) written sequences when libc wcwidth() == -1 Fix an issue with incorrect (partial) written sequences when libc wcwidth() == -1. The sequence is updated to on wcwidth(u) == -1: c = "\357\277\275" but len isn't. A way to reproduce in practise: * st -o dump.txt * In the terminal: printf '\xcd\xb8' - This is codepoint 888, on OpenBSD it reports wcwidth() == -1. - Quit the terminal. - Look in dump.txt (partial written sequence of "UTF_INVALID"). This was introduced in: " commit `11625c7166` Author: czarkoff@gmail.com <czarkoff@gmail.com> Date: Tue Oct 28 12:55:28 2014 +0100 Replace character with U+FFFD if wcwidth() is -1 Helpful when new Unicode codepoints are not recognized by libc." Change: Remove setting the sequence. If this happens to break something, another solution could be setting len = 3 for the sequence.	2020-05-09 14:07:52 +02:00
Hiltjo Posthuma	87545c612e	tiny code-style and typo-fix in comment	2020-05-09 14:05:04 +02:00
Jan Klemkow	d6ea0a1a61	replace exit(3) by _exit(2) in signal handler sigchld() exit(3) is not async-signal-safe but, _exit(2) is. This change prevents st to crash and dump core.	2020-04-30 01:21:21 +02:00
Ivan Tham	72e3f6c7c0	Update XIM cursor position only if changed Updating XIM cursor position is expensive, so only update it when cursor position changed.	2020-04-19 19:39:48 +02:00
Hiltjo Posthuma	33a9a45664	just remove the EOF message	2020-04-11 15:45:06 +02:00
Hiltjo Posthuma	e997303502	Fix small typos	2020-04-11 15:23:23 +02:00
Quentin Rameau	c1145268f6	Launch scroll program with the default shell	2020-04-11 15:23:23 +02:00
Roberto E. Vargas Caballero	fbae700a3f	Fix style issue	2020-04-11 15:23:23 +02:00
Roberto E. Vargas Caballero	e52319cc7d	ttyread: test for EOF while reading tty When a read operation returns 0 then it means that we arrived to the end of the file, and new reads will return 0 unless you do some other operation such as lseek(). This case happens with USB-232 adapters when they are unplugged.	2020-04-11 15:23:23 +02:00
Roberto E. Vargas Caballero	21e0d6e8b8	Add support for scroll(1) Scroll is a program that stores all the lines of its child and be used in st as a way of implementing scrollback. This solution is much better than implementing the scrollback in st itself because having a different program allows to use it in any other program without doing modifications to those programs.	2020-04-11 15:23:20 +02:00
Avi Halachmi (:avih)	2e54a21b5a	OSC 52 - copy to clipboard: don't limit to 382 bytes Strings which an application sends to the terminal in OSC, DCS, etc are typically small (title, colors, etc) but one exception is OSC 52 which copies text to the clipboard, and is used for instance by tmux. Previously st cropped these strings at 512 bytes, which for OSC 52 limited the copied text to 382 bytes (remaining buffer space before base64). This made it less useful than it can be. Now it's a dynamic growing buffer. It remains allocated after use, resets to 512 when a new string starts, or leaked on exit. Resetting/deallocating the buffer right after use (at strhandle) is possible with some more code, however, it doesn't always end up used, and to cover those cases too will require even more code, so resetting only on new string is good enough for now.	2019-11-10 22:45:54 +01:00
Hiltjo Posthuma	289c52b7aa	CSIEscape, STREscape: use size_t for buffer length	2019-11-10 22:45:54 +01:00
Avi Halachmi (:avih)	7ceb3d1f72	STREscape: don't trim prematurely STRescape holds strings in escape sequences such as OSC and DCS, and its buffer is 512 bytes. If the input is too big then trailing chars are ignored, but the test was off-by-1 such that it took 510 chars instead of 511 (before a terminating NULL is added). Now the full size can be utilized.	2019-11-10 22:45:54 +01:00
Avi Halachmi (:avih)	ea4d933ed9	base64dec: don't read out of bounds Previously, base64dec checked terminating input '\0' every 4 calls to base64dec_getc, where the latter progressed one or more chars on each call, and could read past '\0' in the way it was used. The input to base64dec currently comes only from OSC 52 escape seq (copy to clipboard), and reading past '\0' or even past the buffer boundary was easy to trigger. Also, even if we could trust external input to be valid base64, there are different base64 standards, and not all of them require padding to 4 bytes blocks (using trailing '=' chars). It didn't affect short OSC 52 strings because the buffer is initialized to 0's, so typically it did stop within the buffer, but if the string was trimmed to fit (the buffer is 512 bytes) then it did also read past the end of the buffer, and the decoded suffix ended up arbitrary. This patch makes base64dec_getc not progress past '\0', and instead produce fake trailing padding of '='. Additionally, at base64dec, if padding is detected at the first or second byte of a quartet, then we identify it as invalid and abort (a valid quartet has at least two leading non-padding bytes).	2019-11-10 22:45:54 +01:00
Avi Halachmi (:avih)	f1546cf9c1	selection: fix view to match actual selection on first cell	2019-04-14 13:50:20 +02:00
Hiltjo Posthuma	21367a040f	revert part of commit `add0211522` "use iswspace()/iswpunct() to find word delimiters this inverts the configuration logic: you no longer provide a list of delimiters -- all space and punctuation characters are considered delimiters, unless listed in extrawordchars." Feedback from IRC and personal preference.	2019-03-15 20:40:16 +01:00
Hiltjo Posthuma	b650256044	dont print color warning on color reset OSC 104 without parameter also print explicitly "(null)" when printf "%s" p=NULL. noticed when exiting mutt: printf '\x1b]104\x07'	2019-03-15 14:47:08 +01:00
Hiltjo Posthuma	9acec468fb	minor code-style, initialize var at the top of function	2019-03-15 14:42:50 +01:00
Lauri Tirkkonen	add0211522	use iswspace()/iswpunct() to find word delimiters this inverts the configuration logic: you no longer provide a list of delimiters -- all space and punctuation characters are considered delimiters, unless listed in extrawordchars.	2019-03-15 12:25:13 +01:00
Lauri Tirkkonen	d5efd256aa	replace utf8strchr with wcschr	2019-03-15 12:24:13 +01:00
Lauri Tirkkonen	75b4ba4b4b	be silent about explicitly unhandled mouse modes	2019-03-13 17:51:58 +01:00
Hiltjo Posthuma	4e0135afec	style: remove double empty newlines	2019-03-03 11:23:54 +01:00
Ivan Tham	e85b6b6466	better Input Method Editor (IME) support Features: - Allow input methods swap with hotkey (E.g. left ctrl + left shift). - Over-the-spot pre-editing style, pre-edit data placed over insertion point. - Restart IME without segmentation fault. TODO: - Automatically pickup IME if st started before IME	2019-02-12 18:45:15 +01:00
Lauri Tirkkonen	096b125db7	output child WEXITSTATUS/WTERMSIG on abnormal termination	2018-12-11 18:36:02 +01:00
Hiltjo Posthuma	b4d68d4daa	st: small typofix in comment	2018-11-04 14:30:56 +01:00
Hiltjo Posthuma	30ec9a3dc3	small code-style fix	2018-09-11 19:06:35 +02:00
Quentin Rameau	67d0cb65d0	Remove the ISO 14755 feature And move it to the patches section. Keeping it would force to add an exec pledge on OpenBSD, and some people think it's bloated, so bye!	2018-09-11 19:05:55 +02:00
Hiltjo Posthuma	235a783e03	code-style for pledge(2) feedback from Klemens, thanks	2018-05-25 13:04:09 +02:00
Hiltjo Posthuma	30ce2cc002	Pledge on OpenBSD	2018-05-25 11:59:28 +02:00
Hiltjo Posthuma	041912a791	error message style and use strerror in a few places	2018-03-29 18:30:05 +02:00
Daniel Tameling	74cff67bd7	set sel.alt in selstart instead of selextend	2018-03-29 18:15:29 +02:00
Hiltjo Posthuma	6ac8c8aa50	selextend: clarify: !sel.mode == SEL_IDLE	2018-03-17 13:48:29 +01:00
Hiltjo Posthuma	7648697f71	minor code-style: whitespace fixes	2018-03-16 16:45:58 +01:00
Hiltjo Posthuma	8b8255ac0e	regression: include termios.h for tcsendbreak etc	2018-03-09 15:35:34 +01:00
Devin J. Pohly	20e0da7f14	General cleanup Simplifies logic in a couple places and removes a redundant function call. Signed-off-by: Devin J. Pohly <djpohly@gmail.com>	2018-02-25 21:56:26 -06:00
Devin J. Pohly	403c57ebb5	Clean up #includes Signed-off-by: Devin J. Pohly <djpohly@gmail.com>	2018-02-25 21:56:26 -06:00
Devin J. Pohly	e0215d5377	Reduce visibility wherever possible When possible, declare functions/variables static and move struct definitions out of headers. In order to allow utf8decode to become internal, use codepoint for DECSCUSR extension directly. Signed-off-by: Devin J. Pohly <djpohly@gmail.com>	2018-02-25 21:56:26 -06:00
Devin J. Pohly	30683c70ab	Limit usage of extern to config.h globals Prefer passing arguments to declaring external global variables. The only remaining usage of extern is for config.h variables which are needed in st.c instead of x.c (where it is now included). Signed-off-by: Devin J. Pohly <djpohly@gmail.com>	2018-02-25 21:56:26 -06:00
Devin J. Pohly	a3beb626d2	Remove x.c dependency on term The xinit function only needs to the rows/cols, so pass those in rather than accessing term directly. With a bit of arithmetic, we are able to avoid the need for term.row and term.col in x2col, y2row, and xdrawglyphfontspecs as well, completing the removal. Term is now fully internal to st.c. Signed-off-by: Devin J. Pohly <djpohly@gmail.com>	2018-02-25 21:56:26 -06:00

1 2 3 4 5 ...

824 commits