Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_1845 |
Symbol | |
ID | 4809391 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 2188980 |
End bp | 2189897 |
Gene Length | 918 bp |
Protein Length | 305 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 640107259 |
Product | homoserine O-succinyltransferase |
Protein accession | YP_001038259 |
Protein GI | 125974349 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1897] Homoserine trans-succinylase |
TIGRFAM ID | [TIGR01001] homoserine O-succinyltransferase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.00000102531 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCAATAA AGATACCTGA CAGTTTGCCG GCGAAAGAAG TATTAACCAA TGAGAATATA TTTGTAATGG ACGAACACAG GGCTCTGCAC CAGGATGTAA GACCCTTAAG GATTGCCATT TTGAATCTTA TGCCTACAAA GATTACCACG GAGACACAGC TTCTTCGACT GATTGGGAAT ACGCCTATTC AGGTTGAGAT AGAGCTTTTG CATCCGAAAA CCCATGTATC AAAGAATACT CCGGAAGAAC ATTTAACAAA ATTTTACAAA ACCTTTGATG AGGTAAAGGA TGAAAAATTT GACGGACTTA TAATTACCGG TGCACCGGTG GAACAAATGG AGTTTGAAGA GGTTAATTAC TGGGAAGAGC TTAAAAAGAT AATGGACTGG AGCGTTCACA ATGTGTATTC GACATTTCAC ATTTGCTGGG GAGCTCAGGC GGCTTTATAC CATCATTACG GCATAAAGAA ATATCCTTTG AAGGAGAAAA TGTTCGGTAT CTTCCCACAC CGTATTTGCA AGCCAAATAC AATGCTTTTA AGAGGATTTG ACGATTGCTT CTACGCTCCT CATTCCAGGC ACACGGAGGT AAGAAGGGAA GATATTGAAA AGGTGGGCGA AATTGATATT CTTTCCGACT CGGAAGAAGC AGGAGTGTAC ATTATGAAGA CCAGGGGAGG AAGACAGGTT TTTGTGACCG GCCATTCCGA GTATGACCAG TTTACCTTGA AAGAGGAGTA TGAAAGAGAT TTGGCAAAAG GACTCAAGAT AAAAATGCCA AAGAACTACT TCCCGGATGA TGACCCGACA AAACCACCGG TTGTTAATTG GAGAGGGCAT GCAAATCTCC TTTTTTCAAA CTGGCTTAAC TATTATGTAT ACCAGGAAAC GCCGTTTGAT TTGAATGAAT TAAAATAA
|
Protein sequence | MPIKIPDSLP AKEVLTNENI FVMDEHRALH QDVRPLRIAI LNLMPTKITT ETQLLRLIGN TPIQVEIELL HPKTHVSKNT PEEHLTKFYK TFDEVKDEKF DGLIITGAPV EQMEFEEVNY WEELKKIMDW SVHNVYSTFH ICWGAQAALY HHYGIKKYPL KEKMFGIFPH RICKPNTMLL RGFDDCFYAP HSRHTEVRRE DIEKVGEIDI LSDSEEAGVY IMKTRGGRQV FVTGHSEYDQ FTLKEEYERD LAKGLKIKMP KNYFPDDDPT KPPVVNWRGH ANLLFSNWLN YYVYQETPFD LNELK
|
| |