Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcHS_A0774 |
Symbol | sucB |
ID | 5594324 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | + |
Start bp | 787507 |
End bp | 788724 |
Gene Length | 1218 bp |
Protein Length | 405 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 640919950 |
Product | dihydrolipoamide succinyltransferase |
Protein accession | YP_001457524 |
Protein GI | 157160206 |
COG category | [C] Energy production and conversion |
COG ID | [COG0508] Pyruvate/2-oxoglutarate dehydrogenase complex, dihydrolipoamide acyltransferase (E2) component, and related enzymes |
TIGRFAM ID | [TIGR01347] 2-oxoglutarate dehydrogenase complex dihydrolipoamide succinyltransferase (E2 component) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 57 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTAGCG TAGATATTCT GGTCCCTGAC CTGCCTGAAT CCGTAGCCGA TGCCACCGTC GCAACCTGGC ATAAAAAACC CGGCGACGCA GTCGTACGTG ATGAAGTGCT GGTAGAAATC GAAACTGACA AAGTGGTACT GGAAGTACCG GCATCAGCAG ACGGCATTCT GGATGCGGTT CTGGAAGATG AAGGTACAAC GGTAACGTCT CGTCAGATCC TTGGTCGCCT GCGTGAAGGC AACAGCACCG GTAAAGAAAC CAGCGCCAAA TCTGAAGAGA AAGCGTCCAC TCCGGCGCAA CGCCAGCAGG CGTCTCTGGA AGAGCAAAAC AACGATGCGT TAAGCCCGGC GATCCGTCGC CTGCTGGCTG AACACAATCT CGACGCCAGC GCCATTAAAG GCACCGGTGT GGGTGGTCGT CTGACTCGTG AAGATGTGGA AAAACATCTG GCGAAAGCCC CGGCGAAAGA GTCTGCTCCG GCAGCGGCTG CTCCGGCGGC GCAACCGGCT CTGGCTGCAC GTAGTGAAAA ACGTGTCCCG ATGACTCGCC TGCGTAAGCG TGTGGCAGAG CGTCTGCTGG AAGCGAAAAA CTCCACCGCC ATGCTGACCA CGTTCAACGA AGTCAACATG AAGCCGATTA TGGATCTGCG TAAGCAGTAC GGTGAAGCGT TTGAAAAACG CCACGGCATC CGTCTGGGCT TTATGTCCTT CTACGTGAAA GCGGTGGTTG AAGCCCTGAA ACGTTACCCG GAAGTGAACG CTTCTATCGA CGGCGATGAC GTGGTTTACC ACAACTATTT CGACGTCAGC ATGGCGGTTT CTACGCCGCG CGGCCTGGTG ACGCCGGTTC TGCGTGATGT CGATACCCTC GGCATGGCAG ACATCGAGAA GAAAATCAAA GAGCTGGCAG TCAAAGGCCG TGACGGCAAG CTGACCGTTG AAGATCTGAC CGGTGGTAAC TTCACCATCA CCAACGGTGG TGTGTTCGGT TCCCTGATGT CTACGCCGAT CATCAACCCG CCGCAGAGCG CAATTCTGGG TATGCACGCT ATCAAAGATC GTCCGATGGC GGTGAATGGT CAGGTTGAGA TCCTGCCGAT GATGTACCTG GCGCTGTCCT ACGATCACCG TCTGATCGAT GGTCGCGAAT CCGTGGGCTT CCTGGTAACG ATCAAAGAGT TGCTGGAAGA TCCGACGCGT CTGCTGCTGG ACGTGTAG
|
Protein sequence | MSSVDILVPD LPESVADATV ATWHKKPGDA VVRDEVLVEI ETDKVVLEVP ASADGILDAV LEDEGTTVTS RQILGRLREG NSTGKETSAK SEEKASTPAQ RQQASLEEQN NDALSPAIRR LLAEHNLDAS AIKGTGVGGR LTREDVEKHL AKAPAKESAP AAAAPAAQPA LAARSEKRVP MTRLRKRVAE RLLEAKNSTA MLTTFNEVNM KPIMDLRKQY GEAFEKRHGI RLGFMSFYVK AVVEALKRYP EVNASIDGDD VVYHNYFDVS MAVSTPRGLV TPVLRDVDTL GMADIEKKIK ELAVKGRDGK LTVEDLTGGN FTITNGGVFG SLMSTPIINP PQSAILGMHA IKDRPMAVNG QVEILPMMYL ALSYDHRLID GRESVGFLVT IKELLEDPTR LLLDV
|
| |