Gene EcHS_A0774 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A0774 
SymbolsucB 
ID5594324 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp787507 
End bp788724 
Gene Length1218 bp 
Protein Length405 aa 
Translation table11 
GC content56% 
IMG OID640919950 
Productdihydrolipoamide succinyltransferase 
Protein accessionYP_001457524 
Protein GI157160206 
COG category[C] Energy production and conversion 
COG ID[COG0508] Pyruvate/2-oxoglutarate dehydrogenase complex, dihydrolipoamide acyltransferase (E2) component, and related enzymes 
TIGRFAM ID[TIGR01347] 2-oxoglutarate dehydrogenase complex dihydrolipoamide succinyltransferase (E2 component) 


Plasmid Coverage information

Num covering plasmid clones57 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTAGCG TAGATATTCT GGTCCCTGAC CTGCCTGAAT CCGTAGCCGA TGCCACCGTC 
GCAACCTGGC ATAAAAAACC CGGCGACGCA GTCGTACGTG ATGAAGTGCT GGTAGAAATC
GAAACTGACA AAGTGGTACT GGAAGTACCG GCATCAGCAG ACGGCATTCT GGATGCGGTT
CTGGAAGATG AAGGTACAAC GGTAACGTCT CGTCAGATCC TTGGTCGCCT GCGTGAAGGC
AACAGCACCG GTAAAGAAAC CAGCGCCAAA TCTGAAGAGA AAGCGTCCAC TCCGGCGCAA
CGCCAGCAGG CGTCTCTGGA AGAGCAAAAC AACGATGCGT TAAGCCCGGC GATCCGTCGC
CTGCTGGCTG AACACAATCT CGACGCCAGC GCCATTAAAG GCACCGGTGT GGGTGGTCGT
CTGACTCGTG AAGATGTGGA AAAACATCTG GCGAAAGCCC CGGCGAAAGA GTCTGCTCCG
GCAGCGGCTG CTCCGGCGGC GCAACCGGCT CTGGCTGCAC GTAGTGAAAA ACGTGTCCCG
ATGACTCGCC TGCGTAAGCG TGTGGCAGAG CGTCTGCTGG AAGCGAAAAA CTCCACCGCC
ATGCTGACCA CGTTCAACGA AGTCAACATG AAGCCGATTA TGGATCTGCG TAAGCAGTAC
GGTGAAGCGT TTGAAAAACG CCACGGCATC CGTCTGGGCT TTATGTCCTT CTACGTGAAA
GCGGTGGTTG AAGCCCTGAA ACGTTACCCG GAAGTGAACG CTTCTATCGA CGGCGATGAC
GTGGTTTACC ACAACTATTT CGACGTCAGC ATGGCGGTTT CTACGCCGCG CGGCCTGGTG
ACGCCGGTTC TGCGTGATGT CGATACCCTC GGCATGGCAG ACATCGAGAA GAAAATCAAA
GAGCTGGCAG TCAAAGGCCG TGACGGCAAG CTGACCGTTG AAGATCTGAC CGGTGGTAAC
TTCACCATCA CCAACGGTGG TGTGTTCGGT TCCCTGATGT CTACGCCGAT CATCAACCCG
CCGCAGAGCG CAATTCTGGG TATGCACGCT ATCAAAGATC GTCCGATGGC GGTGAATGGT
CAGGTTGAGA TCCTGCCGAT GATGTACCTG GCGCTGTCCT ACGATCACCG TCTGATCGAT
GGTCGCGAAT CCGTGGGCTT CCTGGTAACG ATCAAAGAGT TGCTGGAAGA TCCGACGCGT
CTGCTGCTGG ACGTGTAG
 
Protein sequence
MSSVDILVPD LPESVADATV ATWHKKPGDA VVRDEVLVEI ETDKVVLEVP ASADGILDAV 
LEDEGTTVTS RQILGRLREG NSTGKETSAK SEEKASTPAQ RQQASLEEQN NDALSPAIRR
LLAEHNLDAS AIKGTGVGGR LTREDVEKHL AKAPAKESAP AAAAPAAQPA LAARSEKRVP
MTRLRKRVAE RLLEAKNSTA MLTTFNEVNM KPIMDLRKQY GEAFEKRHGI RLGFMSFYVK
AVVEALKRYP EVNASIDGDD VVYHNYFDVS MAVSTPRGLV TPVLRDVDTL GMADIEKKIK
ELAVKGRDGK LTVEDLTGGN FTITNGGVFG SLMSTPIINP PQSAILGMHA IKDRPMAVNG
QVEILPMMYL ALSYDHRLID GRESVGFLVT IKELLEDPTR LLLDV