Gene EcSMS35_0739 details


Gene Information

Locus tag: EcSMS35_0739
Symbol: sucB
ID: 6145713
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 746247
End bp: 747464
Gene Length: 1218 bp
Protein Length: 405 aa
Translation table: 11
GC content: 56%
IMG OID: 641615628
Product: dihydrolipoamide succinyltransferase
Protein accession: YP_001742827
Protein GI: 170681017
COG category: [C] Energy production and conversion
COG ID: [COG0508] Pyruvate/2-oxoglutarate dehydrogenase complex, dihydrolipoamide acyltransferase (E2) component, and related enzymes
TIGRFAM ID: [TIGR01347] 2-oxoglutarate dehydrogenase complex dihydrolipoamide succinyltransferase (E2 component)
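
The coordinate and length fields in this record are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch in Python, assuming the coordinates are 1-based and inclusive and that the gene length includes a single terminal stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information record.
start_bp = 746247
end_bp = 747464

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon, which is not counted
# in the protein length.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints "1218 405"
```

Both values agree with the Gene Length (1218 bp) and Protein Length (405 aa) fields.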


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 50
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTAGCG TAGATATTCT GGTCCCTGAC CTGCCTGAAT CCGTAGCCGA TGCCACCGTC 
GCAACCTGGC ATAAAAAACC CGGCGACGCA GTCGTACGTG ATGAAGTGCT GGTAGAAATC
GAAACTGACA AAGTGGTACT GGAAGTACCG GCATCAGCAG ACGGCATTCT GGATGCGGTT
CTGGAAGATG AAGGTACAAC GGTAACGTCT CGTCAGATCC TTGGTCGCCT GCGTGAAGGC
AACAGCGCCG GTAAAGAAAC CAGCGCCAAA TCTGAAGAGA AAGCGTCCAC TCCGGCGCAA
CGCCAGCAGG CGTCTCTGGA AGAGCAAAAC AACGATGCGT TAAGCCCGGC GATCCGTCGC
CTGCTGGCTG AACACAATCT TGATGCCAGC GCCATTAAAG GCACCGGTGT AGGTGGTCGT
CTGACCCGTG AAGATGTGGA AAAACATCTG GCGAAAGCCC CGGCGAAAGA GTCTGCTCCG
GCAGCGGCTG CTCCGGCGGC GCAACCGGCC CTGGCTGCAC GTAGCGAAAA ACGTGTGCCG
ATGACTCGCC TGCGTAAGCG TGTGGCAGAG CGTCTGCTGG AAGCGAAAAA CTCCACCGCC
ATGCTGACCA CGTTCAACGA AGTCAACATG AAGCCGATTA TGGATCTGCG TAAGCAGTAC
GGTGAAGCGT TTGAAAAACG CCACGGCATC CGTCTGGGCT TTATGTCCTT CTACGTGAAA
GCGGTGGTTG AAGCCCTGAA ACGTTACCCG GAAGTGAATG CTTCTATCGA CGGCGATGAC
GTGGTTTACC ACAACTATTT CGACGTCAGC ATGGCGGTTT CTACGCCGCG CGGCCTGGTG
ACGCCGGTAC TGCGTGATGT CGATACCCTC GGTATGGCAG ACATTGAGAA GAAAATTAAA
GAGCTGGCAG TTAAAGGCCG TGACGGCAAG CTGACGGTTG AAGATCTGAC CGGTGGTAAC
TTCACCATCA CCAACGGTGG TGTGTTCGGT TCCCTGATGT CTACGCCGAT CATCAACCCG
CCGCAGAGCG CAATTCTGGG TATGCACGCT ATCAAAGATC GTCCGATGGC GGTGAATGGT
CAGGTTGAGA TCCTGCCGAT GATGTACCTG GCGCTGTCCT ACGATCACCG TCTGATCGAT
GGTCGCGAAT CCGTGGGCTT CCTGGTAACG ATCAAAGAGT TGCTGGAAGA TCCGACGCGT
CTGCTGCTGG ACGTGTAG
 
Protein sequence
MSSVDILVPD LPESVADATV ATWHKKPGDA VVRDEVLVEI ETDKVVLEVP ASADGILDAV 
LEDEGTTVTS RQILGRLREG NSAGKETSAK SEEKASTPAQ RQQASLEEQN NDALSPAIRR
LLAEHNLDAS AIKGTGVGGR LTREDVEKHL AKAPAKESAP AAAAPAAQPA LAARSEKRVP
MTRLRKRVAE RLLEAKNSTA MLTTFNEVNM KPIMDLRKQY GEAFEKRHGI RLGFMSFYVK
AVVEALKRYP EVNASIDGDD VVYHNYFDVS MAVSTPRGLV TPVLRDVDTL GMADIEKKIK
ELAVKGRDGK LTVEDLTGGN FTITNGGVFG SLMSTPIINP PQSAILGMHA IKDRPMAVNG
QVEILPMMYL ALSYDHRLID GRESVGFLVT IKELLEDPTR LLLDV
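
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The following is a rough sketch in Python of how that correspondence could be checked on the first and last blocks of the gene sequence. The helper `translate` and the names `CODON_TABLE`, `first_block`, and `last_block` are hypothetical and not part of the record. The sketch uses the standard codon assignments, which translation table 11 shares for internal codons, and it ignores table 11's alternative start codons, which do not matter here because the gene starts with ATG.

```python
from itertools import product

# Standard codon assignments in the conventional T, C, A, G order.
# Assumption: only unambiguous bases (A, C, G, T) occur in the input.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-mers
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last blocks of the gene sequence, copied from the record.
first_block = ("ATGAGTAGCG TAGATATTCT GGTCCCTGAC "
               "CTGCCTGAAT CCGTAGCCGA TGCCACCGTC")
last_block = "CTGCTGCTGG ACGTGTAG"

print(translate(first_block))  # prints "MSSVDILVPDLPESVADATV"
print(translate(last_block))   # prints "LLLDV"
```

The two outputs match the first 20 and the last 5 residues of the protein sequence. Both blocks start on a codon boundary because each full line of the gene sequence holds 60 bases.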