Gene EcSMS35_0738 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_0738 
SymbolsucA 
ID6145269 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp743431 
End bp746232 
Gene Length2802 bp 
Protein Length933 aa 
Translation table11 
GC content56% 
IMG OID641615627 
Product2-oxoglutarate dehydrogenase E1 component 
Protein accessionYP_001742826 
Protein GI170681528 
COG category[C] Energy production and conversion 
COG ID[COG0567] 2-oxoglutarate dehydrogenase complex, dehydrogenase (E1) component, and related enzymes 
TIGRFAM ID[TIGR00239] 2-oxoglutarate dehydrogenase, E1 component 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones49 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGAACA GCGCTTTGAA AGCCTGGTTG GACTCTTCTT ACCTCTCTGG CGCAAACCAG 
AGCTGGATAG AACAGCTCTA TGAAGACTTC TTAACCGATC CTGACTCGGT TGACGCTAAC
TGGCGTTCGA CGTTCCAGCA GTTACCTGGT ACGGGAGTCA AACCGGATCA ATTCCACTCT
CAAACGCGTG AATATTTCCG CCGCCTGGCG AAAGACGCTT CACGTTACTC TTCAACGATC
TCCGACCCTG ACACCAATGT GAAGCAGGTT AAAGTCCTGC AGCTCATTAA CGCATACCGC
TTCCGTGGTC ACCAGCATGC GAATCTCGAT CCGCTGGGAC TGTGGCAGCA AGATAAAGTG
GCCGATCTGG ATCCGTCTTT CCACGATCTG ACCGAAGCAG ACTTCCAGGA GACCTTCAAC
GTCGGTTCAT TTGCCAGCGG CAAAGAAACC ATGAAACTCG GCGAGCTGCT GGAAGCCCTC
AAGCAAACCT ACTGCGGCCC GATTGGTGCC GAGTATATGC ACATCACCAG CACCGAAGAA
AAACGCTGGA TCCAACAGCG TATCGAGTCT GGTCGCGCGA CTTTCAATAG CGAAGAGAAA
AAACGCTTCT TAAGCGAACT GACCGCCGCT GAAGGCCTTG AACGTTACCT CGGCGCAAAA
TTCCCTGGCG CAAAACGCTT CTCGCTGGAA GGCGGTGACG CGTTAATCCC GATGCTTAAA
GAGATGATCC GCCACGCTGG CAACAGCGGC ACCCGCGAAG TGGTTCTCGG GATGGCGCAC
CGTGGTCGTC TGAACGTGCT GGTGAACGTG CTGGGTAAAA AACCGCAAGA CTTGTTCGAC
GAGTTTGCCG GTAAACATAA AGAACACCTC GGCACGGGCG ACGTGAAATA CCACATGGGC
TTCTCGTCTG ACTTCCAGAC CGATGGCGGC CTGGTGCACC TGGCGCTGGC GTTTAACCCG
TCTCACCTTG AGATTGTCAG CCCGGTAGTT ATCGGTTCTG TTCGTGCCCG TCTGGACAGA
CTCGATGAGC CGAGCAGCAA CAAAGTGCTG CCAATCACCA TCCACGGTGA CGCCGCAGTG
ACCGGGCAGG GCGTGGTTCA GGAAACCCTG AACATGTCGA AAGCGCGTGG TTATGAAGTT
GGCGGTACGG TACGTATCGT TATCAACAAC CAGGTTGGCT TCACCACCTC TAATCCGCTG
GATGCTCGTT CTACACCGTA CTGTACTGAT ATCGGTAAGA TGGTTCAGGC ACCGATTTTC
CACGTAAACG CGGATGATCC GGAAGCCGTT GCTTTTGTGA CCCGTCTGGC GCTCGATTTC
CGTAACACCT TTAAACGTGA TGTCTTCATC GACCTGGTGT GCTACCGCCG TCACGGCCAC
AACGAAGCCG ACGAGCCGAG CGCAACCCAG CCGCTGATGT ATCAGAAAAT CAAAAAACAT
CCGACGCCGC GCAAAATCTA TGCTGACAAG CTGGAGCAGG AAAAAGTCGC GACGCTGGAA
GATGCCACCG AGATGGTTAA CCTGTACCGC GATGCGCTGG ATGCTGGCGA TTGCGTTGTA
GCAGAGTGGC GTCCGATGAA CATGCACTCT TTCACCTGGT CGCCGTACCT CAACCACGAA
TGGGACGAAG AGTACCCGAA CAAAGTTGAG ATGAAGCGCC TGCAGGAACT GGCTAAACGC
ATCAGCACGG TGCCGGAAGC AGTTGAAATG CAGTCTCGCG TTGCCAAGAT TTATGGCGAT
CGCCAGGCGA TGGCTGCTGG TGAGAAACTG TTCGACTGGG GCGGCGCGGA AAACCTCGCT
TACGCCACGC TGGTTGACGA AGGCATTCCG GTTCGCCTGT CGGGTGAAGA CTCCGGTCGC
GGTACCTTCT TCCACCGCCA CGCGGTGATC CACAACCAGT CTAACGGTTC CACTTACACG
CCGCTGCAAC ACATCCATAA CGGCCAGGGC GCGTTCCGTG TCTGGGACTC CGTACTGTCT
GAAGAAGCCG TACTGGCGTT TGAATACGGT TATGCCACCG CAGAACCACG CACTCTGACT
ATCTGGGAAG CGCAATTCGG TGACTTCGCC AACGGTGCTC AGGTGGTTAT CGACCAGTTC
ATCTCCTCTG GCGAGCAGAA ATGGGGCCGG ATGTGTGGTC TGGTGATGTT GCTGCCGCAC
GGTTACGAAG GGCAGGGGCC GGAGCACTCC TCCGCGCGTC TGGAACGTTA TCTGCAACTT
TGCGCTGAGC AAAACATGCA GGTGTGCGTA CCGTCTACCC CGGCACAGGT TTACCACATG
CTGCGTCGTC AGGCGCTGCG CGGGATGCGT CGTCCGCTGG TCGTGATGTC GCCGAAATCC
CTGCTGCGTC ATCCGCTGGC GGTTTCCAGC CTCGAAGAAC TGGCGAACGG CACCTTCCTG
CCAGCCATCG GTGAAATCGA CGAGCTTGAT CCGAAGGGCG TGAAGCGCGT AGTGATGTGT
TCTGGTAAGG TTTATTACGA CCTGCTGGAA CAGCGTCGTA AGAACAATCA ACACGATGTC
GCCATTGTGC GTATCGAGCA ACTCTACCCG TTCCCGCATA AAGCGATGCA GGAAGTGTTG
CAGCAGTTTG CTCACGTCAA GGATTTTGTC TGGTGCCAGG AAGAGCCGCT CAATCAGGGC
GCATGGTACT GCAGCCAGCA TCATTTCCGT GAAGTGATTC CGTTTGGGGC TTCTCTGCGT
TATGCAGGCC GCCCGGCCTC CGCCTCTCCG GCGGTAGGGT ATATGTCCGT TCACCAGAAA
CAGCAACAAG ATCTGGTTAA TGACGCGCTG AACGTCGAAT AA
 
Protein sequence
MQNSALKAWL DSSYLSGANQ SWIEQLYEDF LTDPDSVDAN WRSTFQQLPG TGVKPDQFHS 
QTREYFRRLA KDASRYSSTI SDPDTNVKQV KVLQLINAYR FRGHQHANLD PLGLWQQDKV
ADLDPSFHDL TEADFQETFN VGSFASGKET MKLGELLEAL KQTYCGPIGA EYMHITSTEE
KRWIQQRIES GRATFNSEEK KRFLSELTAA EGLERYLGAK FPGAKRFSLE GGDALIPMLK
EMIRHAGNSG TREVVLGMAH RGRLNVLVNV LGKKPQDLFD EFAGKHKEHL GTGDVKYHMG
FSSDFQTDGG LVHLALAFNP SHLEIVSPVV IGSVRARLDR LDEPSSNKVL PITIHGDAAV
TGQGVVQETL NMSKARGYEV GGTVRIVINN QVGFTTSNPL DARSTPYCTD IGKMVQAPIF
HVNADDPEAV AFVTRLALDF RNTFKRDVFI DLVCYRRHGH NEADEPSATQ PLMYQKIKKH
PTPRKIYADK LEQEKVATLE DATEMVNLYR DALDAGDCVV AEWRPMNMHS FTWSPYLNHE
WDEEYPNKVE MKRLQELAKR ISTVPEAVEM QSRVAKIYGD RQAMAAGEKL FDWGGAENLA
YATLVDEGIP VRLSGEDSGR GTFFHRHAVI HNQSNGSTYT PLQHIHNGQG AFRVWDSVLS
EEAVLAFEYG YATAEPRTLT IWEAQFGDFA NGAQVVIDQF ISSGEQKWGR MCGLVMLLPH
GYEGQGPEHS SARLERYLQL CAEQNMQVCV PSTPAQVYHM LRRQALRGMR RPLVVMSPKS
LLRHPLAVSS LEELANGTFL PAIGEIDELD PKGVKRVVMC SGKVYYDLLE QRRKNNQHDV
AIVRIEQLYP FPHKAMQEVL QQFAHVKDFV WCQEEPLNQG AWYCSQHHFR EVIPFGASLR
YAGRPASASP AVGYMSVHQK QQQDLVNDAL NVE