Gene ECH74115_0818 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_0818 
SymbolsucA 
ID6969912 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp838622 
End bp841423 
Gene Length2802 bp 
Protein Length933 aa 
Translation table11 
GC content56% 
IMG OID643384843 
Product2-oxoglutarate dehydrogenase E1 component 
Protein accessionYP_002269349 
Protein GI209399395 
COG category[C] Energy production and conversion 
COG ID[COG0567] 2-oxoglutarate dehydrogenase complex, dehydrogenase (E1) component, and related enzymes 
TIGRFAM ID[TIGR00239] 2-oxoglutarate dehydrogenase, E1 component 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.919234 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones61 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGAACA GCGCTTTGAA AGCCTGGTTG GACTCTTCTT ACCTCTCTGG CGCAAACCAG 
AGCTGGATAG AACAGCTCTA TGAAGACTTC TTAACCGATC CTGACTCGGT TGACGCTAAC
TGGCGTTCGA CGTTCCAGCA GTTACCTGGT ACGGGAGTCA AACCGGATCA ATTCCACTCT
CAAACGCGTG AATATTTCCG CCGCCTGGCG AAAGACGCTT CACGTTACTC TTCAACGATC
TCCGACCCTG ACACCAATGT GAAGCAGGTA AAAGTCCTGC AGCTCATTAA CGCATACCGC
TTCCGTGGTC ACCAGCATGC GAATCTCGAT CCGCTGGGAC TGTGGCAGCA AGATAAAGTG
GCCGATCTGG ATCCGTCTTT CCACGATCTG ACCGAAGCAG ACTTCCAGGA GACCTTCAAC
GTCGGTTCAT TTGCCAGCGG CAAAGAAACC ATGAAACTCG GCGAGCTGCT GGAAGCCCTC
AAGCAAACCT ACTGCGGCCC GATTGGTGCC GAGTATATGC ACATTACCAG CACCGAAGAA
AAACGCTGGA TCCAACAGCG TATCGAGTCT GGTCGCGCGA CTTTCAATAG CGAAGAGAAA
AAACGCTTCC TAAGCGAACT GACCGCCGCT GAAGGCCTTG AACGTTACCT CGGTGCAAAA
TTCCCTGGCG CAAAACGCTT CTCGCTGGAA GGCGGTGACG CGTTAATCCC GATGCTTAAA
GAGATGATTC GCCACGCTGG CAACAGCGGC ACCCGCGAAG TGGTTCTCGG GATGGCGCAC
CGTGGTCGTC TGAACGTGCT GGTGAACGTG CTGGGTAAAA AACCGCAAGA CTTGTTCGAC
GAGTTTGCCG GTAAACATAA AGAACACCTC GGCACGGGTG ACGTGAAATA CCACATGGGC
TTCTCGTCTG ACTTCCAGAC CGATGGCGGC CTGGTACATC TGGCGCTGGC GTTTAACCCG
TCTCACCTTG AGATTGTCAG CCCGGTCGTT ATCGGTTCTG TTCGTGCCCG TCTGGACAGA
CTTGATGAGC CGAGCAGCAA CAAAGTGCTG CCAATCACCA TTCACGGTGA CGCCGCAGTG
ACCGGGCAGG GCGTGGTTCA GGAAACTCTG AACATGTCGA AAGCGCGTGG TTATGAAGTT
GGCGGTACGG TACGTATCGT TATCAACAAC CAGGTTGGCT TCACCACCTC TAACCCGCTG
GATGCCCGTT CAACGCCGTA CTGTACTGAT ATCGGTAAGA TGGTTCAGGC ACCGATTTTT
CACGTTAACG CGGATGATCC GGAAGCCGTT GCCTTTGTGA CCCGTCTGGC GCTCGATTTC
CGTAACACCT TTAAACGTGA TGTCTTCATC GACCTGGTGT GCTACCGCCG TCACGGCCAC
AACGAAGCCG ACGAGCCGAG CGCAACCCAG CCGCTGATGT ATCAGAAAAT CAAAAAACAT
CCGACGCCGC GCAAAATCTA TGCTGACAAG CTGGAGCAGG AAAAAGTGGC GACGCTGGAA
GATGCCACCG AGATGGTTAA CCTGTACCGC GATGCGCTGG ATGCTGGCGA TTGCGTAGTG
GCAGAGTGGC GTCCGATGAA CATGCACTCT TTCACCTGGT CGCCGTACCT CAACCACGAA
TGGGACGAAG AGTACCCGAA TAAAGTTGAG ATGAAGCGCC TGCAGGAGCT GGCGAAACGC
ATCAGCACGG TGCCGGAAGC GGTTGAAATG CAGTCTCGCG TTGCCAAAAT TTATGGCGAT
CGCCAGGCGA TGGCTGCCGG TGAGAAACTG TTCGACTGGG GCGGCGCGGA AAACCTCGCT
TACGCCACGC TGGTTGACGA AGGCATTCCG GTTCGCCTGT CGGGTGAAGA CTCCGGTCGC
GGTACCTTCT TCCACCGCCA CGCGGTGATC CACAACCAGT CTAACGGTTC CACTTACACG
CCGCTGCAAC ACATCCATAA CGGCCAGGGC GCGTTCCGTG TCTGGGACTC CGTACTTTCT
GAAGAAGCAG TGCTGGCGTT TGAATACGGT TATGCCACCG CAGAACCACG CACTCTGACT
ATCTGGGAAG CTCAGTTCGG TGACTTCGCC AACGGTGCGC AGGTGGTTAT CGACCAGTTC
ATCTCCTCTG GCGAACAGAA ATGGGGCCGG ATGTGTGGTC TGGTGATGTT GCTGCCGCAC
GGTTACGAAG GGCAGGGGCC GGAGCACTCC TCCGCGCGTC TGGAACGTTA TCTGCAACTT
TGTGCTGAGC AAAACATGCA GGTGTGCGTA CCGTCTACCC CGGCACAGGT TTACCACATG
CTGCGTCGTC AGGCGCTGCG CGGGATGCGT CGTCCGCTGG TCGTGATGTC GCCGAAATCC
CTGCTGCGTC ATCCGCTGGC GGTTTCCAGC CTCGAAGAAC TGGCGAACGG CACCTTCCTG
CCAGCCATCG GTGAAATCGA CGAGCTTGAT CCGAAGGGCG TGAAGCGCGT AGTGATGTGT
TCTGGTAAGG TTTATTACGA CCTGCTGGAA CAGCGTCGTA AGAACAATCA ACACGATGTC
GCCATTGTGC GTATCGAGCA ACTCTACCCG TTCCCGCATA AAGCGATGCA GGAAGTGTTG
CAGCAGTTTG CTCACGTCAA GGATTTTGTC TGGTGCCAGG AAGAGCCGCT CAATCAGGGC
GCATGGTACT GCAGCCAGCA TCATTTCCGT GAAGTGATTC CGTTTGGGGC TTCTCTGCGT
TATGCAGGCC GCCCGGCCTC CGCCTCTCCG GCGGTAGGGT ATATGTCCGT TCACCAGAAA
CAGCAACAAG ATCTGGTTAA TGACGCGCTG AACGTCGAAT AA
 
Protein sequence
MQNSALKAWL DSSYLSGANQ SWIEQLYEDF LTDPDSVDAN WRSTFQQLPG TGVKPDQFHS 
QTREYFRRLA KDASRYSSTI SDPDTNVKQV KVLQLINAYR FRGHQHANLD PLGLWQQDKV
ADLDPSFHDL TEADFQETFN VGSFASGKET MKLGELLEAL KQTYCGPIGA EYMHITSTEE
KRWIQQRIES GRATFNSEEK KRFLSELTAA EGLERYLGAK FPGAKRFSLE GGDALIPMLK
EMIRHAGNSG TREVVLGMAH RGRLNVLVNV LGKKPQDLFD EFAGKHKEHL GTGDVKYHMG
FSSDFQTDGG LVHLALAFNP SHLEIVSPVV IGSVRARLDR LDEPSSNKVL PITIHGDAAV
TGQGVVQETL NMSKARGYEV GGTVRIVINN QVGFTTSNPL DARSTPYCTD IGKMVQAPIF
HVNADDPEAV AFVTRLALDF RNTFKRDVFI DLVCYRRHGH NEADEPSATQ PLMYQKIKKH
PTPRKIYADK LEQEKVATLE DATEMVNLYR DALDAGDCVV AEWRPMNMHS FTWSPYLNHE
WDEEYPNKVE MKRLQELAKR ISTVPEAVEM QSRVAKIYGD RQAMAAGEKL FDWGGAENLA
YATLVDEGIP VRLSGEDSGR GTFFHRHAVI HNQSNGSTYT PLQHIHNGQG AFRVWDSVLS
EEAVLAFEYG YATAEPRTLT IWEAQFGDFA NGAQVVIDQF ISSGEQKWGR MCGLVMLLPH
GYEGQGPEHS SARLERYLQL CAEQNMQVCV PSTPAQVYHM LRRQALRGMR RPLVVMSPKS
LLRHPLAVSS LEELANGTFL PAIGEIDELD PKGVKRVVMC SGKVYYDLLE QRRKNNQHDV
AIVRIEQLYP FPHKAMQEVL QQFAHVKDFV WCQEEPLNQG AWYCSQHHFR EVIPFGASLR
YAGRPASASP AVGYMSVHQK QQQDLVNDAL NVE