Gene Information |
Locus tag | ECH74115_0818 |
Symbol | sucA |
ID | 6969912 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 838622 |
End bp | 841423 |
Gene Length | 2802 bp |
Protein Length | 933 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 643384843 |
Product | 2-oxoglutarate dehydrogenase E1 component |
Protein accession | YP_002269349 |
Protein GI | 209399395 |
COG category | [C] Energy production and conversion |
COG ID | [COG0567] 2-oxoglutarate dehydrogenase complex, dehydrogenase (E1) component, and related enzymes |
TIGRFAM ID | [TIGR00239] 2-oxoglutarate dehydrogenase, E1 component |
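
The coordinate and length fields above can be cross-checked against one another. The following is a minimal sketch, not part of the original record. It assumes the start and end positions are 1-based and inclusive, and that the CDS is unspliced and ends in a single stop codon. The variable names are illustrative.

```python
# Values copied from the Gene Information table.
start_bp = 838622
end_bp = 841423
reported_gene_length_bp = 2802
reported_protein_length_aa = 933

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: one contiguous CDS whose final codon is a stop codon,
# so the protein has one residue fewer than the number of codons.
codon_count = gene_length_bp // 3
expected_protein_aa = codon_count - 1

print("gene length (bp):", gene_length_bp)
print("expected protein length (aa):", expected_protein_aa)
print("matches record:",
      gene_length_bp == reported_gene_length_bp
      and expected_protein_aa == reported_protein_length_aa)
```

Under these assumptions the computed lengths agree with the reported 2802 bp and 933 aa.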
|
|
Plasmid Coverage information |
Number of covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.919234 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Number of covering fosmid clones | 61 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAGAACA GCGCTTTGAA AGCCTGGTTG GACTCTTCTT ACCTCTCTGG CGCAAACCAG AGCTGGATAG AACAGCTCTA TGAAGACTTC TTAACCGATC CTGACTCGGT TGACGCTAAC TGGCGTTCGA CGTTCCAGCA GTTACCTGGT ACGGGAGTCA AACCGGATCA ATTCCACTCT CAAACGCGTG AATATTTCCG CCGCCTGGCG AAAGACGCTT CACGTTACTC TTCAACGATC TCCGACCCTG ACACCAATGT GAAGCAGGTA AAAGTCCTGC AGCTCATTAA CGCATACCGC TTCCGTGGTC ACCAGCATGC GAATCTCGAT CCGCTGGGAC TGTGGCAGCA AGATAAAGTG GCCGATCTGG ATCCGTCTTT CCACGATCTG ACCGAAGCAG ACTTCCAGGA GACCTTCAAC GTCGGTTCAT TTGCCAGCGG CAAAGAAACC ATGAAACTCG GCGAGCTGCT GGAAGCCCTC AAGCAAACCT ACTGCGGCCC GATTGGTGCC GAGTATATGC ACATTACCAG CACCGAAGAA AAACGCTGGA TCCAACAGCG TATCGAGTCT GGTCGCGCGA CTTTCAATAG CGAAGAGAAA AAACGCTTCC TAAGCGAACT GACCGCCGCT GAAGGCCTTG AACGTTACCT CGGTGCAAAA TTCCCTGGCG CAAAACGCTT CTCGCTGGAA GGCGGTGACG CGTTAATCCC GATGCTTAAA GAGATGATTC GCCACGCTGG CAACAGCGGC ACCCGCGAAG TGGTTCTCGG GATGGCGCAC CGTGGTCGTC TGAACGTGCT GGTGAACGTG CTGGGTAAAA AACCGCAAGA CTTGTTCGAC GAGTTTGCCG GTAAACATAA AGAACACCTC GGCACGGGTG ACGTGAAATA CCACATGGGC TTCTCGTCTG ACTTCCAGAC CGATGGCGGC CTGGTACATC TGGCGCTGGC GTTTAACCCG TCTCACCTTG AGATTGTCAG CCCGGTCGTT ATCGGTTCTG TTCGTGCCCG TCTGGACAGA CTTGATGAGC CGAGCAGCAA CAAAGTGCTG CCAATCACCA TTCACGGTGA CGCCGCAGTG ACCGGGCAGG GCGTGGTTCA GGAAACTCTG AACATGTCGA AAGCGCGTGG TTATGAAGTT GGCGGTACGG TACGTATCGT TATCAACAAC CAGGTTGGCT TCACCACCTC TAACCCGCTG GATGCCCGTT CAACGCCGTA CTGTACTGAT ATCGGTAAGA TGGTTCAGGC ACCGATTTTT CACGTTAACG CGGATGATCC GGAAGCCGTT GCCTTTGTGA CCCGTCTGGC GCTCGATTTC CGTAACACCT TTAAACGTGA TGTCTTCATC GACCTGGTGT GCTACCGCCG TCACGGCCAC AACGAAGCCG ACGAGCCGAG CGCAACCCAG CCGCTGATGT ATCAGAAAAT CAAAAAACAT CCGACGCCGC GCAAAATCTA TGCTGACAAG CTGGAGCAGG AAAAAGTGGC GACGCTGGAA GATGCCACCG AGATGGTTAA CCTGTACCGC GATGCGCTGG ATGCTGGCGA TTGCGTAGTG GCAGAGTGGC GTCCGATGAA CATGCACTCT TTCACCTGGT CGCCGTACCT CAACCACGAA TGGGACGAAG AGTACCCGAA TAAAGTTGAG ATGAAGCGCC TGCAGGAGCT GGCGAAACGC ATCAGCACGG TGCCGGAAGC GGTTGAAATG CAGTCTCGCG TTGCCAAAAT TTATGGCGAT CGCCAGGCGA TGGCTGCCGG TGAGAAACTG TTCGACTGGG GCGGCGCGGA AAACCTCGCT 
TACGCCACGC TGGTTGACGA AGGCATTCCG GTTCGCCTGT CGGGTGAAGA CTCCGGTCGC GGTACCTTCT TCCACCGCCA CGCGGTGATC CACAACCAGT CTAACGGTTC CACTTACACG CCGCTGCAAC ACATCCATAA CGGCCAGGGC GCGTTCCGTG TCTGGGACTC CGTACTTTCT GAAGAAGCAG TGCTGGCGTT TGAATACGGT TATGCCACCG CAGAACCACG CACTCTGACT ATCTGGGAAG CTCAGTTCGG TGACTTCGCC AACGGTGCGC AGGTGGTTAT CGACCAGTTC ATCTCCTCTG GCGAACAGAA ATGGGGCCGG ATGTGTGGTC TGGTGATGTT GCTGCCGCAC GGTTACGAAG GGCAGGGGCC GGAGCACTCC TCCGCGCGTC TGGAACGTTA TCTGCAACTT TGTGCTGAGC AAAACATGCA GGTGTGCGTA CCGTCTACCC CGGCACAGGT TTACCACATG CTGCGTCGTC AGGCGCTGCG CGGGATGCGT CGTCCGCTGG TCGTGATGTC GCCGAAATCC CTGCTGCGTC ATCCGCTGGC GGTTTCCAGC CTCGAAGAAC TGGCGAACGG CACCTTCCTG CCAGCCATCG GTGAAATCGA CGAGCTTGAT CCGAAGGGCG TGAAGCGCGT AGTGATGTGT TCTGGTAAGG TTTATTACGA CCTGCTGGAA CAGCGTCGTA AGAACAATCA ACACGATGTC GCCATTGTGC GTATCGAGCA ACTCTACCCG TTCCCGCATA AAGCGATGCA GGAAGTGTTG CAGCAGTTTG CTCACGTCAA GGATTTTGTC TGGTGCCAGG AAGAGCCGCT CAATCAGGGC GCATGGTACT GCAGCCAGCA TCATTTCCGT GAAGTGATTC CGTTTGGGGC TTCTCTGCGT TATGCAGGCC GCCCGGCCTC CGCCTCTCCG GCGGTAGGGT ATATGTCCGT TCACCAGAAA CAGCAACAAG ATCTGGTTAA TGACGCGCTG AACGTCGAAT AA
|
Protein sequence | MQNSALKAWL DSSYLSGANQ SWIEQLYEDF LTDPDSVDAN WRSTFQQLPG TGVKPDQFHS QTREYFRRLA KDASRYSSTI SDPDTNVKQV KVLQLINAYR FRGHQHANLD PLGLWQQDKV ADLDPSFHDL TEADFQETFN VGSFASGKET MKLGELLEAL KQTYCGPIGA EYMHITSTEE KRWIQQRIES GRATFNSEEK KRFLSELTAA EGLERYLGAK FPGAKRFSLE GGDALIPMLK EMIRHAGNSG TREVVLGMAH RGRLNVLVNV LGKKPQDLFD EFAGKHKEHL GTGDVKYHMG FSSDFQTDGG LVHLALAFNP SHLEIVSPVV IGSVRARLDR LDEPSSNKVL PITIHGDAAV TGQGVVQETL NMSKARGYEV GGTVRIVINN QVGFTTSNPL DARSTPYCTD IGKMVQAPIF HVNADDPEAV AFVTRLALDF RNTFKRDVFI DLVCYRRHGH NEADEPSATQ PLMYQKIKKH PTPRKIYADK LEQEKVATLE DATEMVNLYR DALDAGDCVV AEWRPMNMHS FTWSPYLNHE WDEEYPNKVE MKRLQELAKR ISTVPEAVEM QSRVAKIYGD RQAMAAGEKL FDWGGAENLA YATLVDEGIP VRLSGEDSGR GTFFHRHAVI HNQSNGSTYT PLQHIHNGQG AFRVWDSVLS EEAVLAFEYG YATAEPRTLT IWEAQFGDFA NGAQVVIDQF ISSGEQKWGR MCGLVMLLPH GYEGQGPEHS SARLERYLQL CAEQNMQVCV PSTPAQVYHM LRRQALRGMR RPLVVMSPKS LLRHPLAVSS LEELANGTFL PAIGEIDELD PKGVKRVVMC SGKVYYDLLE QRRKNNQHDV AIVRIEQLYP FPHKAMQEVL QQFAHVKDFV WCQEEPLNQG AWYCSQHHFR EVIPFGASLR YAGRPASASP AVGYMSVHQK QQQDLVNDAL NVE
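
The sequences above are printed in space-separated blocks of ten characters. The sketch below shows one way to strip that grouping and compute a GC percentage comparable to the GC content field. The helper names `clean_sequence` and `gc_percent` are hypothetical, and the demo uses only the first two blocks of the gene sequence, so its result is not the whole-gene figure.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace (block separators, line breaks) and uppercase."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Demo on the first two ten-base blocks of the gene sequence only.
# Paste the full "Gene sequence" field here to check the whole gene.
raw_fragment = "ATGCAGAACA GCGCTTTGAA"
fragment = clean_sequence(raw_fragment)

print("length:", len(fragment))
print("starts with ATG:", fragment.startswith("ATG"))
print("GC percent of fragment:", gc_percent(fragment))
```

Running the same functions on the full gene sequence should give a value close to the reported 56%, assuming that field was computed over the CDS alone.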
|
| |