Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_0738 |
Symbol | sucA |
ID | 6145269 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 743431 |
End bp | 746232 |
Gene Length | 2802 bp |
Protein Length | 933 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 641615627 |
Product | 2-oxoglutarate dehydrogenase E1 component |
Protein accession | YP_001742826 |
Protein GI | 170681528 |
COG category | [C] Energy production and conversion |
COG ID | [COG0567] 2-oxoglutarate dehydrogenase complex, dehydrogenase (E1) component, and related enzymes |
TIGRFAM ID | [TIGR00239] 2-oxoglutarate dehydrogenase, E1 component |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 49 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAGAACA GCGCTTTGAA AGCCTGGTTG GACTCTTCTT ACCTCTCTGG CGCAAACCAG AGCTGGATAG AACAGCTCTA TGAAGACTTC TTAACCGATC CTGACTCGGT TGACGCTAAC TGGCGTTCGA CGTTCCAGCA GTTACCTGGT ACGGGAGTCA AACCGGATCA ATTCCACTCT CAAACGCGTG AATATTTCCG CCGCCTGGCG AAAGACGCTT CACGTTACTC TTCAACGATC TCCGACCCTG ACACCAATGT GAAGCAGGTT AAAGTCCTGC AGCTCATTAA CGCATACCGC TTCCGTGGTC ACCAGCATGC GAATCTCGAT CCGCTGGGAC TGTGGCAGCA AGATAAAGTG GCCGATCTGG ATCCGTCTTT CCACGATCTG ACCGAAGCAG ACTTCCAGGA GACCTTCAAC GTCGGTTCAT TTGCCAGCGG CAAAGAAACC ATGAAACTCG GCGAGCTGCT GGAAGCCCTC AAGCAAACCT ACTGCGGCCC GATTGGTGCC GAGTATATGC ACATCACCAG CACCGAAGAA AAACGCTGGA TCCAACAGCG TATCGAGTCT GGTCGCGCGA CTTTCAATAG CGAAGAGAAA AAACGCTTCT TAAGCGAACT GACCGCCGCT GAAGGCCTTG AACGTTACCT CGGCGCAAAA TTCCCTGGCG CAAAACGCTT CTCGCTGGAA GGCGGTGACG CGTTAATCCC GATGCTTAAA GAGATGATCC GCCACGCTGG CAACAGCGGC ACCCGCGAAG TGGTTCTCGG GATGGCGCAC CGTGGTCGTC TGAACGTGCT GGTGAACGTG CTGGGTAAAA AACCGCAAGA CTTGTTCGAC GAGTTTGCCG GTAAACATAA AGAACACCTC GGCACGGGCG ACGTGAAATA CCACATGGGC TTCTCGTCTG ACTTCCAGAC CGATGGCGGC CTGGTGCACC TGGCGCTGGC GTTTAACCCG TCTCACCTTG AGATTGTCAG CCCGGTAGTT ATCGGTTCTG TTCGTGCCCG TCTGGACAGA CTCGATGAGC CGAGCAGCAA CAAAGTGCTG CCAATCACCA TCCACGGTGA CGCCGCAGTG ACCGGGCAGG GCGTGGTTCA GGAAACCCTG AACATGTCGA AAGCGCGTGG TTATGAAGTT GGCGGTACGG TACGTATCGT TATCAACAAC CAGGTTGGCT TCACCACCTC TAATCCGCTG GATGCTCGTT CTACACCGTA CTGTACTGAT ATCGGTAAGA TGGTTCAGGC ACCGATTTTC CACGTAAACG CGGATGATCC GGAAGCCGTT GCTTTTGTGA CCCGTCTGGC GCTCGATTTC CGTAACACCT TTAAACGTGA TGTCTTCATC GACCTGGTGT GCTACCGCCG TCACGGCCAC AACGAAGCCG ACGAGCCGAG CGCAACCCAG CCGCTGATGT ATCAGAAAAT CAAAAAACAT CCGACGCCGC GCAAAATCTA TGCTGACAAG CTGGAGCAGG AAAAAGTCGC GACGCTGGAA GATGCCACCG AGATGGTTAA CCTGTACCGC GATGCGCTGG ATGCTGGCGA TTGCGTTGTA GCAGAGTGGC GTCCGATGAA CATGCACTCT TTCACCTGGT CGCCGTACCT CAACCACGAA TGGGACGAAG AGTACCCGAA CAAAGTTGAG ATGAAGCGCC TGCAGGAACT GGCTAAACGC ATCAGCACGG TGCCGGAAGC AGTTGAAATG CAGTCTCGCG TTGCCAAGAT TTATGGCGAT CGCCAGGCGA TGGCTGCTGG TGAGAAACTG TTCGACTGGG GCGGCGCGGA AAACCTCGCT TACGCCACGC TGGTTGACGA AGGCATTCCG GTTCGCCTGT CGGGTGAAGA CTCCGGTCGC GGTACCTTCT TCCACCGCCA CGCGGTGATC CACAACCAGT CTAACGGTTC CACTTACACG CCGCTGCAAC ACATCCATAA CGGCCAGGGC GCGTTCCGTG TCTGGGACTC CGTACTGTCT GAAGAAGCCG TACTGGCGTT TGAATACGGT TATGCCACCG CAGAACCACG CACTCTGACT ATCTGGGAAG CGCAATTCGG TGACTTCGCC AACGGTGCTC AGGTGGTTAT CGACCAGTTC ATCTCCTCTG GCGAGCAGAA ATGGGGCCGG ATGTGTGGTC TGGTGATGTT GCTGCCGCAC GGTTACGAAG GGCAGGGGCC GGAGCACTCC TCCGCGCGTC TGGAACGTTA TCTGCAACTT TGCGCTGAGC AAAACATGCA GGTGTGCGTA CCGTCTACCC CGGCACAGGT TTACCACATG CTGCGTCGTC AGGCGCTGCG CGGGATGCGT CGTCCGCTGG TCGTGATGTC GCCGAAATCC CTGCTGCGTC ATCCGCTGGC GGTTTCCAGC CTCGAAGAAC TGGCGAACGG CACCTTCCTG CCAGCCATCG GTGAAATCGA CGAGCTTGAT CCGAAGGGCG TGAAGCGCGT AGTGATGTGT TCTGGTAAGG TTTATTACGA CCTGCTGGAA CAGCGTCGTA AGAACAATCA ACACGATGTC GCCATTGTGC GTATCGAGCA ACTCTACCCG TTCCCGCATA AAGCGATGCA GGAAGTGTTG CAGCAGTTTG CTCACGTCAA GGATTTTGTC TGGTGCCAGG AAGAGCCGCT CAATCAGGGC GCATGGTACT GCAGCCAGCA TCATTTCCGT GAAGTGATTC CGTTTGGGGC TTCTCTGCGT TATGCAGGCC GCCCGGCCTC CGCCTCTCCG GCGGTAGGGT ATATGTCCGT TCACCAGAAA CAGCAACAAG ATCTGGTTAA TGACGCGCTG AACGTCGAAT AA
|
Protein sequence | MQNSALKAWL DSSYLSGANQ SWIEQLYEDF LTDPDSVDAN WRSTFQQLPG TGVKPDQFHS QTREYFRRLA KDASRYSSTI SDPDTNVKQV KVLQLINAYR FRGHQHANLD PLGLWQQDKV ADLDPSFHDL TEADFQETFN VGSFASGKET MKLGELLEAL KQTYCGPIGA EYMHITSTEE KRWIQQRIES GRATFNSEEK KRFLSELTAA EGLERYLGAK FPGAKRFSLE GGDALIPMLK EMIRHAGNSG TREVVLGMAH RGRLNVLVNV LGKKPQDLFD EFAGKHKEHL GTGDVKYHMG FSSDFQTDGG LVHLALAFNP SHLEIVSPVV IGSVRARLDR LDEPSSNKVL PITIHGDAAV TGQGVVQETL NMSKARGYEV GGTVRIVINN QVGFTTSNPL DARSTPYCTD IGKMVQAPIF HVNADDPEAV AFVTRLALDF RNTFKRDVFI DLVCYRRHGH NEADEPSATQ PLMYQKIKKH PTPRKIYADK LEQEKVATLE DATEMVNLYR DALDAGDCVV AEWRPMNMHS FTWSPYLNHE WDEEYPNKVE MKRLQELAKR ISTVPEAVEM QSRVAKIYGD RQAMAAGEKL FDWGGAENLA YATLVDEGIP VRLSGEDSGR GTFFHRHAVI HNQSNGSTYT PLQHIHNGQG AFRVWDSVLS EEAVLAFEYG YATAEPRTLT IWEAQFGDFA NGAQVVIDQF ISSGEQKWGR MCGLVMLLPH GYEGQGPEHS SARLERYLQL CAEQNMQVCV PSTPAQVYHM LRRQALRGMR RPLVVMSPKS LLRHPLAVSS LEELANGTFL PAIGEIDELD PKGVKRVVMC SGKVYYDLLE QRRKNNQHDV AIVRIEQLYP FPHKAMQEVL QQFAHVKDFV WCQEEPLNQG AWYCSQHHFR EVIPFGASLR YAGRPASASP AVGYMSVHQK QQQDLVNDAL NVE
|
| |