Gene Information |
Locus tag | Tmz1t_2544 |
Symbol | sucA |
ID | 7873983 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 2743921 |
End bp | 2746770 |
Gene Length | 2850 bp |
Protein Length | 949 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 643699466 |
Product | 2-oxoglutarate dehydrogenase E1 component |
Protein accession | YP_002889523 |
Protein GI | 237653209 |
COG category | [C] Energy production and conversion |
COG ID | [COG0567] 2-oxoglutarate dehydrogenase complex, dehydrogenase (E1) component, and related enzymes |
TIGRFAM ID | [TIGR00239] 2-oxoglutarate dehydrogenase, E1 component |
|
|
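The coordinate and length fields in the table above can be cross-checked with a little arithmetic. The sketch below is only an illustration: it assumes the Start bp and End bp values are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable names are hypothetical and are not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 2743921
end_bp = 2746770

# Assumption: coordinates are 1-based and inclusive, so the span
# covers end - start + 1 bases.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the CDS is an uninterrupted reading frame (the record
# says the gene is not spliced) whose last codon is a stop codon.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp)      # 2850, matching "Gene Length"
print(protein_length_aa)   # 949, matching "Protein Length"
```

Under those assumptions the computed values agree with the Gene Length (2850 bp) and Protein Length (949 aa) fields.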
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.483047 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATGAAGC AACTCGAACA GACTTCGCAC TTGTTCGGCT CGAACGCGCC GTTCATCGAA GAGCAGTACG AAAACTACCT CGCCGATCCG GCCTCGGTGT CCGCCGAGTG GCGCGAATAT TTCGACAAGC TGCAGGTTCA GGTCGGCGCT GCCGCGCGCG ACGTGCCGCA TGGTCCGGTC ATTGCCGCCT TCGAGCAGAT GGCCAAGCGC GGCCCGGTGC GCACCATCGT CACCGCCGGC GAGGACAAGC AGCAGGTCTC CGTGCTGCAA CTGATCAACG CCTACCGCTT CCTCGGCAAC CGCTGGGCCA ACCTCGATCC GCTCAAGCGC ACCGAGCGCC CGCAGATCGC CGAGCTCGAG CCGTCCTATT ACGGCTTCAC CGAAGCCGAT CTCTCCAAGA GCTTCAACGT CGGCTCCTTC CACGGCTTCA GCACCGAGCG CGCCACCCTG CGCGAGATCC TCGAGGCGCT GCGCCAGACC TACTGCGGCT CGATCGGCGC CGAGTACATG TACATGACCG ACATCGGTCA GAAGCGCTGG ATCCAAAGCC GCCTCGAGAG CCTGCGCGGT ACGCCGAAGT TCTCGGCGGA GATGAAGAAG CGCATCCTCG AGCGCACCAC CGCGGCCGAG ACCCTGGAGC GCTATCTGCA CACCCGCTAC GTCGGCCAGA AGCGCTTCTC GCTCGAAGGC GGCGAGTCGG CCATCGTCGC GATGGACGAA CTGATCCGCG TCGCCGGCAG CCTGGGTGTG CAGGAAAGCG TGATCGGCAT GGCCCACCGC GGCCGCCTCA ACGTGCTCGT CAACACCCTG GGCAAGTCGC CCTCGATGCT GTTCTCGGAG TTCGAGGGCA AGGCCGCGGC CGACCTCACC GCCGGCGACG TGAAGTACCA CATGGGCTTC TCGAGCGACG TGATGACCCC GGGCGGCCCG ATGCACCTGA CGCTCGCGTT CAACCCCTCG CACCTCGAGA TCATCAACCC GGTGGTCGAG GGCTCGGTGT ATGCCCGCCA GGTCCGTCGC GGCGACGCCG ACAAGAAGCA GGTGCTGCCG GTGCTGATCC ACGGCGACGC CGCCGTGGCC GGCCAGGGCG TTAACCAGGA AATGCTCAAC TTCTCGCAGA CTCGCGGCTA CGGCACGGGC GGCACGGTGC ACCTGGTCGT CAACAACCAG ATCGGCTTCA CCACCTCCGA CCCGCGCGAC TACCGTTCCT CGCTTTACTG CACCGACATC TTCAAGATGG TCGAGGCGCC GATCTTCCAC GTCAATGGCG ACGATCCCGA GGCCGTGGCC CTGGTCACCG CGCTGGCGGT CGAATTCCGC CAGGAGTTCA AGAAGGACGT CGTGGTCGAC ATCATCTGCT TCCGCAAGCT CGGCCACAAC GAGCAGGACG AGCCGATGGT GACGCAGCCG CTGATGTACC GCACCATCCA GAAGCACCCC GGCACCCGCA AGCTCTACGC CGAGCGCCTG GTCGCCGAGG GCACCCTGAA GGCCGAGGAG CCCGACCAGA TGATCGCCGA GTACCGCGAG CATCTCGACA AGGGCCAGCT GCTCTACAAC CCGGTGCTCT CCGGCCACAA CCGCCAGTTC GCCGCGGACT GGACGCCCTA CATCAAGCAG CCCTACACCG ACGAGTGCGA CACCACGGTG CCGGTGCAGG AGATCAAGCG CCTGTCCGAG CGCCTGAGTA CCATCCCGGC GAACTTCACG CTGCACTCGC GCGTCAAGAA GATCATCGAC GACCGCGCCG CGATGGGGCG TGGCGAAGCG CCCTTCGACT GGGGCATGGG CGAGAACCTG 
GCCTACGCCA GCCTGCTGGC GCAGGGTTAC GGCGTGCGCC TCTCCGGCGA GGACGTCGGC CGCGGCACCT TCTTCCATCG CCACGCCGTG CTGCACGACC AGAAGCGCGA GCGCTGGGAC GAAGGCATCT ACAAGCCGCT CGACCACATC CAGGACGGCC AGGCCCGCTT CCAGAGCTAC GACTCGGTGT TGTCGGAGGA GGGCGTGCTC GCCTTCGAAT ACGGCTACGC CACCACCGAG CCCAACGAGC TCGTCATCTG GGAAGCCCAG TTCGGCGACT TCGTGAACGG CGCCCAGGTC GTGCTCGACC AGTTCATCAG CTCGGGCGAG GCCAAGTGGG GTCGCCTGTG CGGTCTCACG CTGATGCTGC CGCACGGCTA CGAGGGCCAG GGCCCCGAGC ACTCGTCCGC CCGTCTCGAG CGCTACATGA ACAACGCTGC CGAGCACAAC TGGCAGATCT GCGTGCCGAC CACCCCGGCG CAGATCTTCC ACCTGCTGCG CCGCCAGGCG ATCCGCAAGG TGCGCAAGCC GCTGATCATC ATCACGCCGA AGTCGCTGCT GCGTCACAAG GAGGCGATTT CCTCGATCGA GGAACTGGCC AACGGCAGCT TCCAGACGGT GATCCCCGAG GTCGAGAAGC TCGACGCCAA GAAGGTCAAG CGCGTGGTTC TGTGCCAGGG CAAGATCTAT TACGAGCTCC TCGCCCATCG CCGCGAGAAC AAGATCACCG ACACCGCGCT GGTCCGCATC GAGCAGCTCT ATCCGTTCCC GACCGAGGCC TTCGCCAAGG CGATCGAGCA GTTCCCGAAC GCCAAGGAGA TCGTCTGGGC CCAGGAAGAG CCGCGCAACC AGGGCGCGTG GTACTGGCTC GCCTCGCGCC AGCACCTGGT CAACGTGCTC GGCACCAAGC GCCGTCTGCT GCTGGTGAGC CGCCCGGCCG CCGCCTCGCC CGCGGTCGGC TACTACGCCA AGCACAACGC GCAACAAAAG GCGGTCCTCG AAAACGCCTT CGGTCCGATC CAGGACACCA CGCCGCAGTC CCCGCACTGA
|
Protein sequence | MMKQLEQTSH LFGSNAPFIE EQYENYLADP ASVSAEWREY FDKLQVQVGA AARDVPHGPV IAAFEQMAKR GPVRTIVTAG EDKQQVSVLQ LINAYRFLGN RWANLDPLKR TERPQIAELE PSYYGFTEAD LSKSFNVGSF HGFSTERATL REILEALRQT YCGSIGAEYM YMTDIGQKRW IQSRLESLRG TPKFSAEMKK RILERTTAAE TLERYLHTRY VGQKRFSLEG GESAIVAMDE LIRVAGSLGV QESVIGMAHR GRLNVLVNTL GKSPSMLFSE FEGKAAADLT AGDVKYHMGF SSDVMTPGGP MHLTLAFNPS HLEIINPVVE GSVYARQVRR GDADKKQVLP VLIHGDAAVA GQGVNQEMLN FSQTRGYGTG GTVHLVVNNQ IGFTTSDPRD YRSSLYCTDI FKMVEAPIFH VNGDDPEAVA LVTALAVEFR QEFKKDVVVD IICFRKLGHN EQDEPMVTQP LMYRTIQKHP GTRKLYAERL VAEGTLKAEE PDQMIAEYRE HLDKGQLLYN PVLSGHNRQF AADWTPYIKQ PYTDECDTTV PVQEIKRLSE RLSTIPANFT LHSRVKKIID DRAAMGRGEA PFDWGMGENL AYASLLAQGY GVRLSGEDVG RGTFFHRHAV LHDQKRERWD EGIYKPLDHI QDGQARFQSY DSVLSEEGVL AFEYGYATTE PNELVIWEAQ FGDFVNGAQV VLDQFISSGE AKWGRLCGLT LMLPHGYEGQ GPEHSSARLE RYMNNAAEHN WQICVPTTPA QIFHLLRRQA IRKVRKPLII ITPKSLLRHK EAISSIEELA NGSFQTVIPE VEKLDAKKVK RVVLCQGKIY YELLAHRREN KITDTALVRI EQLYPFPTEA FAKAIEQFPN AKEIVWAQEE PRNQGAWYWL ASRQHLVNVL GTKRRLLLVS RPAAASPAVG YYAKHNAQQK AVLENAFGPI QDTTPQSPH
|
| |
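The GC content field can in principle be recomputed from the Gene sequence field. The helper below is a minimal sketch under stated assumptions: `gc_percent` is a hypothetical name, the sequence is taken to be plain A/C/G/T text, and the spaces that split it into blocks of ten bases are treated as formatting only. How the database rounds its reported figure is not stated in the record.

```python
def gc_percent(sequence: str) -> float:
    """Return the percentage of G and C bases in a nucleotide string.

    Whitespace (such as the gaps between 10-base blocks) is ignored.
    """
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# Example on the first two 10-base blocks of the gene sequence only.
print(gc_percent("ATGATGAAGC AACTCGAACA"))
```

Passing the complete Gene sequence value to the function gives a figure that can be compared against the reported GC content.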