Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_2138 |
Symbol | |
ID | 3833138 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | - |
Start bp | 2236695 |
End bp | 2238569 |
Gene Length | 1875 bp |
Protein Length | 624 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 637830063 |
Product | hypothetical protein |
Protein accession | YP_430973 |
Protein GI | 83590964 |
COG category | [C] Energy production and conversion |
COG ID | [COG1249] Pyruvate/2-oxoglutarate dehydrogenase complex, dihydrolipoamide dehydrogenase (E3) component, and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 33 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGCAGGGTA AAAAGAGGTT ATTCTTTTCC CTGGGTACAG TTATACTGTT AGTTGCCCTG GCGGGCTGGC GTGTCTGGGC CTGGCGGGCC ACAACATCTT CCGGGCCGGA CCCGTCCCAA TTCCCGCCGG CGCCAGCGCC GGCGGCCGGG GCCAGCTACG ACGTCCTGGT CGTTGGCGGT CAGCCGGAAG GGGTGGCGGC GGCCATCGCC GCCGCCCGCC AGGGGGCGAA AGTCCTCCTG GTGGAGAAAC GTGACGGCCT GGGCGGTCTC TTTACCTACG GCTGGCTGAA CTTTATTGAT ATGAACTACG GTCCCCATTA TGAACTTTTG ACCCGGGGGA CCTTCCAGGA GTTTTACCGC CGGGTCCACG GCAGCGTTTT TGATGTGGCC GAGGCTAAAA AGGTCCTGGC GGATATGGTC GGCCGCTACC CGAACCTGGC CTTAAGCCTC AATACCTCTT TTAAGGAACC TATTCTCGAG GACAACAAGC TGGTAGGCAT CAAGGCCGTC AAAGACGGCC GGGAGCTGCC CTTTTACGCC AGCCGGGTCA TTGACGCCAC CCAGAACGCC GATGTCGCCG CCGCCGCGGG CGTGCCCTAC ACCGTGGGCG CCGAGGATAT TGGGGAAAAG GACCGGCGCC AGGCGGTAAC CCTGGTCTTT CGCCTGGGCG GTGTGGACTG GCAGGCCCTG GCGAGGGCCG TGGGCAGCCA GATCAAGGAC GCTAAGATTT CCGACCGGGC GGCTTGGGGA TTTGGCAGTA TCGCCAGAGG TTACCAGCCA TCCACCCCCC GGTTGCGCCT GCGGGGGTTT AATATTGCCC GCCAGGACGA CGGCAGCGTC TTTATCAACG CCCTGCAGAT CTTTGGCGTT GACGGCTTGA GCGCTGCTTC CCGGGAAGAG GCTATAAAGC TGGCCCAAAG GGAACTGCCG GCTATTACCG ATTTTTTACG GTCCCACATG CCCGGTTTTG CCGGCGCCCG GCTCCTGGGC GCGGCGCCGG AACTCTATAT CCGGGAAACC CGGCATATCA AGGCCCTTTA CCAGCTGGAC TTGAACGACG TCCTTTTTAA CCGTTACTTT CCCGATGCCA TTGCCCTGGG CTCCTACCCG GTGGACGTCC AGGCCACCTC GCCTGAGGAT ACGGGTTATG TCTACGGCCG GCCGGAGGTC TACAGCATTC CCTTCCGCTC CCTGGTGCCG GAGAAAATCG ATAACCTCCT GGTGGTGGGG CGTTCGGCCG GCTACACCCA CCTGGCTGCC GGCAGCGCCC GGGTGGTACC CATTGGCATG GCTACCGGCG ACGCCGCCGG GGTGGCGGCC GTTTATTCTC TGCAGGTAAA TAAGAACTTT CGGGAACTGG CGGCCAGCCC CCGGGACATT AAGGCCATCC AGGACAAACT GGTGAAGATG GGGGCCTACC TCAAGGATTA TCATATAAAG AACCCCCTGG AGAATCACTG GGCCTTTGAA GGTTTAAAAT TTGTCAACCA CTGGGGCCTG ATTGTCGCCG GTTATAATAA CGACTGGAAG CTGGACACCC CCATCAGCCG TATCAGTTTT TATTATATGA CGGCCAATGC CTTAAAGAGG GCGGCCGGCC GGGCCGACCT GGTGGCAGCC AGGGCGGAGG TTTTAAAACC CTACCTGGAA GGCGGCAACT TGAACCGGGG TGACGCAGCC AAACTTCTCT TGACCTACCT GGGGGTAGAC GCCAGCACTC TGGACCCCGG AGCAGCCGTG GCCATGGCTG GAGAAAAGGG ACTTCTGCCC CTGGAGCATA CGGGCAGCGA TCCGGCCGGG GCAGTTACCG GAGCCGAAGC TTACTACGCT ACGGAAAGAT TATGCGCCCT GCTGGCCAAG GGAAGCTCCA GGTAA
|
Protein sequence | MQGKKRLFFS LGTVILLVAL AGWRVWAWRA TTSSGPDPSQ FPPAPAPAAG ASYDVLVVGG QPEGVAAAIA AARQGAKVLL VEKRDGLGGL FTYGWLNFID MNYGPHYELL TRGTFQEFYR RVHGSVFDVA EAKKVLADMV GRYPNLALSL NTSFKEPILE DNKLVGIKAV KDGRELPFYA SRVIDATQNA DVAAAAGVPY TVGAEDIGEK DRRQAVTLVF RLGGVDWQAL ARAVGSQIKD AKISDRAAWG FGSIARGYQP STPRLRLRGF NIARQDDGSV FINALQIFGV DGLSAASREE AIKLAQRELP AITDFLRSHM PGFAGARLLG AAPELYIRET RHIKALYQLD LNDVLFNRYF PDAIALGSYP VDVQATSPED TGYVYGRPEV YSIPFRSLVP EKIDNLLVVG RSAGYTHLAA GSARVVPIGM ATGDAAGVAA VYSLQVNKNF RELAASPRDI KAIQDKLVKM GAYLKDYHIK NPLENHWAFE GLKFVNHWGL IVAGYNNDWK LDTPISRISF YYMTANALKR AAGRADLVAA RAEVLKPYLE GGNLNRGDAA KLLLTYLGVD ASTLDPGAAV AMAGEKGLLP LEHTGSDPAG AVTGAEAYYA TERLCALLAK GSSR
|
| |