Gene Moth_2138 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_2138 
Symbol 
ID3833138 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp2236695 
End bp2238569 
Gene Length1875 bp 
Protein Length624 aa 
Translation table11 
GC content61% 
IMG OID637830063 
Producthypothetical protein 
Protein accessionYP_430973 
Protein GI83590964 
COG category[C] Energy production and conversion 
COG ID[COG1249] Pyruvate/2-oxoglutarate dehydrogenase complex, dihydrolipoamide dehydrogenase (E3) component, and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones33 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGCAGGGTA AAAAGAGGTT ATTCTTTTCC CTGGGTACAG TTATACTGTT AGTTGCCCTG 
GCGGGCTGGC GTGTCTGGGC CTGGCGGGCC ACAACATCTT CCGGGCCGGA CCCGTCCCAA
TTCCCGCCGG CGCCAGCGCC GGCGGCCGGG GCCAGCTACG ACGTCCTGGT CGTTGGCGGT
CAGCCGGAAG GGGTGGCGGC GGCCATCGCC GCCGCCCGCC AGGGGGCGAA AGTCCTCCTG
GTGGAGAAAC GTGACGGCCT GGGCGGTCTC TTTACCTACG GCTGGCTGAA CTTTATTGAT
ATGAACTACG GTCCCCATTA TGAACTTTTG ACCCGGGGGA CCTTCCAGGA GTTTTACCGC
CGGGTCCACG GCAGCGTTTT TGATGTGGCC GAGGCTAAAA AGGTCCTGGC GGATATGGTC
GGCCGCTACC CGAACCTGGC CTTAAGCCTC AATACCTCTT TTAAGGAACC TATTCTCGAG
GACAACAAGC TGGTAGGCAT CAAGGCCGTC AAAGACGGCC GGGAGCTGCC CTTTTACGCC
AGCCGGGTCA TTGACGCCAC CCAGAACGCC GATGTCGCCG CCGCCGCGGG CGTGCCCTAC
ACCGTGGGCG CCGAGGATAT TGGGGAAAAG GACCGGCGCC AGGCGGTAAC CCTGGTCTTT
CGCCTGGGCG GTGTGGACTG GCAGGCCCTG GCGAGGGCCG TGGGCAGCCA GATCAAGGAC
GCTAAGATTT CCGACCGGGC GGCTTGGGGA TTTGGCAGTA TCGCCAGAGG TTACCAGCCA
TCCACCCCCC GGTTGCGCCT GCGGGGGTTT AATATTGCCC GCCAGGACGA CGGCAGCGTC
TTTATCAACG CCCTGCAGAT CTTTGGCGTT GACGGCTTGA GCGCTGCTTC CCGGGAAGAG
GCTATAAAGC TGGCCCAAAG GGAACTGCCG GCTATTACCG ATTTTTTACG GTCCCACATG
CCCGGTTTTG CCGGCGCCCG GCTCCTGGGC GCGGCGCCGG AACTCTATAT CCGGGAAACC
CGGCATATCA AGGCCCTTTA CCAGCTGGAC TTGAACGACG TCCTTTTTAA CCGTTACTTT
CCCGATGCCA TTGCCCTGGG CTCCTACCCG GTGGACGTCC AGGCCACCTC GCCTGAGGAT
ACGGGTTATG TCTACGGCCG GCCGGAGGTC TACAGCATTC CCTTCCGCTC CCTGGTGCCG
GAGAAAATCG ATAACCTCCT GGTGGTGGGG CGTTCGGCCG GCTACACCCA CCTGGCTGCC
GGCAGCGCCC GGGTGGTACC CATTGGCATG GCTACCGGCG ACGCCGCCGG GGTGGCGGCC
GTTTATTCTC TGCAGGTAAA TAAGAACTTT CGGGAACTGG CGGCCAGCCC CCGGGACATT
AAGGCCATCC AGGACAAACT GGTGAAGATG GGGGCCTACC TCAAGGATTA TCATATAAAG
AACCCCCTGG AGAATCACTG GGCCTTTGAA GGTTTAAAAT TTGTCAACCA CTGGGGCCTG
ATTGTCGCCG GTTATAATAA CGACTGGAAG CTGGACACCC CCATCAGCCG TATCAGTTTT
TATTATATGA CGGCCAATGC CTTAAAGAGG GCGGCCGGCC GGGCCGACCT GGTGGCAGCC
AGGGCGGAGG TTTTAAAACC CTACCTGGAA GGCGGCAACT TGAACCGGGG TGACGCAGCC
AAACTTCTCT TGACCTACCT GGGGGTAGAC GCCAGCACTC TGGACCCCGG AGCAGCCGTG
GCCATGGCTG GAGAAAAGGG ACTTCTGCCC CTGGAGCATA CGGGCAGCGA TCCGGCCGGG
GCAGTTACCG GAGCCGAAGC TTACTACGCT ACGGAAAGAT TATGCGCCCT GCTGGCCAAG
GGAAGCTCCA GGTAA
 
Protein sequence
MQGKKRLFFS LGTVILLVAL AGWRVWAWRA TTSSGPDPSQ FPPAPAPAAG ASYDVLVVGG 
QPEGVAAAIA AARQGAKVLL VEKRDGLGGL FTYGWLNFID MNYGPHYELL TRGTFQEFYR
RVHGSVFDVA EAKKVLADMV GRYPNLALSL NTSFKEPILE DNKLVGIKAV KDGRELPFYA
SRVIDATQNA DVAAAAGVPY TVGAEDIGEK DRRQAVTLVF RLGGVDWQAL ARAVGSQIKD
AKISDRAAWG FGSIARGYQP STPRLRLRGF NIARQDDGSV FINALQIFGV DGLSAASREE
AIKLAQRELP AITDFLRSHM PGFAGARLLG AAPELYIRET RHIKALYQLD LNDVLFNRYF
PDAIALGSYP VDVQATSPED TGYVYGRPEV YSIPFRSLVP EKIDNLLVVG RSAGYTHLAA
GSARVVPIGM ATGDAAGVAA VYSLQVNKNF RELAASPRDI KAIQDKLVKM GAYLKDYHIK
NPLENHWAFE GLKFVNHWGL IVAGYNNDWK LDTPISRISF YYMTANALKR AAGRADLVAA
RAEVLKPYLE GGNLNRGDAA KLLLTYLGVD ASTLDPGAAV AMAGEKGLLP LEHTGSDPAG
AVTGAEAYYA TERLCALLAK GSSR