Gene Moth_2259 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_2259 
Symbol 
ID3830754 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp2364747 
End bp2366405 
Gene Length1659 bp 
Protein Length552 aa 
Translation table11 
GC content60% 
IMG OID637830179 
Productdihydroxyacid dehydratase 
Protein accessionYP_431089 
Protein GI83591080 
COG category[E] Amino acid transport and metabolism
[G] Carbohydrate transport and metabolism 
COG ID[COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase 
TIGRFAM ID[TIGR00110] dihydroxy-acid dehydratase 


Plasmid Coverage information

Num covering plasmid clones36 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCGCAGCG ATGCTATGAA AACGGGCCTG GCCCGGGCTC CCCACAGATC GTTACTTAAA 
GCCATGGGCC TGACCGAGAC GGAAATCGAG CGGCCGATTA TAGGTGTTGT CAATGCCCAT
AATGAACTCA TTCCCGGCCA TATACATTTA AATAACCTGG TGGAAGCCGT AAAAGCAGGG
GTGCGCCTGG CCGGCGGTAC GCCCCTGGAG TTTCCCACTA TCGGTGTCTG TGATGGCCTG
GCCATGAACC ATGTGGGGAT GAAGTATTCC CTGGCCAGCC GGGAGCTCAT CGCCGATATG
ATTGAAGTAA TGGCTATGGC CCATCCCTTT GACGCCCTGG TATTTATCCC TAACTGCGAT
AAGATCGTCC CCGGGATGCT TATGGCAGCA GCCCGCCTGA ACCTCCCTGC CATTTTTATC
AGCGGCGGTC CCATGCTGGC CGGCCGTTAC CAGGGCCGGG ATGTTTCCCT GAGTACTATG
TTCGAGGCTG TAGGGGCTGT CCAGGCCGGG AAGATGACAG AACAGGAACT GGCCGCCCTG
GAGGACTGCG CCTGCCCGGG CTGTGGTTCC TGTGCGGGTA TGTTTACCGC CAACACCATG
AACTGCATGG TTGAGGCCTT AGGGATGGGC CTGCCGGGTA ACGGTACTAC CCCTGCAGTG
AGCGGCTCCC GGGTACGTTT GGCTAAAGAA GCCGGTATGC AGGTGATGAA ATTACTCCAG
GAAAATATCC GGCCCCTGGA TATTATGACG GCTACAGCCT TCCGCAATGC CGTGGCCGTG
GATATGGCCC TGGGTGGTTC GACCAATACC TGCCTGCACC TGCCGGCCAT AGCCCATGAA
GCCGGCGTAA AACTTGACCT GAATACTTTC AACGAAATCA ATCGGCGGAC GCCCCAGATC
TGTAAGCTCA GCCCGGCCGG CAGCCAGCAC ATCCAGGACC TGGATGAGGC CGGCGGTATC
CCGGCGGTGA TGAATGAGCT CTACCGTCAT GGCCTGATTG ACGGCAGCGC CCTTACTGTG
ACCGGACGGA CAGTGGCTGA TAACGTCAGC GGTCGGGTGG TAAGCCGGCG CGAGGTTATC
CGGCCTGTGG AAGACCCCTA CAGCAGGGAA GGTGGCCTGG CCGTATTGTA TGGCAACCTG
GCTCCTGAGG GTGCCGTTGT AAAGAAGGGC GCCGTGCTGC CGGAGATGAT GCGGCATGAA
GGGCCGGCGC GGGTATTTAA CAGCGAGGAA GAGGCTTTTG CCGCCATTAT GGGGAAGCAG
ATTAAACCCG GGGATGTGGT GGTAATCCGC TACGAAGGCC CTCGTGGCGG TCCGGGTATG
CAGGAAATGC TCAGTCCCAC GGCAGCCCTG GCCGGTATGG GCTTGGACAG CTCCGTGGCT
CTGATCACTG ACGGCCGTTT CTCCGGTGCC AGCCGTGGCG CCTCTATCGG TCACGTCTCG
CCGGAAGCAG CGGCCGGGGG GCTCATCGCC CTGGTGGAAG AAGGAGATAT CATCGCCATC
GATATTGAAG CCGGCAAGCT GGAACTTAAG GTGCCGGAAG AAGAAATTGC CCGCCGCCGC
CAGAATTGGC AGGCGCCGCC GCCGAAGATC ACCGGCGGTT ACCTGGGCCG CTACGCGCGC
ATGGTTACTT CCGGAGCCAG GGGCGCGGTG TTGGAGTAA
 
Protein sequence
MRSDAMKTGL ARAPHRSLLK AMGLTETEIE RPIIGVVNAH NELIPGHIHL NNLVEAVKAG 
VRLAGGTPLE FPTIGVCDGL AMNHVGMKYS LASRELIADM IEVMAMAHPF DALVFIPNCD
KIVPGMLMAA ARLNLPAIFI SGGPMLAGRY QGRDVSLSTM FEAVGAVQAG KMTEQELAAL
EDCACPGCGS CAGMFTANTM NCMVEALGMG LPGNGTTPAV SGSRVRLAKE AGMQVMKLLQ
ENIRPLDIMT ATAFRNAVAV DMALGGSTNT CLHLPAIAHE AGVKLDLNTF NEINRRTPQI
CKLSPAGSQH IQDLDEAGGI PAVMNELYRH GLIDGSALTV TGRTVADNVS GRVVSRREVI
RPVEDPYSRE GGLAVLYGNL APEGAVVKKG AVLPEMMRHE GPARVFNSEE EAFAAIMGKQ
IKPGDVVVIR YEGPRGGPGM QEMLSPTAAL AGMGLDSSVA LITDGRFSGA SRGASIGHVS
PEAAAGGLIA LVEEGDIIAI DIEAGKLELK VPEEEIARRR QNWQAPPPKI TGGYLGRYAR
MVTSGARGAV LE