Gene Mthe_0673 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMthe_0673 
Symbol 
ID4463313 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanosaeta thermophila PT 
KingdomArchaea 
Replicon accessionNC_008553 
Strand
Start bp710196 
End bp711842 
Gene Length1647 bp 
Protein Length548 aa 
Translation table11 
GC content59% 
IMG OID639699683 
Productdihydroxy-acid dehydratase 
Protein accessionYP_843103 
Protein GI116753985 
COG category[E] Amino acid transport and metabolism
[G] Carbohydrate transport and metabolism 
COG ID[COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase 
TIGRFAM ID[TIGR00110] dihydroxy-acid dehydratase 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGGAGCG ATATCACCAA ATCAGGACCT GAGAGGGCGC CGCATCGCGC ACTTCTCAAG 
GCGATGGGCA TCACTGACGA TGAGATCAAA AGACCGTTCA TAGGCGTTGC GAACTCGGCG
AACGAGTTCG TACCAGGGCA CATACATCTT GACAGGATCG CAGAGGCTGT GAAGGCGGGT
ATAAGAATCG CGGGAGGCGT GCCGTTCGAG TTCCAGACGA TCGGCGTCTG CGACGGGATC
GCGATGGGTC ATGGCGGCAT GCGGTACTCC CTCCCATCGA GGGAGATTAT CGAGGACTCG
ATAGAGATCA TGGCCCAGGC GCACCAGCTC GACGGTCTTG TTCTGATACC AACATGCGAC
AAGATCGTCC CCGGACATCT CATGGCAGCC GGGCGTCTTG ATCTCCCGAC CATAGTCGTG
ACGGGCGGCC CGATGCTTCC AGGATTTGCA TGCGATCGTG AGCTCGATCT GATCAACGTC
TTCGAGGAGT GGCAGAAGGG AGGCGAGTCC CTCTCGATTT TAGAGGATCT CGCATGCCCG
GGTGCAGGGT CATGTGCTGG ACTGTTCACA GCTAACTCCA TGGCATGCAT GGCCGAGGCG
CTGGGATTGA GCCTCCCTGG ATGCGCAACA GCACATGCAG TGGATGCGAA GAAGATGCGC
ATCGCCAAAC TCTCCGGGAT GATGATCGTG GAGCTTGTGA AGAGAGGGCT CACTGCGAGA
AAGATCGTCT CGCGCGAGTC ATTCGAGAAC GCTGTCAGGG TCGACATGGC CATCGGAGGG
TCCACAAACA CAGCACTGCA CCTCCCGGCA ATCGCTGCAG AATTCGATAT CGATTTAGAG
CTGGATGTCT TCGACAGGCT GAGCAGGGAG ACGCCGCATC TGGTCAATCT GCGCCCCGGA
GGCCCGCATC ACATGCTGGA TCTTGACCGT GCAGGTGGGG TGCAGGCTGT GATGCATCGC
CTATCATCCA AACTGGATCT TAGTGTCCTC ACGGTCACAG GAAAGACTCT GGGGGCGGTG
CTCGCGGAGT TCAAACCTGT CAACCCCAAG GCGAATGCAG AGGTCATAGC AACACTGGAG
AGACCTGTGC ATCCTGAGGG CGGGATCGCG ATACTCAAGG GAAGCCTGGC GCCAGAGGGC
TCTGTTGTGA AGCAGACTGC GGTCTCGAAG AAGATGCTCG TGCACAAGGG CCCCGCAGTC
GTCTACGACT CCGAGGAGGA GTCGATGAAG GGGATACTGA GCGGCGAGGT CAAGGCGGGA
GATGTTGTTG TCATAAGATA CGAGGGGCCA AAGGGTGGTC CAGGAATGAG GGAGACCCTG
GCACCGACGT CAGCGATCGC AGGCGCGGGG CTCAGCGAGT CTGTGGCGCT GATCACAGAC
GGCAGGTTCA GCGGCGGTAC GCGCGGGCCG TGCATAGGGC ATGTCTCCCC TGAGGCAGCT
GTCGGAGGCC CAATAGCGCT CGTCGAGAAC GGGGATATGA TCTCCATAGA TATACCGAAC
AGGAGGCTGG ATCTGCTCGT TGATGAGGGT GTGCTAGAGC GAAGACGCGC ATCCTGGAGG
CCTCCTGAGC CGAGGGTGAG GGGAGGAGTT CTCGATAGGT ACAGAAAGTC CGTGACGTCT
GCGAGCAAGG GCGGAGTTTT GAGATGA
 
Protein sequence
MRSDITKSGP ERAPHRALLK AMGITDDEIK RPFIGVANSA NEFVPGHIHL DRIAEAVKAG 
IRIAGGVPFE FQTIGVCDGI AMGHGGMRYS LPSREIIEDS IEIMAQAHQL DGLVLIPTCD
KIVPGHLMAA GRLDLPTIVV TGGPMLPGFA CDRELDLINV FEEWQKGGES LSILEDLACP
GAGSCAGLFT ANSMACMAEA LGLSLPGCAT AHAVDAKKMR IAKLSGMMIV ELVKRGLTAR
KIVSRESFEN AVRVDMAIGG STNTALHLPA IAAEFDIDLE LDVFDRLSRE TPHLVNLRPG
GPHHMLDLDR AGGVQAVMHR LSSKLDLSVL TVTGKTLGAV LAEFKPVNPK ANAEVIATLE
RPVHPEGGIA ILKGSLAPEG SVVKQTAVSK KMLVHKGPAV VYDSEEESMK GILSGEVKAG
DVVVIRYEGP KGGPGMRETL APTSAIAGAG LSESVALITD GRFSGGTRGP CIGHVSPEAA
VGGPIALVEN GDMISIDIPN RRLDLLVDEG VLERRRASWR PPEPRVRGGV LDRYRKSVTS
ASKGGVLR