Gene Tmz1t_3508 details


Gene Information

Locus tag: Tmz1t_3508
Symbol:
ID: 7873014
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thauera sp. MZ1T
Kingdom: Bacteria
Replicon accession: NC_011662
Strand:
Start bp: 3844382
End bp: 3846232
Gene Length: 1851 bp
Protein Length: 616 aa
Translation table: 11
GC content: 68%
IMG OID: 643700449
Product: dihydroxy-acid dehydratase
Protein accession: YP_002890479
Protein GI: 237654165
COG category: [E] Amino acid transport and metabolism; [G] Carbohydrate transport and metabolism
COG ID: [COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase
TIGRFAM ID: [TIGR00110] dihydroxy-acid dehydratase
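
The coordinates and lengths listed above can be cross-checked against each other. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length; the function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 3844382
END_BP = 3846232


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1  # drop the stop codon


if __name__ == "__main__":
    length = gene_length(START_BP, END_BP)
    print(length)                  # 1851
    print(protein_length(length))  # 616
```

Both results agree with the Gene Length and Protein Length fields of this record.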


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCCCAGT ACCGTTCCCG CACGTCCACC GCCGGCCGCA ACATGGCGGG CGCCCGCGCC 
CTGTGGCGCG CCACCGGCAT GAAGGACGGC GACTTCGAAA AGCCGATCAT CGCGATCGCC
AACAGCTTCA CCCAGTTCGT GCCCGGCCAC GTGCACCTGA AGGATCTCGG TCAGCTCGTC
GCGCGCGAGA TCGAGTCCGC CGGCGGCGTC GCCAAGGAAT TCAACACCAT CGCGGTCGAT
GACGGCATCG CCATGGGCCA CGGCGGCATG CTGTATTCGC TGCCCTCGCG CGAGCTCATC
GCCGACAGCG TCGAGTACAT GTGCAACGCG CACACGGCGG ACGCGCTGGT GTGCATCTCG
AACTGCGACA AGATCACCCC GGGCATGCTG ATGGCCGCGC TGCGCCTGAA CATCCCGGCG
ATCTTCGTCT CCGGCGGTCC GATGGAGGCC GGCAAGGTCA AGTGGGAAGC CAAGGTGATC
TCGCTCGACC TCGTGGATGC GATGGTCAAG GCGGCCGACA AGTCGTGTTC GGACGAGGAA
GTCGACGCCA TCGAGCGCTC GGCCTGCCCG ACCTGCGGGT CGTGCTCGGG CATGTTCACA
GCCAACTCGA TGAACTGCCT CACCGAGGCG CTCGGCCTGT CGCTGCCCGG CAACGGCACC
ACGCTCGCCA CCCACGCCGA CCGCGAGCGC CTGTTCAAGG AAGCCGGCCG CCGCATCGTC
GATCTCGCGC GTCGTTACTA CGAAAAGGAC GACGCCTCGG TGCTGCCGCG CTCGATCGCC
AGCTTCCAGG CCTTCGAGAA CGCGATGAGC CTGGACGTGG CCATGGGCGG CTCGACCAAC
ACCGTGCTGC ACCTGCTCGC CGCCGCGCGC GAGGCCGGCG TGGACTTCAC GATGAAGGAC
ATCGACCGCG TCAGCCGCCG CGTGCCTTGC CTGTGCAAGG TCGCGCCCGC GATCGCCGAC
GTGCACATCG AGGACGTGCA TCGTGCCGGC GGCATCATGT CCATCCTCGG TGAACTCGAC
CGCGCCGGCC TGCTGCACAC CGATGTGCCG ACCGTGCACA GCGCGAGCCT GGGCGAAGCC
CTGGACAAGT GGGACATCAA GCGCACCGAA GACGAAGCGG TGCACACCTT CTTCCGTGCA
GCGCCGGGCG GGGTGCCGAC CCAGGTCGCC TTCAGTCAGG ACCGGCGCTG GAAGTCGCTC
GACGTCGACC GCGAGCACGG CATCATCCGC AACAAGGAAC ATGCCTTCAC GGCCGACGGA
GGGCTGGCCG TGCTCTACGG CAACATCGCC GAGAAGGGCT GCATCGTGAA GACCGCCGGG
GTGGATGAAT CCATCTGGAA GTTCACCGGC AAGGCCAGGG TGTACGAGAG CCAGGAAGAC
GCGGTGGAGG GCATCCTCGG CGAGCAGGTG CAGGCGGGCG ACGTGGTGGT GATCCGCTAC
GAAGGCCCGA AGGGTGGCCC CGGCATGCAG GAGATGCTCT ATCCCACCTC TTACCTGAAG
AGCCGCGGCC TGGGCGCGCA GTGCGCGCTG CTCACCGACG GACGCTTCTC GGGCGGCACC
TCGGGCCTGT CGATCGGCCA TGCGTCGCCC GAGGCGGCCT GCGGCGGCGC GATCGCGCTG
GTCGAGGACG GCGACACGAT CGAGATCGAC ATCCCCGCGC GCCGCATCCA TCTTGCCATC
GCCGACGCCG AGCTCGCCCG CCGGCGTGCC GCGATGGAGG CGAAGGGCAA TGCGGCCTGG
AAGCCGGTGA AGCGCGAACG CGTGGTCTCC GCCGCGCTGC AGGCCTACGC CGCGCTCACC
ACCTCGGCCG ACACCGGCGC GGTGCGGGAC GTCACCCAGG TCCAGCGCTG A
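
The GC content field can in principle be recomputed from the blocked sequence listing. The following is a minimal sketch, assuming the sequence is pasted as plain text with spaces and line breaks between the 10-base blocks; `gc_content` is a hypothetical helper name, and the sample string is only the first line of the gene sequence, not the full gene.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a sequence, ignoring whitespace."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


if __name__ == "__main__":
    # First line of the gene sequence only; paste the full listing to
    # compare against the GC content reported for the whole gene.
    first_line = (
        "ATGCCCCAGT ACCGTTCCCG CACGTCCACC GCCGGCCGCA ACATGGCGGG CGCCCGCGCC"
    )
    print(f"{gc_content(first_line):.1f}%")
```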
 
Protein sequence
MPQYRSRTST AGRNMAGARA LWRATGMKDG DFEKPIIAIA NSFTQFVPGH VHLKDLGQLV 
AREIESAGGV AKEFNTIAVD DGIAMGHGGM LYSLPSRELI ADSVEYMCNA HTADALVCIS
NCDKITPGML MAALRLNIPA IFVSGGPMEA GKVKWEAKVI SLDLVDAMVK AADKSCSDEE
VDAIERSACP TCGSCSGMFT ANSMNCLTEA LGLSLPGNGT TLATHADRER LFKEAGRRIV
DLARRYYEKD DASVLPRSIA SFQAFENAMS LDVAMGGSTN TVLHLLAAAR EAGVDFTMKD
IDRVSRRVPC LCKVAPAIAD VHIEDVHRAG GIMSILGELD RAGLLHTDVP TVHSASLGEA
LDKWDIKRTE DEAVHTFFRA APGGVPTQVA FSQDRRWKSL DVDREHGIIR NKEHAFTADG
GLAVLYGNIA EKGCIVKTAG VDESIWKFTG KARVYESQED AVEGILGEQV QAGDVVVIRY
EGPKGGPGMQ EMLYPTSYLK SRGLGAQCAL LTDGRFSGGT SGLSIGHASP EAACGGAIAL
VEDGDTIEID IPARRIHLAI ADAELARRRA AMEAKGNAAW KPVKRERVVS AALQAYAALT
TSADTGAVRD VTQVQR
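
The protein sequence can be reproduced from the gene sequence by codon translation. The sketch below builds the codon table from the NCBI amino-acid string for translation table 11, whose codon-to-residue assignments match the standard code; `translate` is a hypothetical helper, not a function from any particular library, and it assumes the input starts in frame.

```python
from itertools import product

# NCBI amino-acid string for translation tables 1 and 11, with codons
# enumerated in T, C, A, G order (third base varying fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate an in-frame DNA sequence, stopping at the first stop codon."""
    dna = "".join(seq.split()).upper()  # drop block spacing and newlines
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First three blocks of the gene sequence.
    print(translate("ATGCCCCAGT ACCGTTCCCG CACGTCCACC"))  # MPQYRSRTST
```

Applied to the full gene sequence, the same function should yield a product whose length matches the Protein Length field if the listing is intact.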