Gene Tmz1t_0203 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTmz1t_0203 
Symbol 
ID7084324 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThauera sp. MZ1T 
KingdomBacteria 
Replicon accessionNC_011662 
Strand
Start bp238135 
End bp239259 
Gene Length1125 bp 
Protein Length374 aa 
Translation table11 
GC content72% 
IMG OID643697245 
Producttranscriptional regulator of molybdate metabolism, LysR family 
Protein accessionYP_002353894 
Protein GI217968660 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1910] Periplasmic molybdate-binding protein/domain 
TIGRFAM ID[TIGR00637] ModE molybdate transport repressor domain 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.433996 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGCAGGA TCAGGCTGAA CTACGTACTC GGCGCGGACA CGGGGTCGGC GCCGCTCCAC 
AATCCCCTGC TCGACCTCCT GCAGGCGGTG CGCGAGCAGG GCTCGATCTC CGCCGCCGCG
CGTGTGCTGG ACCTCTCCTA CCGCCACGTT TGGGGCGAGC TCAAGCGCTG GGAGCTCGAG
CTCGGCCAGC CGCTGATCCT GTGGGAGAAG GGCCAGGCCG CGCGCCTGTC CGAGTTCGGC
GCCAAGCTGC TGCTCGCCGA GCGCCAGGTG CAGGCTCGCC TGTTGCCGCA GATCGAGGCG
CTGCGCGCCG ACCTCGAGCG CGCCTTCGCG ATCGCCTTCG ACGATTCGGT GCATGTGCTG
TCCTTCCACG CCAGCCACGA CGAGGCGCTC GCCGCGCTCG GCGAGGAGGC GCGGACCCGC
GGCCTGCACC TGGACATCCG CTTCACCGGC AGCGTCGATG CGATCCGCGC ATTGAACGAG
GGCCGCTGCA CCATGGCCGG CTTCCACGTG CGCCTGCCGG CGGTGCCGGG CTCGCGGGGG
TCGTCGTCAC ACTCGCAGCG CACCTACAAG CCGCTGCTGC GCCCCGGCCT GCACAAGCTG
ATCGGCTTCG CCCGCCGCAG CCAGGGCCTG ATCGTGGCAC GCGGCAATCC GCGCGGCCTG
CACGGCCTCG CCGACCTCGC GCGCCCCGGC GTGCGCTTCG TCAATCGCGC GCGCGGCACC
GGCACGCGGG TGATTTTCGA CGAGCTGCTC GGCGAGCTCG GCCTCGCGCC CGCCGCGATC
GAGGGCTACG CCAATGACGA ACCCTCGCAC GCCGCGGTCG CGCAGGCGGT GGCGAGCGGT
CAGGCCGACG CCGGCTTCGG CATCGAGGCG AGCGCGCGCG GCCGCGGACT GGACTTCGTG
CCGCTGGTCG AAGAGGCCTA CTTCCTCGCC TGCCTCAAGT CCACCCTGGA GCACGACGCC
ACCCGCGCCC TGCTCGCGCT GCTGCGCACC GCCGCATGGC AGCAGCGCCT GGCCGACCTG
CCCGGCTACG CGCCGATGCA GAGCGGCGAG GTGCTGTCGA TGAGCCGGGT GCTGCCGTGG
TGGCGCTTCG GGGGGCGTGG AAACGCTCGC GAAGGAAAGC AATAA
 
Protein sequence
MRRIRLNYVL GADTGSAPLH NPLLDLLQAV REQGSISAAA RVLDLSYRHV WGELKRWELE 
LGQPLILWEK GQAARLSEFG AKLLLAERQV QARLLPQIEA LRADLERAFA IAFDDSVHVL
SFHASHDEAL AALGEEARTR GLHLDIRFTG SVDAIRALNE GRCTMAGFHV RLPAVPGSRG
SSSHSQRTYK PLLRPGLHKL IGFARRSQGL IVARGNPRGL HGLADLARPG VRFVNRARGT
GTRVIFDELL GELGLAPAAI EGYANDEPSH AAVAQAVASG QADAGFGIEA SARGRGLDFV
PLVEEAYFLA CLKSTLEHDA TRALLALLRT AAWQQRLADL PGYAPMQSGE VLSMSRVLPW
WRFGGRGNAR EGKQ