Gene Tmz1t_3100 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTmz1t_3100 
Symbol 
ID7874570 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThauera sp. MZ1T 
KingdomBacteria 
Replicon accessionNC_011662 
Strand
Start bp3356661 
End bp3357632 
Gene Length972 bp 
Protein Length323 aa 
Translation table11 
GC content66% 
IMG OID643700023 
Producttranscriptional regulator, AraC family 
Protein accessionYP_002890075 
Protein GI237653761 
COG category[K] Transcription 
COG ID[COG2207] AraC-type DNA-binding domain-containing proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.605914 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCCATCC GCTTCCTCAA CGGCCAGAGC AATGTCTTCG CCGCCGCCAA CCCCTTCGAG 
GTCTCCGAGT ACGTGCGCGC CAACGTGGGT TCGCACAGCC TGCGCCTGCC GCGCGCCAGC
GACGCCAGCG CCTCGCTCAG CCACCGCCGC GCCGGCACGC TGGATCTTTG CCGTCTCAGC
TACGGCGCGC AGGCCCGCGT GCTGTCCGAG AGCCTCGGCG ACATCTACCA CGCCCAGTTC
ATCCTGCAAG GCTATTGCAG CTACACGCTC GCCAACCGCA CGCTCGACCT GCCCGCCGGC
CACGTGCTGG TGCTCAACCC GGACGAGCCG GTGGACCTCA CCTACTCGGA CAACTGCGAG
AAGTTCATCG TCCGCATCCC CTCGGCGATG CTCGACGACG CCTGCACCGA GCACCGCTGG
TTCAAGCCCA ACGAGCGCAT CAAGTTCAGC CCCGAGCCGC AGCGTTTCGA GGACATCGAC
AGCCTGCTGC TGCTGTTGCG CCTGCTCTGC GAGGAGGCCG AATCCGAGCT GGCGACGCCG
CAGATGCTGC AGCACTACTG TCGCGTGGTC ACCACCAAGC TGATGGTGAT GCTCAAGCAC
AACGTCAGCA TGGTCGCCCC CACCCGGCAC GCGCCCAGCT TCGAGCGCCT GGTGAACTAC
ATCGAGCGCA ACATCAAGCT CGATCTCAGC GCCGAGGATC TCGCCCACTA CGCCGGGCTG
AGCCTGCGCT CGCTCTACCT GCTGTTCGAG AAGAACGTGA AGACGACGCC GAAGAACTTC
GTGCGCCAGA AGAAGCTCGA GAAGGTGCAT TCGATCCTGA GCGACCCGGG CCAGGCCTGT
CCGAACGTCA CCGCGGTCGC GCTCGAGTAC GGCTTCTCGC ACCTGGGCCG CTTCTCCGAA
CTTTACAAAT CCACCTACGG CGTGCTGCCC TCGCAGTCGA TCCGCTGCCG CCAGCCCCAG
GCCGGGCGCT GA
 
Protein sequence
MPIRFLNGQS NVFAAANPFE VSEYVRANVG SHSLRLPRAS DASASLSHRR AGTLDLCRLS 
YGAQARVLSE SLGDIYHAQF ILQGYCSYTL ANRTLDLPAG HVLVLNPDEP VDLTYSDNCE
KFIVRIPSAM LDDACTEHRW FKPNERIKFS PEPQRFEDID SLLLLLRLLC EEAESELATP
QMLQHYCRVV TTKLMVMLKH NVSMVAPTRH APSFERLVNY IERNIKLDLS AEDLAHYAGL
SLRSLYLLFE KNVKTTPKNF VRQKKLEKVH SILSDPGQAC PNVTAVALEY GFSHLGRFSE
LYKSTYGVLP SQSIRCRQPQ AGR