Gene Tmz1t_3372 details

Gene Information

Locus tag: Tmz1t_3372
Symbol:
ID: 7873863
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thauera sp. MZ1T
Kingdom: Bacteria
Replicon accession: NC_011662
Strand:
Start bp: 3685810
End bp: 3686937
Gene Length: 1128 bp
Protein Length: 375 aa
Translation table: 11
GC content: 73%
IMG OID: 643700309
Product: transcriptional regulator, AraC family
Protein accession: YP_002890343
Protein GI: 237654029
COG category: [K] Transcription
COG ID: [COG2207] AraC-type DNA-binding domain-containing proteins
TIGRFAM ID:
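
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The Python sketch below assumes that Start bp and End bp are 1-based, inclusive coordinates and that the CDS ends with a single stop codon that is not counted in the protein length. The function names are illustrative only and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record above.
length_bp = gene_length(3685810, 3686937)
print(length_bp)                  # 1128
print(protein_length(length_bp))  # 375
```

Both results agree with the Gene Length and Protein Length fields of the record.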


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATCCCCCGCC GTAGCCGCGC CACGGTCGCG CCCGGCTTCG TCACCGGCAT GCTGGCCGGG 
CTGCAGCGGC GCGGACTCGA TGGGGCGCCG CTGCTCGCAC GCGCCGGGAT CGACCTTGCA
GAAACCGACA CGCGCATCCC AGTCGAGCGC TACGCCCTGC TCTACAACCT GTGCGTGGAG
GCACTTGAAG ACGAGGCCTT CGGCCTGCTC CCGGAGCCGA TGCGCCCGGG CAGCTTCGAG
TTCCTCTGCC GCGCACTGCT CGGCGCGCCC ACGCTGGGCG AGGCCCTGCT GCGCGCGATC
CGCTTCCTGC GCATCGTGTT GCCCGCCTTC CGCATTGAGC TGCGGGTGGA CGGCGCGCGC
GCCGAGTTGC TGCTGGACGA CGGCGGCAGC CTCGGCCCCG GGCTCGACGC CCCGGCGCGC
GTGTTCGCCT ACGAATGGCT GCTGCGCCTG CTGCACGGCG TGGCGAGCTG GTTCGTCGGC
CGCGGCCTGG CGCTCGACGC GGTCGCCTTC CCTTACGCAC GCCCGGCCCA TGCGGACGAC
TACGCGCTCG TCTACACCGA GCATTCGAGC TTCGACGCGC CGCAGCTGGC CGCCCGCCTG
CAGGCCAACC TGCTGGCGCT GCCGCTGCGC CGCGACGAGG CGGCGCTGGT CGGCTTCCTC
GAGGGCGCAC CGGGAAAGAT CACCACCCTC TACCGGCGCG ACCGCGAGAT GGTCTTCCGC
GTGCGCGACA TCCTGCGCGA CGCGCTGCCG CAGAACCTTT CGCTCGAAGA GGTCGCCGAG
CGCCTGCACG TGTCGCCGCG CACCCTGCAC CGGCGGCTGG AGGATGAGGG CTCGGGATTC
CGCAACATCA AGGAGGCCAC CCGCCGCGAC ATCGCCTATG CGCGCCTGGC CAAGACCCGC
CAGCCCATCG CCCGCATCGC GGCCGAGCTC GGCTACGCCG ACCCGTCCAC CTTCTACCGC
GCCTTCGTCG CCTGGAGCGG CATGTCGCCG GAGCAGTTCC GGCACCGGCT GGCGGGCAAC
GACGGTCTTC CCGCGGCGTC TGCCGGTCCG GACAGGCGCC CTGCCCCAAC CCCCCGTGTC
ACCGCCGGAC GGCAGCGATC ACGCGAATTC GGTCCGACCG AGGCATAG
 
Protein sequence
MPRRSRATVA PGFVTGMLAG LQRRGLDGAP LLARAGIDLA ETDTRIPVER YALLYNLCVE 
ALEDEAFGLL PEPMRPGSFE FLCRALLGAP TLGEALLRAI RFLRIVLPAF RIELRVDGAR
AELLLDDGGS LGPGLDAPAR VFAYEWLLRL LHGVASWFVG RGLALDAVAF PYARPAHADD
YALVYTEHSS FDAPQLAARL QANLLALPLR RDEAALVGFL EGAPGKITTL YRRDREMVFR
VRDILRDALP QNLSLEEVAE RLHVSPRTLH RRLEDEGSGF RNIKEATRRD IAYARLAKTR
QPIARIAAEL GYADPSTFYR AFVAWSGMSP EQFRHRLAGN DGLPAASAGP DRRPAPTPRV
TAGRQRSREF GPTEA
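
The relationship between the gene sequence and the protein sequence can be sketched in Python as follows. This is a minimal illustration, not the database's own pipeline: it assumes the amino acid assignments and initiation codons of NCBI translation table 11, assumes that an initiation codon in the first position is read as methionine, and uses the hypothetical helper name `translate_cds`. Under those assumptions, the leading `ATC` of the gene sequence is consistent with the leading `M` of the protein sequence.

```python
# Codon table built from the conventional TCAG ordering; the amino acid
# assignments of translation table 11 are assumed to match this string.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Initiation codons assumed for translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate a CDS given 5' to 3'; spaces and line breaks are ignored."""
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3  # drop any trailing partial codon
    residues = []
    for pos in range(0, usable, 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            residues.append("M")  # initiation codon read as methionine
            continue
        amino_acid = CODON_TABLE[codon]  # raises KeyError on ambiguous bases
        if amino_acid == "*":
            break  # stop codon ends the protein
        residues.append(amino_acid)
    return "".join(residues)


# First 30 bases of the gene sequence above.
print(translate_cds("ATCCCCCGCC GTAGCCGCGC CACGGTCGCG"))  # MPRRSRATVA
```

The printed residues match the first ten residues of the protein sequence in the record.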