Gene Tmz1t_0747 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTmz1t_0747 
Symbol 
ID7083976 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThauera sp. MZ1T 
KingdomBacteria 
Replicon accessionNC_011662 
Strand
Start bp827955 
End bp829001 
Gene Length1047 bp 
Protein Length348 aa 
Translation table11 
GC content71% 
IMG OID643697772 
Producttranscriptional regulator, AraC family 
Protein accessionYP_002354414 
Protein GI217969180 
COG category[K] Transcription 
COG ID[COG2207] AraC-type DNA-binding domain-containing proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.40849 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGATCC TCGCCCCCTC GCCCGCCGAA CTCACCCGCG GCCGCGCCGC CACGCCGATC 
GCCTTCATCC GTGCGATCGT TGCCGGGTAT CGGCTGCGGG GCATGGATCC GGCCAGCGCG
CTGGCGCAGG CGCAGATCCC GCCCGCGCTG CTGGAAGATC CGGCCGCACA TGTCACCGCC
GCGCAGATGG AGCTGCTCTC GGGCGTGGCG ATGCAGGAGC TCGACGACGA GGCGCTGGGC
TGGTTCTCGC GCCGCCTGCC CTGGGGCAGC TACGGCATGC TCTGCCGCGC CTCACTGACC
TCGCCGACGC TGGAGGTGGC GCTCAAGCGC TGGTGCCGTC ACCACCGCCT GCTGACCGAG
GACATCGTCT TCGAGCTCGC GCAACGCGGC GGCATGGCTA CGATCCACGT CGCCGAGCAC
GCCGGACTCG GCGAGCTGCG CGAGTTCTGC CTCGTCAGCA CGCTGCGCTA CCTGCTCGGC
TACGCCTGCT GGCTGGTCGA TTCGCGCATC GCGCTCGGCG AGGCCGCCTT CCCCTTCCCT
GCCCCGGCGC ACGCCGACGC CTACGCCTAC CTCTTCGCCG GCCCGGCGCG TTTCTCCGCC
GCGGCCGCCT GCATCCGCTT CGACGCGCGC TACCTCGCCC TGCCGGTGCG CCGCGACGAG
AAGGCGCTGC AGGCCATGCT GCAGCGTGCG CTGCCGCTCA CCGTGCTGCA GTATCGCCGC
GACCGCCTGC TGGTGCATGG CGTCGCGCAA TTGCTCGCCG CCAACCCCGC CGCCGCCCAC
ACCGCCGAGG AGGTTGCCGC ACAGCTCAAC CTGTCGGTGC GCACCCTGCA CCGACAGCTC
AAGGAAGAAG GCGTGTCGCT GCAGCGCCTG AAGAACGCGG CGCGGCGGGA GCATGCGGTG
AAGCTGTTGC TGCAGAGCGC CAGACCGGTG AAGCAGATCG CAGCCGCCGT CGGCTTCGAC
AGCGAGAAGA GCTTCGCGCG CGCGTTCCGG GAGTGGACGG GGGTCGCGCC AAGCGCGTAC
CGGTCAAACG CAGAGCCCGC CTGCTAG
 
Protein sequence
MKILAPSPAE LTRGRAATPI AFIRAIVAGY RLRGMDPASA LAQAQIPPAL LEDPAAHVTA 
AQMELLSGVA MQELDDEALG WFSRRLPWGS YGMLCRASLT SPTLEVALKR WCRHHRLLTE
DIVFELAQRG GMATIHVAEH AGLGELREFC LVSTLRYLLG YACWLVDSRI ALGEAAFPFP
APAHADAYAY LFAGPARFSA AAACIRFDAR YLALPVRRDE KALQAMLQRA LPLTVLQYRR
DRLLVHGVAQ LLAANPAAAH TAEEVAAQLN LSVRTLHRQL KEEGVSLQRL KNAARREHAV
KLLLQSARPV KQIAAAVGFD SEKSFARAFR EWTGVAPSAY RSNAEPAC