Gene Tmz1t_2042 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTmz1t_2042 
Symbol 
ID7083802 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThauera sp. MZ1T 
KingdomBacteria 
Replicon accessionNC_011662 
Strand
Start bp2306180 
End bp2307823 
Gene Length1644 bp 
Protein Length547 aa 
Translation table11 
GC content71% 
IMG OID643699069 
ProductPAS/PAC sensor signal transduction histidine kinase 
Protein accessionYP_002355686 
Protein GI217970452 
COG category[T] Signal transduction mechanisms 
COG ID[COG4251] Bacteriophytochrome (light-regulated signal transduction histidine kinase) 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0504248 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGGTCGC GCCGGCGCCC GTCGCCGCAC CCGCTTCGCG CACGGCCCGC CGCGGGCAGC 
TTCTGTGCTA TACAGGACGA GTCCGACCGC CACGGAAACC ATGCGGCATG CGGGACCGCC
GCCACTGACC AGGTGAGCCC CATCGACGCG TCCAGACCGC ACCCACCGCC TCGTGCCGGG
CATGTGCTCG TGGTGGCGCT CGCCCTCGGC GCGGGCTCGG CGCAGGCCGC GGGGATCGAT
GCCGGCGGGC TGGCCTGGAA TGGCGGTCTG GTTCTGCTCG GCGGCGGGCT GGGTTATCTG
CTCGGCCGCC GCAGCCTGCG CCGACGCACC CTCGCCGGTG CGGCAGAGGG CTGCGCTGCG
GACGGCCCGC TTCACGACGA CGCCCCGGCG AGCGGCGGCG CGGTTTCGAG GCGGGAGTGG
GCAGGCACGC CTGCAGACGC CGACGATTGC GCGCCGGCGA GCGCGCTCCC CCATGCCCTC
CTCCCCGCCG CGCACGCCGC TCCGCCCGCG GGCGCAGGCG CGGCGATCGG GGTGATCGCG
ACCGAATCCT ATCGCGAGGT GGTCGATTCG CTCAGCGAGG TGCTCTTCCG CACCGACGAG
CTTGGCCGCC TGGTTTTTCT CAACGACGCC TGGAAGGATC TCTCCGGTTT CGATCTGGAC
GCCACGCGCC AGCATGCGCT GACCGATTTC CTCCATCCAG ACGACCGCTT GCGCGCACGC
GACGCAATCC GCGCCCTGCT GTGCGGAGAC GGCCAGGAAT GGACCGACGA GTTGCGGCTG
CGCACGCTCT CCGGCGAGAT CCGCTGGGTG GCGATCGACT GCCGTGCCTT GCGCGACGGC
AGCGGGCAGG AAAGCGGGAT CGCCGGCACC ATCGACGACA TCTCGGCGCG CAAGATCGCC
GAGTTCAGCC TGCGCAACCT GAACCAGGAG CTCGAGTCGC GGGTGCGCTC GCGCACCGCC
GAACTCGAGA CCGCGGTGCG CGAGCTCGAG GCCTTCTCGT ATTCCGTCTC GCACGACCTG
CGTGCGCCGC TGCGCGCCAT CGACGGCTTC GCGCGCATCC TCGTCGAGGA GGCCGGGCCG
CGGCTCGACG AACGCCAACG CGAGCAGCTC GTGCGCATCC GCGCCGGCGC CGAGCGCATG
GCGATCCTGA TCGATGCGCT GATCGACCTC GCCAGCGTCT CCCGGCAACC GTTGCGCAGG
AGGCCGGTCG ACCTGTCGCG CATCGCCGAT GCGGTGATCC GCGACCTGCA GGCGGAGTCG
CCCGGGCGCG TGGTCGCGGT CGAGATCACC AGCGACATGA CGGTGGTCGC CGACCCGGTG
CTGATGCACG TGCTGCTCGA CAACCTGCTG CGCAATGCAT GGAAGTTCAC AAGCCAGTGC
GAACATCCGC GTATCGTCTT CGGCGCGGAA CGCGACGGCG AGCGCACGGT GTTCCACGTC
GAGGACAACG GCGCCGGATT CGAGATGAAC TACGCGGGCA AGCTCTTCCA GCCCTTCCAG
CGCCTGCACG CGCAGCACGA GTTTCCCGGC ACCGGGATCG GGCTCGCCAC CGTACAACGC
ATCGTCGCCC GCCATGAGGG CCGGGTGTGG GCCAGCGCGG AACCGGGCAA GGGCGCGCGC
TTCTGCTTCA TGCTGGGGCA TTGA
 
Protein sequence
MRSRRRPSPH PLRARPAAGS FCAIQDESDR HGNHAACGTA ATDQVSPIDA SRPHPPPRAG 
HVLVVALALG AGSAQAAGID AGGLAWNGGL VLLGGGLGYL LGRRSLRRRT LAGAAEGCAA
DGPLHDDAPA SGGAVSRREW AGTPADADDC APASALPHAL LPAAHAAPPA GAGAAIGVIA
TESYREVVDS LSEVLFRTDE LGRLVFLNDA WKDLSGFDLD ATRQHALTDF LHPDDRLRAR
DAIRALLCGD GQEWTDELRL RTLSGEIRWV AIDCRALRDG SGQESGIAGT IDDISARKIA
EFSLRNLNQE LESRVRSRTA ELETAVRELE AFSYSVSHDL RAPLRAIDGF ARILVEEAGP
RLDERQREQL VRIRAGAERM AILIDALIDL ASVSRQPLRR RPVDLSRIAD AVIRDLQAES
PGRVVAVEIT SDMTVVADPV LMHVLLDNLL RNAWKFTSQC EHPRIVFGAE RDGERTVFHV
EDNGAGFEMN YAGKLFQPFQ RLHAQHEFPG TGIGLATVQR IVARHEGRVW ASAEPGKGAR
FCFMLGH