Gene Tmz1t_2540 details


Gene Information

Locus tag: Tmz1t_2540
Symbol:
ID: 7873979
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thauera sp. MZ1T
Kingdom: Bacteria
Replicon accession: NC_011662
Strand:
Start bp: 2737758
End bp: 2739803
Gene Length: 2046 bp
Protein Length: 681 aa
Translation table: 11
GC content: 67%
IMG OID: 643699462
Product: protein of unknown function DUF839
Protein accession: YP_002889519
Protein GI: 237653205
COG category: [R] General function prediction only
COG ID: [COG3211] Predicted phosphatase
TIGRFAM ID:
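
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2737758
end_bp = 2739803
gene_length_bp = 2046
protein_length_aa = 681

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and does not contribute a residue.
codons = gene_length_bp // 3
expected_protein_length = codons - 1

print(span_bp == gene_length_bp)                     # span matches Gene Length
print(gene_length_bp % 3 == 0)                       # length is a whole number of codons
print(expected_protein_length == protein_length_aa)  # matches Protein Length
```

Under these assumptions all three checks print `True`: the span is 2046 bp, which is 682 codons, or 681 residues plus one stop codon.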


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAGAAGC CCGATACGCT CGACGACCTT CCGACCAACC TCTCGTCCAA CGAGCACTTC 
CAGTCCGTCG TGGAGCGTGC GGTCAGCCGC CGCGGCTTCC TGAAGAGCGG CCTGGGCCTC
TCGGCCGTCA CCTTCCTGTC GGGCTCGCTC GCTGCCTGTA CCTCGGACGA CGACACGCCC
GTGGCCGGCA CCCCGACCGC CGGGACGCCC CCTGCGCCTG CACCTGCCGC GGGGCCGCTC
CTCGGCTTCG CCGCCGTCGC CACATCGAGC GGCGATGCGA TCGTCGTCCC CGCGGGCTAT
TCGGCGCAGA TCTTCACTCC CTGGGGATCG CCGCTGTTCA GCGACTCCCC CGCATGGCGA
GCGGACGGCA CCAACACCGG TGAAGAGCAG GCTCGCCAGG TCGGCGACAA CCACGACGGG
ATGAGCTACT TCCCGATCGA CGGATCGAAC GAAGGCCTCC TGGTGATGAA CCACGAGTAC
TGCAACTACG AGTATCTGTT CGGCGCCGAA TTCATGACGC CGTGGACGGC GGACAAGGTC
TCCAAGGCGC TCAACGCGCA TGGGGTCTCG GTCCTCCACG TCAAGAAGAA CGGCGCGGGG
CGCTGGGAAG TCCACATCGG CTCGCCGTAC AACCGCCGGA TCACCGGCAA GACGCCGATG
ACGCTGACCG GCCCCGCCGC CGGCGACGCC CTCCTGCGCA CCACGGCGGA CCCCAGTGGC
CTCAACGTCT TGGGAACGCT GAACAACTGC GCCAACGGCA AGACGCTGTG GAACACCTAC
CTGACCTGCG AAGAGAACTT CAACGGCTAT TTCGCCACCG CAGCGAGCCC GGCACCGACG
CGCAGCGCGG CCTTCGTCCG CTACGGCATC AGCGCCGGCG GTTCCGGCTA CCGCTGGCAC
GAGCATGAGG ACCGCTTCGA CTACGCCAAG GAACCCAACG AGGCCAACCG CTTCGGCTGG
GTCGTCGAGA TCAATCCCTT CGAGCCCGGC TCCACGCCGA AGAAGCGCAC CGCGCTCGGC
CGCTTCAAGC ATGAAAACGC CGAGATGAGG CTCGCCGCAG ACAAGCGCGT GGTCGTCTAC
ATGGGCGACG ACCAGGCCAA CGATTACATC TACAAGTTCG TGTCCGATGG CGTTTTCGAC
GCAAGCCGTG GGCTTGCAAA CGGCAACCTG CTCGACGCTG GCAAGCTGTA TGTGGCCAAA
TTCGACGCCG GCGCGGCCAG CGGCGACTTC ATGGGTGTGG GGGAATGGCT GCTGCTCGAC
AAGGCGGCCA ACCCCACGCT GGCGGCAGAC GCCCGCTTCG CCACCCAGGC CGAAGTCCTG
ATCCACGCCC GCCTCGCCGC CGACGCCGTC GGTGCGACGA AGATGGATCG CCCGGAGTGG
ATCACCACGC ATCCGCAAAC CGGCGAGGTC TATTGTGCCC TGACCAACAA CTCCGGCAGG
ACCACGACGG ACGAGGCGAA CCCGCGCGCA CAGAACCGCT ACGGACAGAT CGTGCGCTGG
CGCGAGGCCG GCGACGACGC CGCCGCGATG ACTTTCGAGT GGGATCTCTT CGTGCTCGCA
GGCAACCCGG TGGCTTACCC CGACCGCCAG GACCTGCGCT CCGGTTCCGC GAACGTATCC
GCCGACAACA CCTTCAACAG CCCCGACGGC ATCGGCTTCG ACGGCGCGGG CCGGCTGTGG
ATCCAGACCG ACGGAAACTT CTCGAATAGC GGCGACTACG CGGGCCAGGG CAACAACCAG
ATGCTGGTTG CCGACCCCGA GAGCAAGGAG ATCCGCCGCT TCCTGGTCGG ACCTTCGGGC
TGCGAGATCA CCGGCCTTGC ATTCTCCCCC GACTACCGGA CCATGTTCAT CAACGTGCAA
CATCCCGGCG AGGCCGGTTC GCATCCGCGC GCACCGGACG CGAGCATGCG CGGGAGCCTG
TCGATGGACG AGTATCTTGC GCAGAACCCG CTCGCGTTCA GTCAGTGGCC CGAGGCCGGC
GGCGGTCGCC CACGCTCGGC GACCGTCGTG ATCACGAAGG ATGACGGGGG CGTGGTCGGC
TCCTGA
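
The gene sequence is displayed in space-separated groups of 10 bases, so the grouping has to be removed before any computation. The sketch below shows one way to do that and to compute a GC fraction. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the sequence is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove the whitespace used for 10-base grouping and upper-case the result."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in seq (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence in this record, used as a short example.
first_line = "ATGAAGAAGC CCGATACGCT CGACGACCTT CCGACCAACC TCTCGTCCAA CGAGCACTTC"
seq = clean_sequence(first_line)

print(len(seq))                     # number of bases on the first line
print(f"{gc_fraction(seq):.1%}")    # GC fraction of this fragment only
```

Applied to the full sequence rather than a single line, the result can be compared with the GC content field of the record.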
 
Protein sequence
MKKPDTLDDL PTNLSSNEHF QSVVERAVSR RGFLKSGLGL SAVTFLSGSL AACTSDDDTP 
VAGTPTAGTP PAPAPAAGPL LGFAAVATSS GDAIVVPAGY SAQIFTPWGS PLFSDSPAWR
ADGTNTGEEQ ARQVGDNHDG MSYFPIDGSN EGLLVMNHEY CNYEYLFGAE FMTPWTADKV
SKALNAHGVS VLHVKKNGAG RWEVHIGSPY NRRITGKTPM TLTGPAAGDA LLRTTADPSG
LNVLGTLNNC ANGKTLWNTY LTCEENFNGY FATAASPAPT RSAAFVRYGI SAGGSGYRWH
EHEDRFDYAK EPNEANRFGW VVEINPFEPG STPKKRTALG RFKHENAEMR LAADKRVVVY
MGDDQANDYI YKFVSDGVFD ASRGLANGNL LDAGKLYVAK FDAGAASGDF MGVGEWLLLD
KAANPTLAAD ARFATQAEVL IHARLAADAV GATKMDRPEW ITTHPQTGEV YCALTNNSGR
TTTDEANPRA QNRYGQIVRW REAGDDAAAM TFEWDLFVLA GNPVAYPDRQ DLRSGSANVS
ADNTFNSPDG IGFDGAGRLW IQTDGNFSNS GDYAGQGNNQ MLVADPESKE IRRFLVGPSG
CEITGLAFSP DYRTMFINVQ HPGEAGSHPR APDASMRGSL SMDEYLAQNP LAFSQWPEAG
GGRPRSATVV ITKDDGGVVG S
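
The protein sequence can be related back to the gene sequence by translating it codon by codon. The following is a minimal sketch, assuming that translation table 11 assigns the same amino acids to all 64 codons as the standard genetic code and differs only in which codons may act as start codons. It does not handle alternative start codons, which is sufficient here because the gene begins with ATG. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codons enumerated in the conventional order, each base cycling through T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA up to (not including) the first stop codon.

    Whitespace is ignored; codons with unexpected characters become 'X'.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First six codons of the gene sequence in this record.
print(translate("ATGAAGAAGC CCGATACG"))  # prints MKKPDT
```

The output matches the first six residues of the protein sequence above. The final two codons of the gene, TCC and TGA, translate to S followed by a stop, which agrees with the protein ending in S.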