Gene Tmz1t_1761 details

Gene Information

Locus tag: Tmz1t_1761
Symbol:
ID: 7085728
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thauera sp. MZ1T
Kingdom: Bacteria
Replicon accession: NC_011662
Strand:
Start bp: 1981910
End bp: 1983016
Gene Length: 1107 bp
Protein Length: 368 aa
Translation table: 11
GC content: 75%
IMG OID: 643698780
Product: Smr protein/MutS2
Protein accession: YP_002355409
Protein GI: 217970175
COG category: [S] Function unknown
COG ID: [COG2840] Uncharacterized protein conserved in bacteria
TIGRFAM ID:
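
The coordinate and length fields in this record can be cross-checked against each other. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive positions and that the last codon of the CDS is a stop codon. Both assumptions are consistent with the numbers listed here but are not stated by the record itself. The function names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1981910
end_bp = 1983016
gene_length_bp = 1107
protein_length_aa = 368


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS, assuming its final codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Compare the derived values with the values reported in the record.
print(span_length(start_bp, end_bp) == gene_length_bp)
print(expected_protein_length(gene_length_bp) == protein_length_aa)
```

Both comparisons hold for this record: the span covers 1107 bp, which is 369 codons, or 368 residues plus one stop codon.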


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGCCGCC GCGCTCCCGC GCCGGGTCCG GCCGGCACCC CAGGACGCGA CGCGCAGGGC 
GCGGTGCCGA AGCCCGCCTC GCCCTTCGCA GCCCTGCGCA AGCAGCTGCA GCAGCGCGCG
CTCCCGGCAC CCGTGTCGTC GCCCGCCCCG GCGAAGCGTA TCCGCCACCC GCTTGAGGAA
GACTCCTCCA GCGCCGGTGC CGATCGTGCA CAGGAGCCCG ACGCGGAGGC GCTCGAGCTC
TTCCGGCGCA GCGTCGGCGC GGTGCGCCCG GTGCGCGGCA CGGACCGCGT CGAGATCCAC
CGCCCCCGCC CTGCCCCGCG GCCGCGCACG CAAGCCGTGG AGGACGAGGA AACCGAGGAG
CCCGTCCGTG CGCGACCCGA GACCGACCCG CTACGCGCGG CCTACGAGGG CGTGATGCCG
CTGAGGGATA CCGGCCGAGT GGCGCTCGAC ACACCGCTGC GTCACCACGC CCGCCATGCA
GGCGGTACGC ATCCCGCGCC GGCGCTGCGA CCCGACGCGA TCGTGCTGCC CGCGGACGCC
GACGTCAGCG ATCCGGCAGC GCTCTTTCTT GCGGTAGTGG GCAATGCCCG CCCGGTCACC
GACCGCAACC GCGTGGAGCT GGAGCGCCCG CAGCCGGCAC CTGCGCCGCT CAAGCGCGAG
GAGGACGAGC GCGCGGCGCT CGGCGAATCG CTCGCCGCAC CGCTCACCTT CGAGGATCGC
CTGGACATGG GCGACGAGGC GGCCTTCCTG CGGACCGGGC TGCCGCGCCG GGTGCTGACC
GATCTGCGCC GCGGGCGCTG GGTGCTGCAG GGCCAGATCG ACCTCCACGG CCTCACCCGC
GACGAGGCGC GCGCCGCGCT GGCGAACTTC CTGCACGACG CGCTTGCCCA GGGCAAGCGC
TGCATCCGGG TGATCCACGG CAAGGGCCAC GGCTCGCCCG GGAAGGTGTC GATCCTGAAA
CAGCTGTCGC GCGGCTGGCT GGCGCAGCGC GAGGAGATCC TCGCCTTTTG CCAGGCCGGC
CCCCACGATG GCGGCGGCGG CGCCCTGCTG GTGCTGCTGC GCGCGCAGAA CGCCGCGCCG
CGCGCCCGAA TGCCGTTACC CGCCTGA
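
A GC content figure such as the one reported for this gene can be recomputed from the sequence text. The sketch below is one possible way to do that, assuming the sequence is pasted as shown here, in space-separated blocks of ten bases. The function name `gc_percent` is hypothetical, and the database's own calculation and rounding are not documented in this record.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases, ignoring spaces and line breaks."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# Example on the first two blocks of the gene sequence only.
# Paste the full sequence to check the value for the whole gene.
print(gc_percent("ATGAGCCGCC GCGCTCCCGC"))
```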
 
Protein sequence
MSRRAPAPGP AGTPGRDAQG AVPKPASPFA ALRKQLQQRA LPAPVSSPAP AKRIRHPLEE 
DSSSAGADRA QEPDAEALEL FRRSVGAVRP VRGTDRVEIH RPRPAPRPRT QAVEDEETEE
PVRARPETDP LRAAYEGVMP LRDTGRVALD TPLRHHARHA GGTHPAPALR PDAIVLPADA
DVSDPAALFL AVVGNARPVT DRNRVELERP QPAPAPLKRE EDERAALGES LAAPLTFEDR
LDMGDEAAFL RTGLPRRVLT DLRRGRWVLQ GQIDLHGLTR DEARAALANF LHDALAQGKR
CIRVIHGKGH GSPGKVSILK QLSRGWLAQR EEILAFCQAG PHDGGGGALL VLLRAQNAAP
RARMPLPA