Gene Tmz1t_3599 details

Gene Information

Locus tag: Tmz1t_3599
Symbol:
ID: 7873104
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thauera sp. MZ1T
Kingdom: Bacteria
Replicon accession: NC_011662
Strand:
Start bp: 3947418
End bp: 3949016
Gene Length: 1599 bp
Protein Length: 532 aa
Translation table: 11
GC content: 63%
IMG OID: 643700539
Product: restriction modification system DNA specificity domain protein
Protein accession: YP_002890569
Protein GI: 237654255
COG category: [V] Defense mechanisms
COG ID: [COG0732] Restriction endonuclease S subunits
TIGRFAM ID:
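
The coordinates and lengths listed in this record can be cross-checked with a little arithmetic. The Python sketch below does so under two assumptions that the record itself does not state: that the start and end coordinates are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the gene record.
start_bp = 3947418
end_bp = 3949016
gene_length_bp = 1599
protein_length_aa = 532

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and does not contribute a residue.
codons = gene_length_bp // 3
expected_protein_length = codons - 1

print("span matches gene length:", span == gene_length_bp)
print("length is a whole number of codons:", gene_length_bp % 3 == 0)
print("protein length matches:", expected_protein_length == protein_length_aa)
```

Under those assumptions all three checks hold: 3949016 - 3947418 + 1 = 1599 bp, and 1599 / 3 - 1 = 532 aa.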


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.743632
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATCGATT CCGCTCCACA GGCCAGCGAT CTTCCCGCAG GCTGGGACGT CGCGTCATTC 
GGTGAACTCA ACTCGTTTAG CGGCAGCACG GTAAACCCCG CGACACGGCC AGACGAAGTC
TTTGAGCTCT ACAGCGTGCC GAGCTTCCCA ACAAAGCACC CCGAGCAGCT ACCGGGACGT
GCGATCGGCT CGACGAAGCA GACCGTCAGG CCTGGTGACG TCCTGGTCTG CAAGATCAAC
CCCCGCATTA ATCGTGTTTG GACAGTCGGT ACCCGTCGCG ATCACGAGCA AATTGCCTCG
TCAGAGTGGA TCGGGTTCCG ATCTGACGCC ATGGTGCCGC GGTTCGCGAA GCACTACTTC
AGCGAACCGT CATTCCGGTC GCTCTTGTGC AGCGAGGTCT CCGGCGTAGG CGGCTCCCTG
ACCCGCGCCC AGCCAAGTCG CGTAGCCAAG TATCCTGTCC TCGTTGCGCC GCTGGCAGAA
CAGGCCCGCA TCGCCGACCA ACTCGAGGCC CTGCTGGCGC GTATCCAGGC CTGTCAGGAC
CGCCTGGAGG CCATTCCGGC GTTGCTCAAG CGGTTTCGAA AGCTGGTTCT CTCGTCTGCG
CTTTCTGGCG ACCTGACTGA AGTCTGGCGG GCCGAACAAG GAGTGGGCTT AGATACTTGG
TCGGCGAGGA CGATTGCTGA CGTCGCGGAA GTTGGGACTG GATCCACCCC TCTTCGATCA
AACAGCAACT TCTACGCAGA GACCGGGACC CCTTGGGTGA CAAGCGCGGC CACGAGTCGC
CCTTACATCG ACTCTGCCGA CCAGTACGTG ACTAAAGCGG CAATCGATGC ACACAGGCTC
AGGGTCTACC GCCCCGGGAC ACTGATCATT GCTATGTACG GTGAAGGGAA GACTCGTGGG
CAAGTCAGCG AGCTTCGAAT TGACGCGACC ATCAATCAGG CCTGCGCTGC GATAACTGTC
GATGAGCAGC AAGCCAACGC CGCCTTCGTC AAGCTTGCAC TCTTGTCGCA GTACGAGCAA
ACGCGCGCGC TTGCGGAAGG CGGCGCGCAG CCAAATCTGA ACTTGTCCAA GGTGCGCGGA
ATTCCACTAC GCCTGCCAGA AGGGCCCGAA CAAGCTCAGA TCGTTCATCG AGTTGGAGAA
CTGTTCGCTT TTGCCGACAC CATCGATTCT CGCGTCGCTG CGGCAACAGG CAAGACACGG
AAGCTTCCCT CGCTCACTCT CGCCAAAGCC TTCCGCGGCG ACTTGGTTCC GCAAGATCCC
ACCGACGAGC CGGCCAGCGT CTTGCTGGCC CGTATTGCCG CCCAACGCGC AGCGCCCCCG
CATGCCGCCT CGGCAACCAC ACCGCGCCGC GGCCGCCCAC CCCGTGCCCC GAAGGAAACC
GCCGCCATGA CCAAGAGCCG CCAGGACGAC GACGTGACGG GTCAGCCCTA CCTGGCCGCG
CACCTGCACC GCATCGGCAC GCCCGCCAGC GCCGAAGCAC TGTTCAAGGT GGCCGAGTTG
CCCGTCGCCG ACTTCTACAA GCAACTCGCT TGGGAGGTGG CGCAAGGCCA CGTGAAGGAC
AACCAGACCA CGCTGGAGCC CGGGCATGCG GCTGGATAA
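
The gene sequence is printed in space-separated groups of 10 bases, so it needs light cleaning before any calculation such as the GC content reported in the record. The following is a minimal Python sketch of that cleaning step and a GC fraction calculation. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the demo uses only the first printed line of the sequence, not the full gene.

```python
def clean_sequence(text):
    """Remove spaces and line breaks from a sequence printed in 10-base groups."""
    return "".join(text.split()).upper()


def gc_fraction(seq):
    """Return the fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First printed line of the gene sequence, used only as a short demo input.
first_line = "ATGATCGATT CCGCTCCACA GGCCAGCGAT CTTCCCGCAG GCTGGGACGT CGCGTCATTC"
seq = clean_sequence(first_line)

print(len(seq))                     # number of bases on this line
print(round(gc_fraction(seq), 2))   # GC fraction of this line only
```

Passing the whole sequence block through `clean_sequence` and then `gc_fraction` would give a value to compare against the GC content field of the record.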
 
Protein sequence
MIDSAPQASD LPAGWDVASF GELNSFSGST VNPATRPDEV FELYSVPSFP TKHPEQLPGR 
AIGSTKQTVR PGDVLVCKIN PRINRVWTVG TRRDHEQIAS SEWIGFRSDA MVPRFAKHYF
SEPSFRSLLC SEVSGVGGSL TRAQPSRVAK YPVLVAPLAE QARIADQLEA LLARIQACQD
RLEAIPALLK RFRKLVLSSA LSGDLTEVWR AEQGVGLDTW SARTIADVAE VGTGSTPLRS
NSNFYAETGT PWVTSAATSR PYIDSADQYV TKAAIDAHRL RVYRPGTLII AMYGEGKTRG
QVSELRIDAT INQACAAITV DEQQANAAFV KLALLSQYEQ TRALAEGGAQ PNLNLSKVRG
IPLRLPEGPE QAQIVHRVGE LFAFADTIDS RVAAATGKTR KLPSLTLAKA FRGDLVPQDP
TDEPASVLLA RIAAQRAAPP HAASATTPRR GRPPRAPKET AAMTKSRQDD DVTGQPYLAA
HLHRIGTPAS AEALFKVAEL PVADFYKQLA WEVAQGHVKD NQTTLEPGHA AG