Gene Tmz1t_2373 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTmz1t_2373 
Symbol 
ID7094295 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThauera sp. MZ1T 
KingdomBacteria 
Replicon accessionNC_011667 
Strand
Start bp35503 
End bp36504 
Gene Length1002 bp 
Protein Length333 aa 
Translation table11 
GC content55% 
IMG OID643701061 
Productrestriction endonuclease-like protein 
Protein accessionYP_002364202 
Protein GI217980152 
COG category[V] Defense mechanisms 
COG ID[COG3440] Predicted restriction endonuclease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones69 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones35 
Fosmid unclonability p-value0.0000067766 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGTGCTTG CTCAGTCAGA TCTCGCTACG CATATGACAA TAATCTGCCT AACACCTCAG 
CCAGACTGGG ATGCTCCGAT CTTCAAGATT CTGGCGAACA ACGACACCGG GAGTGCTCCG
GGGCATCAGG GCGGGATTGT TATCCCGAAG GATCTGCGTT CTTTCTTTCC AGGGCTTGTG
GACAACACGT CGCACTACCG GCCAACGGTT GATCAACGTA TTGATGCCCA GCTTTTTGAC
GGAGACAAAT TTCTAGCGAC AGTGAACACT CGCTACCAGT ATCAGACATG GGGCGGCGCG
CGCAGTCCAG AGTCGCGTTT GACGGATCAG CTCTCTACTC TCAGAAATCG CGCAAGCGGT
GGCGACATCC TACTCATCCA GCGGAATATC AGCACTCTCG ATCAGTACCG CCTCGTACTA
GTACGTCAGT CGAGCCCTGA TTTTGCGCTG GTCATGCGCC TTGCGGCGGG GAGGCGCTGG
GGTGTGCTTT ACCCAGAACG AGTTCCCCTT GCAGACGATG ATCTAACAGA TGCGTTTAAG
GAAGAACTCG AGCGTGAGGG CAAGCCTTTC AAACTCATTG ACGACGAAGC TGGCACAACA
ACCGTAACTG TGAAGAAGGT TGCCCGGGCG CTTGCGTTTA GGACGATCGT TATTGCGCTT
TACGACGAAC GCTGCGCAGT TTGTGGAGAG GGGCTGAAGT CACCTGCTGG GGCTACTGAG
GTTGAAGCTG CTCATGTTGT CCCCCGCTCT CAGTTCGGTG CGGATGATGC CCGAAACGGT
GTTTCACTGT GTAAGGCACA TCACTGGGCG TTCGATAGAG GCCTGTTCGG CGTAGGGGAT
GATCGCACTG TCGTTGTTCC AAGCTCAGTA CGGTCACTTG TGCAGAACAA AAGCATCTCC
ACGTTTTTGG GTAGGCGAAT CAGGGAGGCC AGTGACCCTA GGCGCTCTGT TCATCCTGAT
GCTTTTGCGT GGCACCGCAA GAATCTGCTC CTCAGCGGCT AG
 
Protein sequence
MVLAQSDLAT HMTIICLTPQ PDWDAPIFKI LANNDTGSAP GHQGGIVIPK DLRSFFPGLV 
DNTSHYRPTV DQRIDAQLFD GDKFLATVNT RYQYQTWGGA RSPESRLTDQ LSTLRNRASG
GDILLIQRNI STLDQYRLVL VRQSSPDFAL VMRLAAGRRW GVLYPERVPL ADDDLTDAFK
EELEREGKPF KLIDDEAGTT TVTVKKVARA LAFRTIVIAL YDERCAVCGE GLKSPAGATE
VEAAHVVPRS QFGADDARNG VSLCKAHHWA FDRGLFGVGD DRTVVVPSSV RSLVQNKSIS
TFLGRRIREA SDPRRSVHPD AFAWHRKNLL LSG