Gene Tmz1t_4070 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTmz1t_4070 
Symbol 
ID7873297 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThauera sp. MZ1T 
KingdomBacteria 
Replicon accessionNC_011662 
Strand
Start bp4469301 
End bp4470830 
Gene Length1530 bp 
Protein Length509 aa 
Translation table11 
GC content71% 
IMG OID643701001 
ProductHNH endonuclease 
Protein accessionYP_002891024 
Protein GI237654710 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGTCCGCAC CTCCCTCTCC TGCGTTCGCG CGTGCCTGGC GCCTGTCCGC CATCCTGGTG 
CTCGGCTTCG CCTCCGGCCT GCCGTTGGCG CTGACCGGCC AGGCCATGCA GGCCTGGCTG
ACAGTCGACG GCGTGGACCT CGCCACGATC GGTTTTTTCG GCCTGGTCGG CGTCCCGTAT
ACCTTCAAGT TCCTGTGGGC GCCGCTGATG GACCGCTTCG AGCCGCCCTG GCTGGGGCGC
CGGCGCGGCT GGCTGGCGCT GACGCAGCTC GCGCTGGCGG CGCTGCTGTG GTGGATGGCG
AGCCTGTCGC CGACGGCCAC GCCGGGCCTG TTCGCCATCG CGGCGGTGGC GATCGCCTTC
CTCTCGGCCT CGCAGGACGT TGTGGTGGAC GCCTACCGCA CCGACCTGCT GCCCGAGGCC
GAGCGCGGGC TGGGCGCCTC TGTGCACGTC TTCGCCTACC GCCTGGCGAT GATCCTGTCC
GGCGGCATCG CGCTGATCTG GGCCGGGCAG TGGGCGTCGT GGCCGCGGGT ATACGAGACC
ATGGCGCTGA TCATGGCGGC CTGCGCGGTG GTGTCGCTGC TGGCGCTGCC GCGCGTGTCG
GCGGCGCTGA AGCCGCTCGA TTCCGACCCC AGGCGCGAGC TGCTCGGCTT CGCCGCGATG
CTCGCCGGGG TCGCCGCCGG CTACTGGAGC GCCCGCCAGG CGCTGATCCT GCTCGGGCTC
GACCCGAACG ACGCCAACCG CTGGATCCAG CTCCTGTTCG TGATGGCGGA GATCGCGCTC
GCGCTGCCGC TGGCGGGCTG GGCGGCGCGT CGTGCCGGCT TCGAGACGCT CAACCGCTCG
CTGTCGAGCT ACTTCGCGCA GCAGGGCGCG GCGGCCTTCC TGGCGCTGAT CATCCTCTAC
AAGCTCGGCG ACGCCTTCGC CGGCAGCCTG ACCACGCCCT TCCTGATCAA GGGCATGGGC
TTCTCGCAGG AAGAGGTCGG CATCGCCAAC AAGGTGATCG GCATCTGGCT GACCATCCTC
GGCGCCTTCA TCGGCGGGCT GATCATGACG CGGCTGGCGC TGTACCGCTC GCTGCTGCTG
TTCGGCGTGC TGCAGCTGGT GTCCAACTTC GGCTTCTACC TGCTCGCAGA GCTCGGCAAG
GGCGCCTGGG GCGCAGTCAT GGTGCCGGCC TTCGACTGGG GCTTCGTGGC GATCGACACG
CCGGCCGCGC TCGACTGGCT GCTGCTGACC GTGATCGCCG GCGAGAACAT CAGCGGCGGC
ATGGGCACGG TGGCCTTCGT CGCGCTGCTG ATGGGGCTGT GCAACCAGCG CTTCACGGCG
ACCCACTACG CCATGCTGTC GGCCTTCGCC GCAGTGGGGC GGATCTACGT CAGCCCGCTG
TCGGGCGTGC TGTCGCAGAG CATCGGCTGG CCGGCCTTCT TCCTCTTCTC GATCGTGGTC
GCCGTACCGG GCGTGGTGAT GGTGTGGTGG CTGCGCGACG CGCTCGCGCG CCTCGGCCGC
CCGCAGACCG ACGGCATGGT GGACGACTGA
 
Protein sequence
MSAPPSPAFA RAWRLSAILV LGFASGLPLA LTGQAMQAWL TVDGVDLATI GFFGLVGVPY 
TFKFLWAPLM DRFEPPWLGR RRGWLALTQL ALAALLWWMA SLSPTATPGL FAIAAVAIAF
LSASQDVVVD AYRTDLLPEA ERGLGASVHV FAYRLAMILS GGIALIWAGQ WASWPRVYET
MALIMAACAV VSLLALPRVS AALKPLDSDP RRELLGFAAM LAGVAAGYWS ARQALILLGL
DPNDANRWIQ LLFVMAEIAL ALPLAGWAAR RAGFETLNRS LSSYFAQQGA AAFLALIILY
KLGDAFAGSL TTPFLIKGMG FSQEEVGIAN KVIGIWLTIL GAFIGGLIMT RLALYRSLLL
FGVLQLVSNF GFYLLAELGK GAWGAVMVPA FDWGFVAIDT PAALDWLLLT VIAGENISGG
MGTVAFVALL MGLCNQRFTA THYAMLSAFA AVGRIYVSPL SGVLSQSIGW PAFFLFSIVV
AVPGVVMVWW LRDALARLGR PQTDGMVDD