Gene Tmz1t_2345 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTmz1t_2345 
Symbol 
ID7094267 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThauera sp. MZ1T 
KingdomBacteria 
Replicon accessionNC_011667 
Strand
Start bp7246 
End bp10212 
Gene Length2967 bp 
Protein Length988 aa 
Translation table11 
GC content65% 
IMG OID643701035 
Producttransposase Tn3 family protein 
Protein accessionYP_002364176 
Protein GI217980126 
COG category[L] Replication, recombination and repair 
COG ID[COG4644] Transposase and inactivated derivatives, TnpA family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones43 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones82 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCCGCC GTTCGATCCT GTCTGCCGCC GAGAGCGAAA TCCTGTTGGC GTTACCGGAA 
ACGAAGGAGG ACTTGATCCG CCACTACACG TTCAACGAAA CCGACCTCTC CATCATCCGG
CAGCGGCGCG GCCCAGCCAA TCGTCTTGGC TTCGCGGTAC AGCTCTGCTA CATGCGCTTT
CCCGGCGTGA TGCTCGCGGT AGATGTGGAG CCGTTTCCGC CGTTGCTGCG TGTGGTGGCT
GCGCAGTTGA AGGTGCCACC AGAGGCGTGG GCGGACTATG GCCAGCGCGC CGAGACACGC
CGCGAGCACC TGCTGGAACT GCAATCGGTC TTTGGTTTCC AGACCTTTGC AACGCGGCAC
TATCGCCCCA GCGTGCACGC CCTGGATGAA CTGGCTTGGC AGACAGACAA AGGTTTCGTG
TTGGCGACGG AGTTGGTCGA AGGGCTGCGA CGAAAGAGCG TGTTGCTCCC GTCACCAGGC
GTCATCGAGC GCATCTGCGC CGAGGCGATT ACCCGTGCCA ACCGGCGCAT CTACGACACG
CTGTCCGCGC CGCTGACAGA CACCCACCGG CATCGCCTCG ACGAGCTACT GAAACGCCGA
TACGACGGCA AGACGACCTG GCTGGCCTGG CTGCGCCAGT CACCCGCCAA GCCGAATTCG
CGGCACATGC TCGAACACAT CGAACGCCTC AAGGCGTGGC AGGCACTCGA CTTGCCTTCC
GGCATCGAGC GGCTGGTCCA CCAGAACCGC CTGCTCAAGA TTGCCCGCGA AGGCGGCCAG
ATGACGCCTG CCGACCTGGC GAAGTTCGAG CCGCAGCGTC GTTACGCCAC CCTGGTGGCG
CTCGCCATCG AGGGCATGGC CACCGTCACC GACGAAATCA TCGACCTGCA CGATCGCATC
CTGGGCAAGC TGTTCAATGC CGCCAGGAAC AAGCATCAAC AGCAGTTCCA GGCAGCCGGC
AAGGCGATCA ACGCCAAGGT GCGCCTGTTC GGCCGCATCG GCCGGGCGCT GATCGAAGCC
AAGCAGGCGG GCAGCGACCC GTTCGCCGCC ATCGAGGCGG TCATGTCGTG GGACGCCTTT
ACCGAGAGCG TCACCGAGGC GCAGCGGCTC GCGCAGCCCG AGGACTTCGA TTTCCTGCAT
CGCATCGGCG AGCACTACGC AACGTTGCGC CGCTACGCGC CGGAATTCCT CGCCGTGCTC
AAGCTGCGGG CCGCGCCTGC CGCCAAGGAC GTGCTCGACG CCATCGAGGT GCTACGCGGC
ATGAACAGCG ACAACGCCCG CAAGGTGCCC GCCGATGCGC CGACCGACTT CATCAAGCCG
CGCTGGCAGA AGCTGGTGAT GACCGACACC GGCATCGACC GGCGCTACTA CGAACTGTGC
GCACTGTCAG AACTGAAGAA CGCGCTGCGC TCGGGCGACA TCTGGGTGCA AGGCTCGCGC
CAGTTCAAGG ACTTCGAGGA CTACCTGGTC CCGCCCACGA AGTTCGCCAG CCTCAAGCAG
GCCCGCGAAT TGCCGCTGGC CGTGGCCACC GACTGCGACC AATACCTGCA CGACCGGCTG
ACGCTGCTGG AAACCCAGCT CGCCACCGTC AACCGGATGG CGCTGGCCAA CGAACTGCCG
GACGCCATCA TCACGGGGTC GGGCCTGAAG ATCACGCCTC TGGATGCGGC GGTGCCCGAC
ACCGCGCAGG CGCTGATCGA CCAGACGGCG ATGATCCTCC CGCACGTCAA GATCACCGAA
CTGCTGCTGG AAGTGGACCA ATGGACGGGC TTCACCCGGC ACTTCACGCA CCTCAAGTCG
GGCGACCTGG CCAAAGACAG GAACCTGCTG CTGACCACGA TCCTGGCCGA CGCAATCAAC
CTCGGCCTGA CCAAGATGGC GGAATCCTGC CCGGGCACGA CCTACGCCAA GCTGGCCTGG
CTGCAGGCCT GGCACATCCG CGATGAAACC TACTCGACGG CGCTGGCTGA ACTGGTCAAT
GCGCAGTTCC GGCATCCTTT CGCCGAGCAC TGGGGTGACG GCACCACGTC CTCGTCGGAC
GGCCAGAACT TCCGCACCGG CAGCAAGGCC GAAAGTACCG GCCACATCAA CCCGAAATAC
GGCAGCAGTC CCGGGCGGAC GTTCTACACC CACATCTCCG ACCAGTACAC GCCGTTCAAC
ACCAAGGTCG TCAACGTCGG CGTGCGCGAC TCGACCTACG TGCTCGACGG CCTGCTGTAC
CACGAGTCCG ACCTTCGCAT CGAAGAGCAC TACACCGATA CGGCGGGCTT CACCGATCAC
GTGTTTGCGC TGATGCACCT GCTGGGCTTT CGCTTCGCGC CGCGCATCCG CGATCTGGGC
GACACCAAGC TCTACATCCC GAAGGGCGAT GCTGGCTATG AGGCGCTGAA GGCCATGATC
GGCGGCACGG TCAACATCAA GCATATCCGT GCCCATTGGG ACGAAATCCT ACGGCTGGCC
ACCTCGATCA AGCAGGGTAC GGTGACGGCC TCGCTGATGC TCAGGAAGCT TGGGAGCTAC
CCACGCCAGA ACGGCCTTGC CGTCGCCCTG CGCGAGCTCG GCCGCATCGA GCGCACGCTG
TTCATTCTGG ACTGGCTGCA AAGCGTTGAG CTGCGCCGCC GCGTGCATGC CGGGCTGAAC
AAGGGCGAGG CCCGAAACGC GCTGGCCCGC GCCGTGTTCT TCAACCGGCT GGGCGAAATC
CGCGACCGCA GTTTCGAGCA GCAACGCTAC CGAGCCAGCG GCCTCAACCT GGTGACGGCG
GCCATCGTGT TGTGGAATAC GGTCTATCTG GAGCGGGCGG CGAACGCCCT GCGTGGCCAT
GGCCACGCTG TCGATGACGC CCTGTTGCAG TACCTGTCGC CGCTGGGCTG GGAGCACATC
AACCTGACCG GGGATTACCT CTGGCGTAGC AGCGTCAAGA TCGGCGCGGG CAAGTTCAGG
CCGCTGCGGC CGCTGCAACC GGCTTAG
 
Protein sequence
MPRRSILSAA ESEILLALPE TKEDLIRHYT FNETDLSIIR QRRGPANRLG FAVQLCYMRF 
PGVMLAVDVE PFPPLLRVVA AQLKVPPEAW ADYGQRAETR REHLLELQSV FGFQTFATRH
YRPSVHALDE LAWQTDKGFV LATELVEGLR RKSVLLPSPG VIERICAEAI TRANRRIYDT
LSAPLTDTHR HRLDELLKRR YDGKTTWLAW LRQSPAKPNS RHMLEHIERL KAWQALDLPS
GIERLVHQNR LLKIAREGGQ MTPADLAKFE PQRRYATLVA LAIEGMATVT DEIIDLHDRI
LGKLFNAARN KHQQQFQAAG KAINAKVRLF GRIGRALIEA KQAGSDPFAA IEAVMSWDAF
TESVTEAQRL AQPEDFDFLH RIGEHYATLR RYAPEFLAVL KLRAAPAAKD VLDAIEVLRG
MNSDNARKVP ADAPTDFIKP RWQKLVMTDT GIDRRYYELC ALSELKNALR SGDIWVQGSR
QFKDFEDYLV PPTKFASLKQ ARELPLAVAT DCDQYLHDRL TLLETQLATV NRMALANELP
DAIITGSGLK ITPLDAAVPD TAQALIDQTA MILPHVKITE LLLEVDQWTG FTRHFTHLKS
GDLAKDRNLL LTTILADAIN LGLTKMAESC PGTTYAKLAW LQAWHIRDET YSTALAELVN
AQFRHPFAEH WGDGTTSSSD GQNFRTGSKA ESTGHINPKY GSSPGRTFYT HISDQYTPFN
TKVVNVGVRD STYVLDGLLY HESDLRIEEH YTDTAGFTDH VFALMHLLGF RFAPRIRDLG
DTKLYIPKGD AGYEALKAMI GGTVNIKHIR AHWDEILRLA TSIKQGTVTA SLMLRKLGSY
PRQNGLAVAL RELGRIERTL FILDWLQSVE LRRRVHAGLN KGEARNALAR AVFFNRLGEI
RDRSFEQQRY RASGLNLVTA AIVLWNTVYL ERAANALRGH GHAVDDALLQ YLSPLGWEHI
NLTGDYLWRS SVKIGAGKFR PLRPLQPA