Gene Tmz1t_2394 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTmz1t_2394 
Symbol 
ID7094316 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThauera sp. MZ1T 
KingdomBacteria 
Replicon accessionNC_011667 
Strand
Start bp55594 
End bp58617 
Gene Length3024 bp 
Protein Length1007 aa 
Translation table11 
GC content61% 
IMG OID643701081 
Producttransposase Tn3 family protein 
Protein accessionYP_002364222 
Protein GI217980172 
COG category[L] Replication, recombination and repair 
COG ID[COG4644] Transposase and inactivated derivatives, TnpA family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones37 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones65 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGTCAC TGGAGCGCAC TGCATACCCG CAGTTTCTGA GCTACTACTC ACCGAGTGAC 
CTGCGACAGT TCTTTACTCT GCAGGATGAA GAAATCAACT GGCTGAAGTC GGCCGGCCGT
TCTGCCGGGA CCCGACTCGG ACTGGCCGTC CTGCTCAAAG TTTTCCAGCA CTTGCGCTAC
TTTCCTAGCC TCGACAAGAT ACCTTCCGAG GTGATCACCC ACGTCCGGAA TGGCCTCGGT
TTCGGTGACG CGATACGGAT CGAATACCCC GTCGAACGCA CCCTCTTTCG GCACAAAGCC
ATGGTGCTGG GCTTGCTTAA CGTCAAACCC TTCCATGGGC ACGATGCGAT GCGCCAAGCA
GAACGCTTTG CCTGCGATGC GGCTGAGCTC ATGGATCAGC GTGCCGACAT CATCAATGGG
ATAATCGAGG GGGTGCTCCA GGCGCGCTAC GAACTCCCCG CCTTCTCGAC GCTCGACGAT
TTGGCAGAGA AGGCCCACGC TGCCGTGCAG AACCGGGTGT TCGGTAAGGT ATTTCACCGT
ATCACCCCCC AACAACTCGA AGCCCTACAG GCGCTGCTGG TCACGGACAA ACTGGAACGC
CGCCAAAGCG AATACAACGA GCTCAAGAAA TCCGCGAAAC GCCCGACCCG CAAGCATCTC
GATATGCTGG TGGAGCATCT GGAATGGCTG GATTCGATCA CGGCGGGGGA TGACGTCCTT
GCAGGCCTTC CCGACACCAA GATTCGCCAC TTCGCCGCCC AGGCCATGGC CTATGACGTG
TCCGAGTTGC GCGAATGCGC CGAGACCAAG CGTTACACCC TGCTGGTGGC CCTGATCCGG
CGCATGCAGG TGCGCGCGCG GGATCAGTTG GCCGAGATGT TCCTGCGCCG CGTGGCCACG
ATTCACAAAC GCGCCAAGGA AGAGCTCGAC CAGATCCAGT TCGGGCAGCG CGGCCAGATC
GAGCGCCTGA TCGGCACGCT GGATGGCGTG CTCGCCATCC TCGACGGCGA ACCGGACAAT
GCGGCGGCCG GCGCTCAGAT ACGCGAATAC CTTGCACCCG CAGGTGGAGT CCACGGGGTC
CGCGAGACCT GTGCCCAGGT GCAGGCGACC AGCGGCAACA ACTACCTGCC GCTCGTGTGG
AAACACTTCA AGAGCCATCG CTCCATCCTG TTCCGGCTCG TGCATTTGCT CGACATCCGC
GCCACCACCC AGGATCGGAC GCTGATCGAC GCTTTGAACC TCATCAAGAC CTACCAGGAC
AAGCATCGCA TCGAGTGGAT TACGGAGAGC ATTGACCTTT CCTTCGCCTC GGACCGCTGG
CGCAAGCTCC TGCGCGAGGG CGACAGCGGC GATGGTTTCG GTCTGGGCTT AAGCCGGCGC
AACCTCGAAA TCTGTGTCTT CTCCTACCTC GCCGAGGAGT TGCGCTCTGG CGATCTCAGT
ATCGTCGGCT CGGAGGAGTT CGCCGATTAC CGCGATCAGT TGCTTTCGTG GGAGGAATGC
CAGGAGTTGT TACCCGCCTA CTGCGACAAG ATTGGCCTGC CGCAGGACGC AACGACCTTT
GCCAGCAGTT TGCGCGACTG GCTCACCGAC ACCGCGCAGC ACCTGGATGA CACGTTTCCT
GAATGCCGCG GCGATGTGGC CCTGACTGCC AGCGGAGAAC CCGTGTTGCG CAAGCCGATT
GCGCGCGAAA TCCCGCCTTC GGTTATGTCG CTGCAGAACG CGCTGACCCA GCGCATGCCG
GCGCGGCACA TTCTGGACGT GCTCGCCAAC ATCGAACACT GGATGGGCTT CACGCGGCAC
TTCGGGCCAC TATCGGGCAA CATGGCCAAG CTCAAGCAGC CGGCCGAGCG CTACCTGCTG
ACGATCTTCG CCATGGGCTG CAACCTCGGG CCCACACAGG CGGCGCGCCA TCTGGGCAAC
AGCGGCGTCA CGCCGCACAT GCTGTCCTTC GTCAATCGGC GCCACCTCTC ACTGGAGAGC
CTGGAAGCGG CCCAACGCGA GCTCAACGAG GTATATCTGC GCCTGGATCT CCCCAAGATA
TGGGGTGATG GCAAGACGGT CGCCGCCGAC GGTACGCAGT ACGATTTCTA CGACGAAAAC
CTGCTCGCGG GCTACCACTT CCGTTACCGC AAGATGGGCG CCGTCGCTTA CCGCCACGTG
GCCAACAACT ACATCGCGGT GTTCCGTCAC TTCATCCCGC CCGGCGTGTG GGAAGCGATC
TACGTGATCG AGGGGCTACT CAAGGCGGGC TTGTCGGTCG AGGCGGATAC GGTGCATGCC
GACACCCAGG GCCAGTCGGC CACCGTGTTT GCCTTCACCC ACTTGCTGGG CATCAAGCTG
ATGCCGCGGA TCCGGAACTG GAAGAATTTG ACGCTGTTCC GGCCGGACAA GACGGTGAAG
TACAAGTACA TCAATCGGCT GTTCGGCGAC AGCGTGGATT GGAATCTCAT CGAGCGGCAC
TGGCAGGATC TGATGCAGGT GGCCTTGTCG ATCTACGCCG GCAAGATTTC TTCGGCCACG
CTGCTGCGCA AGCTGGGCAG CTACAGCCGC AAGAACCGTC TGTATTTCGC GGCGCAGGAA
CTGGGCAACG TGATCCGCAC GGGCTTCCTG CTGGAGTGGA TCGGCAGTCG TGAGCTGCGC
CAGGAGATCA CGGCGAACAC CAACAAGATC GAGTCCTACA ACGGCTTTGC CAAGTGGCTG
TCCTTCGGTG GCGACGTGAT CGCCGTGAAT GAGCCGGACG AACAGCAGAA GCGCCTGCGC
TACAACGATC TCGTGGCCTC CGCCCTGATC CTGCAGAACA CCGTCGACAT GATGCGCACG
TTGGGCAACC TCCGGCATGA AGGCTGGCAG ATCACGGAAA ACGATGTCAG CTTTCTGAGC
CCCTACCAGG TCGCGCACGT CAAGCGCTTC GGCGAGTACA GCCTGAAACT CAAGCGCAAA
CCCGAAGCCT GGATCGCGGA CGACACCTTT CAACAAGTGG CCGCGTCGGT GCAGGACATG
CGGCGCGCGA CGCGCAGAGC CTGA
 
Protein sequence
MASLERTAYP QFLSYYSPSD LRQFFTLQDE EINWLKSAGR SAGTRLGLAV LLKVFQHLRY 
FPSLDKIPSE VITHVRNGLG FGDAIRIEYP VERTLFRHKA MVLGLLNVKP FHGHDAMRQA
ERFACDAAEL MDQRADIING IIEGVLQARY ELPAFSTLDD LAEKAHAAVQ NRVFGKVFHR
ITPQQLEALQ ALLVTDKLER RQSEYNELKK SAKRPTRKHL DMLVEHLEWL DSITAGDDVL
AGLPDTKIRH FAAQAMAYDV SELRECAETK RYTLLVALIR RMQVRARDQL AEMFLRRVAT
IHKRAKEELD QIQFGQRGQI ERLIGTLDGV LAILDGEPDN AAAGAQIREY LAPAGGVHGV
RETCAQVQAT SGNNYLPLVW KHFKSHRSIL FRLVHLLDIR ATTQDRTLID ALNLIKTYQD
KHRIEWITES IDLSFASDRW RKLLREGDSG DGFGLGLSRR NLEICVFSYL AEELRSGDLS
IVGSEEFADY RDQLLSWEEC QELLPAYCDK IGLPQDATTF ASSLRDWLTD TAQHLDDTFP
ECRGDVALTA SGEPVLRKPI AREIPPSVMS LQNALTQRMP ARHILDVLAN IEHWMGFTRH
FGPLSGNMAK LKQPAERYLL TIFAMGCNLG PTQAARHLGN SGVTPHMLSF VNRRHLSLES
LEAAQRELNE VYLRLDLPKI WGDGKTVAAD GTQYDFYDEN LLAGYHFRYR KMGAVAYRHV
ANNYIAVFRH FIPPGVWEAI YVIEGLLKAG LSVEADTVHA DTQGQSATVF AFTHLLGIKL
MPRIRNWKNL TLFRPDKTVK YKYINRLFGD SVDWNLIERH WQDLMQVALS IYAGKISSAT
LLRKLGSYSR KNRLYFAAQE LGNVIRTGFL LEWIGSRELR QEITANTNKI ESYNGFAKWL
SFGGDVIAVN EPDEQQKRLR YNDLVASALI LQNTVDMMRT LGNLRHEGWQ ITENDVSFLS
PYQVAHVKRF GEYSLKLKRK PEAWIADDTF QQVAASVQDM RRATRRA