Gene Tmz1t_0487 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTmz1t_0487 
Symbol 
ID7084998 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThauera sp. MZ1T 
KingdomBacteria 
Replicon accessionNC_011662 
Strand
Start bp548091 
End bp549452 
Gene Length1362 bp 
Protein Length453 aa 
Translation table11 
GC content68% 
IMG OID643697516 
Producttransposase IS4 family protein 
Protein accessionYP_002354158 
Protein GI217968924 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCGCGCT TCGAAGTCAA GCAATCCGCC AAGCTCAATC TGACGTCCTA CTCGGGCCTG 
GCGTTGATCG GGCAGTGCTG CCAGGCGGCA CAGGTCGAGG CGGTGATCGA CCCGAGGCTG
CCGGTGTCGC AGGGTATGCG GAGCTCGGAC CTGGTCAAGT CGGTGGTGGG GCTGCTGAGC
CTGGGCAAAA GCGACTTCGA GGCGATCGAG CCGTTTCGCG GCGACCGCTT CTTCAAGGAA
GCGCTCGGGC TCGCCAAGGT GCCCGGCAGC GTGTGGATGC GCCAACGGCT CGATGCCCGC
GCGGCCGAGC TGCGCGAGCT GACCGACGAG CTGAGCCTGC GCCTGCTCGA GCGCACCGAG
GCGCCGATCA CGGCGCACAA GGGCTACGTC TGCGCCGATC TGGACACCTT CGTGATGGAC
AACTCCGACA CCAAGAAGGA GGCGGTCAGC CGCACTTACC AGGGCGTCGA TGGCTACACG
CCGATCGCGC TGTACCTGGG CAACGAGGGC TGGAACCTCG GCCTGGAGCT GCGCGCGGGG
TCGCACCACT CGGCGCTGGA GACCGAGTAT TTCTTCGAGC GCGCGTTCCC GCGCCTGCGC
CGGGTGTGTG CGGCCGATGC GAAGCTGCTG TGGCGGGCCG ACAGCGGCTT CGACAGCGCC
CGGCTGCTGT TCGCGCTGGC CGACGAGCGC GATCGCTGGG CAGCGCTGGG GCGTTCGTTC
GACTACCTCA CCAAGTGGAA TCCGCGCCGT CAGGACAAGA CCGCCTGGGT GGACCGGGCC
GAGGCCGCCG GCGTCTTCGA GGAAGTGCGC GCGGGCAAGC GGGTGGGGCT GCTGGACCTG
AAGATCGACC GTGCCTGGAA GAAGGCCAAG CGCACGCTGC GTCTGGTGGT GCGGGTGACC
GAGCGCACGA TCGACAAGAA GGGCCAGCAC CTGCTGACCC CCGAGATCGA GATCGAAGGC
TGGTGGACCA GCCTCGAAGT GGCGATGGCT GACGTGATCG AGCTCTACAA GCACCACGGC
ACGCACGAGC AGTTCCACTC CGAGATCAAG ACCGACCTGG ACCTCGAGCG CCTGCCCTCG
GGCAAGTTCG ACACCAACGA CGCGGTCATG CATCTGGCCG CGTTCGCCTA CAACTGCCTG
CGCCTGATCG GCCAACTCGG GCTGACCGGC GAGCTCTCGC CGATCCGTCA CCCGGCCAAG
CGCCGACGCA TCAAGACAGT GCTGCAGGAG GTGATGTACC GTGCGGCGAA GTTCGTCGAA
CACGCCCGCC GCCTGGTGCT GGACTTCGGA CGCGGCGTCG CCGCGCATGT GAAGGTGTTC
ACCACGGTGC AGGCGCGACT GTGCGCGGTG GCTTCGCCGT GA
 
Protein sequence
MPRFEVKQSA KLNLTSYSGL ALIGQCCQAA QVEAVIDPRL PVSQGMRSSD LVKSVVGLLS 
LGKSDFEAIE PFRGDRFFKE ALGLAKVPGS VWMRQRLDAR AAELRELTDE LSLRLLERTE
APITAHKGYV CADLDTFVMD NSDTKKEAVS RTYQGVDGYT PIALYLGNEG WNLGLELRAG
SHHSALETEY FFERAFPRLR RVCAADAKLL WRADSGFDSA RLLFALADER DRWAALGRSF
DYLTKWNPRR QDKTAWVDRA EAAGVFEEVR AGKRVGLLDL KIDRAWKKAK RTLRLVVRVT
ERTIDKKGQH LLTPEIEIEG WWTSLEVAMA DVIELYKHHG THEQFHSEIK TDLDLERLPS
GKFDTNDAVM HLAAFAYNCL RLIGQLGLTG ELSPIRHPAK RRRIKTVLQE VMYRAAKFVE
HARRLVLDFG RGVAAHVKVF TTVQARLCAV ASP