Gene Tmz1t_0333 details

Gene Information

Locus tag: Tmz1t_0333
Symbol:
ID: 7085634
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thauera sp. MZ1T
Kingdom: Bacteria
Replicon accession: NC_011662
Strand:
Start bp: 378644
End bp: 379969
Gene Length: 1326 bp
Protein Length: 441 aa
Translation table: 11
GC content: 69%
IMG OID: 643697370
Product: transposase IS204/IS1001/IS1096/IS1165 family protein
Protein accession: YP_002354018
Protein GI: 217968784
COG category: [L] Replication, recombination and repair
COG ID: [COG3464] Transposase and inactivated derivatives
TIGRFAM ID:
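
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a stop codon; the function names are hypothetical and not part of any database API.

```python
# Values copied from the record above.
START_BP = 378644
END_BP = 379969
GENE_LENGTH_BP = 1326
PROTEIN_LENGTH_AA = 441


def span_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive coordinate range (assumed convention)."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # subtract the stop codon


if __name__ == "__main__":
    # Both comparisons hold for the values listed in this record.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under those assumptions, 379969 - 378644 + 1 gives 1326 bp, and 1326 / 3 - 1 gives 441 aa, which agrees with the listed values.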


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGCTCTC CGATCGAAGC CCTCTTCACC ACCGCGCTCG GCCTGCAGCC GCCCTGGTAC 
GTCGCCAAGG TGGACCTCGA CACCGCGAAG CGGCGGATCG ACTTCGAAGT CGAGCACGCC
GGCAAGCGCG TGCCCTGTCC GGCCTGTGGG GCGGCGCATC AGCCGGTCCA CGATCGAGTG
CGGCGCAGCT GGCGTCACCT GGACTTCTTT CAGTTCGAGG CGTGGCTCCA TGCCGACATC
CCGCGCGTGC AGTGCTCGGG CTGCGGCAAG ACCACGCAGC TGCCGGTGCC GTGGGCTCGC
GAGGGCAGCG GTTTCACGCT GCTGTTCGAG GCGCTGGGCC TGTCCCTGTG CAGCGAGTTG
CCCGTGCGCC AGGCCGCCGC CCAGATGCGC GTCGCGCCCA AGCGGCTGTG GGGGCGGATC
CGCCATTACG TTCACGGTGC ACGCGCTCGG GATGACATGT CGGGCGTGCG CTACGTCGGC
ATCGACGAGA CCAGCGTCAA GCGCGGGCAC GCGTACATCA CCGTGGTGCA TGACCTGGAG
GCCAAGCGCC TGCTGTTCGC CACGCCCGGG CGAGACCACG CGACCCTGCA GGCCTTTGCC
CAGGACCTGC GCGCGCACGG TGGCGAGCCG GAGCGGATCG AGCACGCCTG CATCGACATG
AGCGCGGCCT ACGCCAAGGG GATTGCCCAG GCGCTGCCCA CGGCGCAGGT CAGCTACGAC
CGTTTCCACG TCGTGGCCCT GGCCAATACG GCGATGGACG AGGTTCGCCG CGAGGAGATG
CGCAGCGCCG CAGCCGCGGT CCGCGCGGCG GCCGGTACGG GAAACAAGAA GACGCTGCGC
CAGCTGTTGT GGGCGATGCG CAAGAACCCG CCGCAATGGA CGCCGGCACA GTGCGACGCG
ATGAACTGGC TGCAGCGCTC GGGCCTCAAG AGTGCGCGGG CGTGGCGGAT GAAGCAGGGC
CTGCGGCTCG TCTACCGCGA GGCGGCGGCG AGCAACTGCG AAGAGGTCGC CCGCGGGGCC
TTGATGAAGT GGATCAGTTG GGCCCGACGC TCTCGCCTGG AACCCTTCAA GCGGCTCGGC
GCCACGGTCA AGGCGCATCT GGGCGGCGTG CTCCGCGGCA TGCTCGACGG GCGCAGCAAC
GCCTACGTCG AGGCGATGAA CGGGCAGCTT CAGCAGACGA AGACCGCCGC CCGAGGCTTC
CGCAACCTCG ACAATTTCAT CGCCGTCGCC TACCTGCGCA TGTCCAAGCT CGAGCATCTA
CCGAAGAACC CCATGGTGCC GGCGATCCCC CGCGAATACG GGCGCTACCG TCATGTTTGT
TGTTGA
 
Protein sequence
MSSPIEALFT TALGLQPPWY VAKVDLDTAK RRIDFEVEHA GKRVPCPACG AAHQPVHDRV 
RRSWRHLDFF QFEAWLHADI PRVQCSGCGK TTQLPVPWAR EGSGFTLLFE ALGLSLCSEL
PVRQAAAQMR VAPKRLWGRI RHYVHGARAR DDMSGVRYVG IDETSVKRGH AYITVVHDLE
AKRLLFATPG RDHATLQAFA QDLRAHGGEP ERIEHACIDM SAAYAKGIAQ ALPTAQVSYD
RFHVVALANT AMDEVRREEM RSAAAAVRAA AGTGNKKTLR QLLWAMRKNP PQWTPAQCDA
MNWLQRSGLK SARAWRMKQG LRLVYREAAA SNCEEVARGA LMKWISWARR SRLEPFKRLG
ATVKAHLGGV LRGMLDGRSN AYVEAMNGQL QQTKTAARGF RNLDNFIAVA YLRMSKLEHL
PKNPMVPAIP REYGRYRHVC C
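
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below is illustrative only: it uses the amino acid assignments shared by NCBI translation tables 1 and 11, strips the 10-base spacing used in this record, and stops at the first stop codon. The helper names (`clean`, `translate`) are hypothetical. Table 11 also permits alternative start codons, which this sketch does not handle; that does not matter here because the gene starts with ATG.

```python
from itertools import product

# Amino acid assignments for NCBI tables 1 and 11, in TCAG codon order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in the record."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    # First line of the gene sequence in this record (60 bases).
    first_line = (
        "ATGAGCTCTC CGATCGAAGC CCTCTTCACC "
        "ACCGCGCTCG GCCTGCAGCC GCCCTGGTAC"
    )
    print(translate(first_line))  # MSSPIEALFTTALGLQPPWY
```

The 20 residues printed for the first 60 bases match the first two blocks of the protein sequence listed above (MSSPIEALFT TALGLQPPWY).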