Gene Tmz1t_1616 details


Gene Information

Locus tag: Tmz1t_1616
Symbol:
ID: 7084826
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thauera sp. MZ1T
Kingdom: Bacteria
Replicon accession: NC_011662
Strand:
Start bp: 1811544
End bp: 1813289
Gene Length: 1746 bp
Protein Length: 581 aa
Translation table: 11
GC content: 68%
IMG OID: 643698636
Product: transposase IS66
Protein accession: YP_002355267
Protein GI: 217970033
COG category: [L] Replication, recombination and repair
COG ID: [COG3436] Transposase and inactivated derivatives
TIGRFAM ID:
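
The coordinates and lengths in this table are mutually consistent, which a short Python sketch can confirm. It assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative only and are not part of any database tooling.

```python
# Values copied from the Gene Information table.
START_BP = 1811544
END_BP = 1813289


def gene_length(start: int, end: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_len: int) -> int:
    """Residue count, assuming one terminal stop codon is excluded."""
    if cds_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_len // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # prints: 1746 581
```

Under these assumptions the result matches the listed Gene Length (1746 bp) and Protein Length (581 aa).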


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.772385
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCGCAAC GTACCGCCGC CCCTCGCTCG AACCCTTCCG TTGGCGTGTG CGCATCCGTC 
CTGGCCGAGC TGTTGCCCGA CGATCCAGCA ACGCTCAAAG CCTTGTTGCT GGCACAGCAG
CGTGCCTTCG AGACGCGTGA AGCCGAACGG CAGGCAGCCT TCGAGGCCCG TGAGGCCGCA
CTCCAGAAAG TCTTCGATGC GCGGGAGGCT GAACGCCAAA GAGCCTTCGA TGCGCGGGAG
GCCGAACTGC AAAAGGCCTT CGAGGCACGC ATCCTCGAGC TCTACGAGCA GCTTCGCCTG
GCGCGTCGGC GCATGTTCGG GCCCAGCAGC GAATCGCACG CGGGCCAGGC CTGGCTCTTC
GACGAGGCCG AGGCGCTGGC CGAGTCCGCA CCCGAGGCGC TCGACACCGC AACCTTGCCG
CCGCCGGCCA CCGAGACGAC GGGTGAGGCG TCCGCCGACA CCGGCAAGAA GAAGGCGCGC
GGCAAGCGCA AGCCCTTGCC CATCGAGCTG CCGCGCATCG ACGTCGTCCA TGACGTCCCC
GAGGCCGAGC GCACCTGCGC CTGCGGCACG CCCATGGTCG AGATCGGCCA GGACGTCAGC
GAACAGCTCG ACATCGTCCC GATGCAGGTG CGTGTGCTGC GCCATATCCG CAAGCGCTAC
GGCTGCCCCG AGGGCGACCA GGCGCCGGTC ACCGCCCGCG CCCCGGCGCA GGTGCTGCCC
AAGAGCAACG CCAGCAACGA CCTGCTGGCC TTGCTGATCG TCATCAAGTA CGTCGATGGG
CTGCCGCTGG CGCGTTTCGA GTACGTGCTC GCTCGCGCAG GCGTGCTTGT GCCGCGCCAG
ACCCTGGCGC GCTGGGTGAT CGGTACCGCC CAGGCGCTGC AGCCGCTCGC CAACCTGATG
CGCGACGTGC TGCTCGGGCA CGACGTCATC CACATGGACG AAACCCCGGT GCAGGTGCTC
AAGGAGCCTG GCCGGGCAGC CACGAGCAAG AGCCAGATGT GGGTGCAGCG CGGCGGACCG
CCGGGCAAGC CGGTGGTCCT CTTCGAGTAC GATCCGAGCC GCGCGCAGGC GGTGCCCTTA
CGCCTGCTCG AAGGCTGGAA GGGGCATCTG ATGGCCGACG GGCTCGAGAG CTACGGCGCA
ATTGCCTTCA CCGAAGGGGT GACCCGGCTC GGTTGCTGGG TGCACGCGCG ACGTCGTTTC
GTCGATGCCA GCAAGGTGCT GCCTGCCGGC AAGCGCGGCC GCGCCCACGA AGCGCTGGCC
CTGATCGGCA AGCTCTACGC CATCGAGAAG GACGCGCGCG AACTGAACGA CGCCCAGCGC
CTGGCGCTAC GCCAGAGCAG AAGCCGCGCC GTCATCGACG AACTGCGCCG TTGGCTCGAC
CAAGTGCTCC CCACCGTGCC GCCCACCTCG GTGCTCGGGG GTGCCCTGGG CTACCTGCAT
CGGCAGTGGC CGCGTCTGAC GCGCTACCTC GAGCGCGGCG ATCTGCCGAT CGACAACAAC
CCCGCCGAAA ACGCCATCCG TCCCTTCGTG GTCGGGAGAA AGGCATGGCT CTTCTCGGAC
ACTCAGGCCG GTGCGCGTGC CAGCGCACTC CTCTACTCGC TGGTCGAAAC CGCCAAGGCC
AACGGCTTCG AGCCGTATCT GTGGCTGCGC CACGTGCTGC GCGCCTTGCC CACCGCGACC
AACGTCGAAC ATTTCGAGGC CCTCCTGCCC TGGAATCTCA AGGCTGAGCA GTTGATCACG
GCGTAA
 
Protein sequence
MPQRTAAPRS NPSVGVCASV LAELLPDDPA TLKALLLAQQ RAFETREAER QAAFEAREAA 
LQKVFDAREA ERQRAFDARE AELQKAFEAR ILELYEQLRL ARRRMFGPSS ESHAGQAWLF
DEAEALAESA PEALDTATLP PPATETTGEA SADTGKKKAR GKRKPLPIEL PRIDVVHDVP
EAERTCACGT PMVEIGQDVS EQLDIVPMQV RVLRHIRKRY GCPEGDQAPV TARAPAQVLP
KSNASNDLLA LLIVIKYVDG LPLARFEYVL ARAGVLVPRQ TLARWVIGTA QALQPLANLM
RDVLLGHDVI HMDETPVQVL KEPGRAATSK SQMWVQRGGP PGKPVVLFEY DPSRAQAVPL
RLLEGWKGHL MADGLESYGA IAFTEGVTRL GCWVHARRRF VDASKVLPAG KRGRAHEALA
LIGKLYAIEK DARELNDAQR LALRQSRSRA VIDELRRWLD QVLPTVPPTS VLGGALGYLH
RQWPRLTRYL ERGDLPIDNN PAENAIRPFV VGRKAWLFSD TQAGARASAL LYSLVETAKA
NGFEPYLWLR HVLRALPTAT NVEHFEALLP WNLKAEQLIT A
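
The protein sequence can be reproduced from the gene sequence by removing the block spacing and translating codon by codon. The sketch below is a minimal pure-Python illustration, not the database's own method. It assumes the amino acid assignments of translation table 11 are the same as in the standard code (the two tables differ in which start codons are permitted), and the helper names `clean`, `gc_fraction`, and `translate` are hypothetical. Only the first 30 bases of the gene are used here; the full sequence can be pasted in the same way.

```python
BASES = "TCAG"
# Amino acids for all 64 codons, ordered by first, second, third base in TCAG order.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - 2, 3):
        b1, b2, b3 = (BASES.index(base) for base in s[i:i + 3])
        aa = AMINO_ACIDS[16 * b1 + 4 * b2 + b3]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three 10-base blocks of the gene sequence above.
fragment = "ATGCCGCAAC GTACCGCCGC CCCTCGCTCG"
print(translate(fragment))  # prints: MPQRTAAPRS
print(round(gc_fraction(fragment), 2))
```

The translated fragment corresponds to the first ten residues of the protein sequence listed above. Running `gc_fraction` on the full gene sequence gives a value that can be compared with the GC content reported in the Gene Information table.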