Gene Tmz1t_0348 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTmz1t_0348 
Symbol 
ID7085649 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThauera sp. MZ1T 
KingdomBacteria 
Replicon accessionNC_011662 
Strand
Start bp392860 
End bp394560 
Gene Length1701 bp 
Protein Length566 aa 
Translation table11 
GC content66% 
IMG OID643697381 
Producttransposase IS4 family protein 
Protein accessionYP_002354029 
Protein GI217968795 
COG category[L] Replication, recombination and repair 
COG ID[COG5421] Transposase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0178461 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCACGTGA AGCTCACCAC CTCCGGAGGT CGCCGCTACG TCCAACTCGT CGAGTCCTAT 
CGCGACGAGG CCGGGCGAGT GAAGAAGCGC ACCGTCGCCA CGCTCGGGCG TGCCGAGCAG
GTCGATGGTT CGCTTGACGC GGTGATCAAC GGGCTGCTGA AGATCACCGG CCGCGAGCCG
ATGGGTGCGA AGCCGCCGGC GCCGACGGTG TCGTTCGAAT CCGCGCGGGC ACTCGGTGAC
GTGTGGGCGC TGACCGAGCT GTGGAACTCG CTGGGCTTCT CGGGGCTGCG TCGGGTGTTT
AGCCGCACCC GCCACACCAC GGACGTGGAG GCCCTGATTC GCCTGATGGT GCTCAACCGT
CTGTGCGACC CCGAATCCAA GCTCGGCGTG CTGCGCTGGG TGCACACGGT GGCGCTGCCC
GACTTCAGGC CGAAGGCGGT GACGCACCAG CAGTTGCTGC GCAGCCTCGA TGCGCTCATG
GATCACCAGG ACGAGGTCGA TGCGGTGGTC GCCGGGCTGC TGCGGCCACT GATCGATCAG
GACTTGTCGG TGGTGTTCTA CGACCTCACC ACGATTCGCA GCGAAGGGCT CAGCCAGATG
ACGGGCGATG TGCGCCAGTT CGGCATGGCC AAGGAGGGGC TGATCGCCCG TCAGTTCATG
CTCGGCGTGG TGCAGACCGC CGAAGGGCTG CCGATCTACC ATGAGGTGTT CGATGGCAAC
GCGGCCGAAA CCAGGACCTT GCTGCCCACG CTCACCAAAG TGCTCGAGCG CTTCCCTGCG
GTGCAGCGCC TGGTGCTGGT CGCCGACCGG GGTCTGCTCA GCCTGGATAA CCTCGAGGCC
CTGAAGTCCG TGCGTCTGGC CAGCGGCAAG CCGCTCGAAT TCATCGTCGC GGTGCCGGGT
CGGCGTTACA ACGAGTTCAT CGACCTGCTC GAACCCTTCC ACGAACAGCA ATGCGTCGGC
GCGACCCAGG AAGTCATCTC GGAGCGCGCC TGGAACGCGC TGCGGCTGGT GGTCGCGCAC
GATCCGCTCG CCGCCGCCGA CAAGACGCAG CAGCGCAACG CGCGCATCGA TGCGCTGTTG
CGTCAGGCCG AGCAATGGAC GGGCAAGCTC ACCGACCAGG ACGAAGGCGT CAAGTATCGC
GGTCGCAAGC TCTCGGACAG CGGCGCGAAG GCGCGCTTCT ACCATGTGGT GAGCGAAGCG
CACCTGTCGC GCATCATCAA GGTGGATCTG GCCGAGGAGC TCTTCAGCTA CGACATCGAC
GACAAGGCCC GGCGCCTGGC CGAGATGATG GACGGCAAGC TGCTGCTGGT CACCAACGCC
GAGGGGCTCT CCGCGCAGAA CGTGATTCAG CGCTACAAGT CGCTCGCCGA CATCGAGCGC
GGCTTCAAGG TGCTCAAGTC CGAGATCGAG ATCGGCCCCG TGTATCACCG CCTGCCCGAG
CGGATCCGCG CGCATGCGTC GATCTGCTTC ATGGCGCTGA TCCTGCATCG GGTCATGCGT
CGCCGGCTCA AGGCCGCCGA CGCGGGCTAC ACGCCCGAGC GGGCGCTCGA ACAACTGCAG
CGCATCCAGC ATCACCGCGT GCGCCTGAAC GGCGGCGAGC CGGTCGCCGG GGTGTCGACG
ATCAGCACGG AGCAGAACGA GGTGCTTCAT GCCTTAGGAA TAGGAAAACC GACGGCGCCG
GAGCAGCTGG CGCTGTTGTA G
 
Protein sequence
MHVKLTTSGG RRYVQLVESY RDEAGRVKKR TVATLGRAEQ VDGSLDAVIN GLLKITGREP 
MGAKPPAPTV SFESARALGD VWALTELWNS LGFSGLRRVF SRTRHTTDVE ALIRLMVLNR
LCDPESKLGV LRWVHTVALP DFRPKAVTHQ QLLRSLDALM DHQDEVDAVV AGLLRPLIDQ
DLSVVFYDLT TIRSEGLSQM TGDVRQFGMA KEGLIARQFM LGVVQTAEGL PIYHEVFDGN
AAETRTLLPT LTKVLERFPA VQRLVLVADR GLLSLDNLEA LKSVRLASGK PLEFIVAVPG
RRYNEFIDLL EPFHEQQCVG ATQEVISERA WNALRLVVAH DPLAAADKTQ QRNARIDALL
RQAEQWTGKL TDQDEGVKYR GRKLSDSGAK ARFYHVVSEA HLSRIIKVDL AEELFSYDID
DKARRLAEMM DGKLLLVTNA EGLSAQNVIQ RYKSLADIER GFKVLKSEIE IGPVYHRLPE
RIRAHASICF MALILHRVMR RRLKAADAGY TPERALEQLQ RIQHHRVRLN GGEPVAGVST
ISTEQNEVLH ALGIGKPTAP EQLALL