Gene Tmz1t_3892 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTmz1t_3892 
Symbol 
ID7873541 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThauera sp. MZ1T 
KingdomBacteria 
Replicon accessionNC_011662 
Strand
Start bp4288511 
End bp4290196 
Gene Length1686 bp 
Protein Length561 aa 
Translation table11 
GC content66% 
IMG OID643700832 
Producttransposase IS4 family protein 
Protein accessionYP_002890855 
Protein GI237654541 
COG category[L] Replication, recombination and repair 
COG ID[COG5421] Transposase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTACCTGA GGCAGAGCAA ACAGAAGCGG GCAGACGGGC GGGCCATCAC CTACCTGCAA 
CTGGCCGAGA ACGTGTGGGA TGCCTCGAAG GGGCGCTCAC AGGCCCAGAT CGTGTTCAAC
TGCGGCCGTG CCGACGACCC GGAGGTGATC GAGCGGCTGC GCCGTCTGGC CAAGAGCATC
CTGCGCCGAT GCTCGCCCGA GGCCATCGTC GCGGACGATC CAGGCTGGCG GTTGGTGTGC
GCCTGGCCCT ACGGCGATGT GTACGCGCTT CAGGCGGTGT GGCGGCGGCT CGGGATCGAC
GCCATCGTGC GTGCGCAGGC CAAGGGGCGG CGCTTTGGCT TCGAGATGGA GCGGGCCTTG
TTCGCGCTGG TGGCCAATCG CGCTTGTGCG CCGGCCTCTA AACTCTACTG TCACGAGCAG
TGGCTCAAGG AAGACGTGCA TATCGAAGGC ACGCAGGCCC TGGCGCTGCA CCAGCTCTAC
CGCGCGATGG ACTTCCTCGA GGCGAACAAG GCTGCCATCG AGCAGGCGAT CTTCTTCCAG
GTCGCCGACC TGTTGAGTCT GGACGTGGAG ATCGTCTTCT ACGACACCAC CTCATTGCAC
TTCGAGATCG ACAACGAGGA TGAGGGCGAG CCCGATGGCC AGATGCGCGG CAGCGTTGCG
GCCGGCGCCA AGCGCTACGT GGCGCCCAGA AAGCGCGGCT ACAGCAAGAA CGGCCGGGGT
GATGCGCCGC AGATCGTCGT CGGGCTGGCG GTCACCCGCG ACGGCTTCCC GGTGCGCCAC
TGGGTGTTTC CGGGCAACAC CGTCGATGTC ACCACGGTGG CCCAGGTCAA GGAAGATCTG
AAGGGCTGGC AGCTCACCCG CTGCCTGTTC GTCGGGGATG CCGGGATGGT CTCGCAGGCG
AACTTCCAGG CGCTCGCCAA GGGCGGTGGC AAGTACCTGA TGGCGATGCC GATGCGCCGT
GGCGACGAGG TCACCGAAAC GGTGCTCTCG CGCCCGGGCC GCTACCGCAA GATCGCCGAC
AACCTCGAAG TCAAGGAAGT CATCGTCGGC GACGGCGAGC GCCGCCGCCG CTACGCGGTG
TGCTTCAACC CGCAGGAGGC ACATCGCCAG CGCACCTATC GCGCCGAGCG CATCCGTGAG
CTCGAGGCCG AACTCGCCTG CCTGGCCGAT CAGGACGAGG GCGGGCACAG CAAGCGCGTG
TGTGCGCTAC GTAGCAGCGC CCGCTACGGG CGGTTCTTGA AAGAGACCAA GCGCGGCCTG
GCGATCGACC GCCAGGCCAT CGCCGAACTC GAACGCTTCG ACGGCAAGTT CGTCGTCCAC
AGCAACGACG ACACCCTCAC GGCCGAGGAC ATGGCGCTCG GCTACAAGCA GCAGCAGCGC
GTCGAAGAGG CCTGGCGCAC GATGAAGGGC GGCCTGCGCA TGCGCCCAGT GTTCCACTGG
GCACCGCACC GCATCCACGC CCACATTGCG ATCACGGTGC TCGCCTTGCT GCTGGAGCGC
GTGATCGAGC ATGCCTGCCA GGACACCTGG CGCAACATCC GCGACGACCT CAAGCGCATT
CAGCTTGCGC AATTGTCCAG CCCCAACGGC ACCGCCTGGC AGGTCACCGA GCCTACGACG
GAAGCGGCCA ACCGACTGAA AGCATTGAAG ATCAAGCCGC CGCCGGCCAT CCTCAAGCTC
GACTGA
 
Protein sequence
MYLRQSKQKR ADGRAITYLQ LAENVWDASK GRSQAQIVFN CGRADDPEVI ERLRRLAKSI 
LRRCSPEAIV ADDPGWRLVC AWPYGDVYAL QAVWRRLGID AIVRAQAKGR RFGFEMERAL
FALVANRACA PASKLYCHEQ WLKEDVHIEG TQALALHQLY RAMDFLEANK AAIEQAIFFQ
VADLLSLDVE IVFYDTTSLH FEIDNEDEGE PDGQMRGSVA AGAKRYVAPR KRGYSKNGRG
DAPQIVVGLA VTRDGFPVRH WVFPGNTVDV TTVAQVKEDL KGWQLTRCLF VGDAGMVSQA
NFQALAKGGG KYLMAMPMRR GDEVTETVLS RPGRYRKIAD NLEVKEVIVG DGERRRRYAV
CFNPQEAHRQ RTYRAERIRE LEAELACLAD QDEGGHSKRV CALRSSARYG RFLKETKRGL
AIDRQAIAEL ERFDGKFVVH SNDDTLTAED MALGYKQQQR VEEAWRTMKG GLRMRPVFHW
APHRIHAHIA ITVLALLLER VIEHACQDTW RNIRDDLKRI QLAQLSSPNG TAWQVTEPTT
EAANRLKALK IKPPPAILKL D