Gene Tmz1t_0253 details


Gene Information

Locus tag: Tmz1t_0253
Symbol:
ID: 7084375
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thauera sp. MZ1T
Kingdom: Bacteria
Replicon accession: NC_011662
Strand:
Start bp: 286815
End bp: 288122
Gene Length: 1308 bp
Protein Length: 435 aa
Translation table: 11
GC content: 64%
IMG OID: 643697296
Product: integrase family protein
Protein accession: YP_002353944
Protein GI: 217968710
COG category: [L] Replication, recombination and repair
COG ID: [COG4973] Site-specific recombinase XerC
TIGRFAM ID:
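
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive coordinates and that the gene length includes the terminal stop codon; neither convention is stated in this record. The function names are hypothetical and chosen only for illustration.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 286815
END_BP = 288122
REPORTED_GENE_LENGTH_BP = 1308
REPORTED_PROTEIN_LENGTH_AA = 435


def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count, assuming the final codon of the CDS is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length_bp(START_BP, END_BP)
    residues = protein_length_aa(length)
    # Compare the derived values with the values reported in the record.
    print(length, length == REPORTED_GENE_LENGTH_BP)
    print(residues, residues == REPORTED_PROTEIN_LENGTH_AA)
```

Under these assumptions the derived values agree with the reported ones: 288122 - 286815 + 1 = 1308 bp, and 1308 / 3 - 1 = 435 aa.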


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00000373788
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCTAGCA TCCGTGCCAG AAAGGATAAC GGAATGCTCT TCATCGACTT CCGGTACCAG 
GACAAGCGTT ACCGCGAACA GACCGCGCTC GGCGACACCG CAGCCAACCG CAAGCGCTTG
CAGAAGGTGC TTGACCGCAT CGAGGCCGAC ATCGCCGCCG GCACCTTCGA CTACCGCCGC
TTCTTCCCAG GCAGCAAGAA CGCCGCGAAG TTCGATCCAG CCCCTGGGGG GATGGTCGGG
CCGGTCAGCG CTGCTGCGGT TGCGTTGCCG TCGGCGGCAG CCAGCGTCGC GAGCACTCCG
CTCTTCAAGG ACTTCGCAGA GACCTGGTAC GGCGAGAAAG AGGTGGAGTG GCGGCGTTCG
TACAAGACCA CGCTGCGCGC GACCCTCGAT CGCGCCCTCA TCCCGAGGTT CGGGGAGAAG
GAGGTCGGCC AGATCTCCAA GGCGGACGTC CTTGCCTATC GCGCCGAGCT CGGGAAAGCG
ACCGCGAAAG GCAAGCAAAC CAAACTGTCT GCGGCAAGGA TCAACAAGAT GCTGAACCCG
CTCAGGCAAG TCCTCAATGA AGCGGCCGAC CGGTTTGATT TTCGCACGCC CTTCGACAGC
GTGAAACAGC TCAAGACGAA GCGCACCGAC GTCGATCCCT TCACCCTCGC CGAGGTCAAG
CAGATCCTCG ACACCGTGCG GCCCGACTTC AGGAACTACT TCACCGTGCG CTTCTTCACC
GGCCTGCGCA CCGGCGAGGT CGACGGACTG CAGTGGAAGT ACGTCGACTT CGACAACCGC
CTCATCCTGG TTCGCGAGAC CATCGTCGGC GGTGAAGAGG AATACACCAA GACGGACGGC
AGCCAGCGCG ACATCCAGAT GAGTCAGCTC GTCTTCGATG CGCTGCAGGC GCAGTTCGAG
GCCACCGGCA AGCTCGGCAA GTTCGTGTTC TGCAATCGGC TGGGGACGCC GCTGGACCAC
AAGAACGTCA CCAACCGGGT GTGGTACCCG CTGCTGCGGC ACCTCAACCT CAAGCAGCGC
CGGCCGTATC AGTGCCGCCA CACCGCCGCC ACGCTGTGGC TCGCCAGCGG CGAGGCGCCC
GAGTGGATCG CCCGTCAGCT CGGGCACACC ACCACCGAGA TGCTGTTTCG GGTGTATTCG
CGCTACGTGC CCAACCTCAC GCGGCGGGAT GGCTCGGCCT TCGAGCGCCT CATCACGCAG
ACCCTCGGCA CCCAGCTCCT GCCGGTGAAG ACCGCCCCTG CGGAGGAGGC GGAGCAGGAG
GCCGAGCTGC TGGCAGAGAA CGCCGCTCAA GGAGGAGACC ATGAGTGA
 
Protein sequence
MASIRARKDN GMLFIDFRYQ DKRYREQTAL GDTAANRKRL QKVLDRIEAD IAAGTFDYRR 
FFPGSKNAAK FDPAPGGMVG PVSAAAVALP SAAASVASTP LFKDFAETWY GEKEVEWRRS
YKTTLRATLD RALIPRFGEK EVGQISKADV LAYRAELGKA TAKGKQTKLS AARINKMLNP
LRQVLNEAAD RFDFRTPFDS VKQLKTKRTD VDPFTLAEVK QILDTVRPDF RNYFTVRFFT
GLRTGEVDGL QWKYVDFDNR LILVRETIVG GEEEYTKTDG SQRDIQMSQL VFDALQAQFE
ATGKLGKFVF CNRLGTPLDH KNVTNRVWYP LLRHLNLKQR RPYQCRHTAA TLWLASGEAP
EWIARQLGHT TTEMLFRVYS RYVPNLTRRD GSAFERLITQ TLGTQLLPVK TAPAEEAEQE
AELLAENAAQ GGDHE
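
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It is a minimal example, not the pipeline that produced this record. It uses the amino acid assignments of NCBI translation table 11, which are the same as those of the standard code, and it does not handle the alternative start codons that table 11 permits, so a gene starting with GTG or TTG would not get an initial M here. The names `CODON_TABLE` and `translate` are hypothetical.

```python
# Codon table built from the NCBI ordering of bases (T, C, A, G).
# Table 11 uses the same codon-to-amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
assert len(AMINO_ACIDS) == 64

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon.

    Whitespace (such as the 10-base block spacing used in this record)
    is removed first; trailing bases that do not fill a codon are ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":  # stop codon
            break
        protein.append(residue)
    return "".join(protein)


if __name__ == "__main__":
    # First two blocks and last two blocks of the gene sequence above.
    print(translate("ATGGCTAGCA TCCGTGCCAG"))
    print(translate("GGAGGAGACC ATGAGTGA"))
```

The first call covers the start of the gene and the second covers its end, including the TGA stop codon. The results can be compared with the first and last residues of the protein sequence listed above.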