Gene Tmz1t_4086 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTmz1t_4086 
Symbol 
ID7873313 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThauera sp. MZ1T 
KingdomBacteria 
Replicon accessionNC_011662 
Strand
Start bp4489059 
End bp4490366 
Gene Length1308 bp 
Protein Length435 aa 
Translation table11 
GC content70% 
IMG OID643701017 
Productsun protein 
Protein accessionYP_002891040 
Protein GI237654726 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0144] tRNA and rRNA cytosine-C5-methylases 
TIGRFAM ID[TIGR00563] ribosomal RNA small subunit methyltransferase RsmB 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGCGTGGTG CCCCACGCCG TGCTCCGGAA CAGCCCATCG ACAGCCTCGG CTACGCGCTC 
GCACGGGCAA CCGAACTGGT CGCTGGCGTG ATCGACGGCG CCAACCTGAC CGATGTCTTC
GAGCGCATGC AGGCCGGGCA TCCGGAGTGG CCGGAAGGCA CGCGCGGGGC CGTGCGCGAC
CTCGCCTGGT CGACCTTGCG CGAGTTCGGG CGCGGTGATG CGATCCTTTC CCGCCTGCTG
CATAGTCCTC CTCCGGTGGA GATTCGGGCG CTTCTGCTGG TTGCCCTGCA GCGCCTGACG
CAGCGCCCCG AGCAAGCCCA CACGGTGGTC GATCAGGCTG TGGGCGCCAC GGCCGTGGCG
ATGCCCGGCC TGCGCAACCT GGTCAACGGT GTGCTGCGCA ATGCGCTGCG TCGGCAGCCG
GAGTGGCAAG GCTGGATCGA GGCCGAGCCC GAGGCGCGTC ATGCCTTTCC GGCCTGGTGG
GTGGAGCGTG TGCGCAGCGC GCATCCTCAG GCCTGGCAGG ATCTGCTCGC GGCCGGCAAT
ACGCGTCCCC CCATGGCCTT GCGGGTCAAT CCGCGCCGTG CCACGTTGGC CGAGGTCGAG
GCGGAGCTTG CCGCAGCCGG GCTGGAGTTC CGGCGACTCG ACAACGACGC GCTGGTGCTC
GCGCGTCCGC TGGCGGTCGC ACGCCTGCCC GGGTACGCCG AAGGGCGCTT GTCGGTGCAG
GACGCGGGGG CACAGTGGGC GGCGCAACTG CTCGATGTTC GAGCCGGCGA GCGTGTGCTC
GACGCCTGCG CAGCCCCCGG CGGCAAGACT GCACACATCC TCGAGAGGGC CGATGCGGAC
CTGCTCGCGC TGGAGCTTGA TCCGCTGCGG GCGGGTCGGG TGGCGCGCAA CCTCGACCGC
CTCGGCCTGC GCGCGGAGCT GAAGGTCGCC GACTGCCGCC GCCTGGCAGC GTGGTGGGAT
GGTCGTCCCT TCGACCGCAT CCTGGCCGAT GTGCCGTGCT CGGCATCCGG CGTGGTGCGC
CGACATCCGG ACATCAAGTG GTTGCGCCGG GACAGCGATA TCGCCAACTT TGCCGCACAG
CAGGCGGAAA TCCTGGAGGC ACTTTGGCGC ACGCTCGCCC CGGGTGGCAC AATGCTCTAC
GTCACCTGCT CGGTGTTCGA CGAGGAAAAC GCCGGCCAGG TCGCCCGCTT CTGCGTCCGC
CATGCCGACG CGGAGCGACT CCCGATTCGC GGATCTTCCG ACCTGCAGCT GCTGCCTTGT
GCCGACCATG ACGGCTTCTA TTACGCGCTC CTCCGCAAGC GGCCCTGA
 
Protein sequence
MRGAPRRAPE QPIDSLGYAL ARATELVAGV IDGANLTDVF ERMQAGHPEW PEGTRGAVRD 
LAWSTLREFG RGDAILSRLL HSPPPVEIRA LLLVALQRLT QRPEQAHTVV DQAVGATAVA
MPGLRNLVNG VLRNALRRQP EWQGWIEAEP EARHAFPAWW VERVRSAHPQ AWQDLLAAGN
TRPPMALRVN PRRATLAEVE AELAAAGLEF RRLDNDALVL ARPLAVARLP GYAEGRLSVQ
DAGAQWAAQL LDVRAGERVL DACAAPGGKT AHILERADAD LLALELDPLR AGRVARNLDR
LGLRAELKVA DCRRLAAWWD GRPFDRILAD VPCSASGVVR RHPDIKWLRR DSDIANFAAQ
QAEILEALWR TLAPGGTMLY VTCSVFDEEN AGQVARFCVR HADAERLPIR GSSDLQLLPC
ADHDGFYYAL LRKRP