Gene Tmz1t_3035 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTmz1t_3035 
SymbolrpsA 
ID7874505 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThauera sp. MZ1T 
KingdomBacteria 
Replicon accessionNC_011662 
Strand
Start bp3284192 
End bp3285895 
Gene Length1704 bp 
Protein Length567 aa 
Translation table11 
GC content64% 
IMG OID643699958 
Product30S ribosomal protein S1 
Protein accessionYP_002890010 
Protein GI237653696 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0539] Ribosomal protein S1 
TIGRFAM ID[TIGR00717] ribosomal protein S1 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00687642 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCCAACG CCACCCCTGC CTTCGAAGAA AGCTTTGCCG CCCTCTTCGA GGAAAGCCTC 
GCCCTCCAGG AAATGCGCGC CGGTGAAGTC ATCACCGCCG AAGTCGTGCG CATCGACCAG
AACTTCGTCG TCGTCAACGC CGGCCTGAAG TCCGAAAGCT ACGTGCCCAT CGAGGAGTTC
CGCAACGACC GCGGCGAACT CGAAGCCAAC ATCGGCGACT TCGTCCACGT CGCCATCGAC
GCCCTCGAAG ACGGCTACGG CGAGACCCGC CTGTCGCGCG ACAAGGCCAA GCGCATCGCC
GCCTGGAACG ACCTCGAGAA GGCGCTCAAC GAAGGCACCC TGGTCAAGGG CGTCATCACC
GGCCGCGTCA AGGGTGGCCT GACCGTCATG ACCAACAGCA TCCGCGCCTT CCTGCCGGGT
TCGCTGGTCG ACATGCGTCC GGTCAAGGAC ACCACGCCGT ACGAAGGCAA GGAATACGAG
TTCAAGGTCA TCAAGCTCGA CCGCAAGCGC AACAACGTCG TGGTCTCGCG TCGCGCCGTG
CTGGAAGAGT CGATGGGCGA AGAGCGCGAA AAGCTGCTCG CCAACCTCAA GGAAGGCACC
GTCATCAAGG GCGTGGTCAA GAACATCACC GACTACGGTG CGTTCGTGGA CCTCGGCGGC
ATCGACGGCC TGCTGCACAT CACCGACCTG GCCTGGCGCC GCGTCCGTCA CCCGTCCGAA
GTGCTGTCGG TCGGCGACGA GATCGAAGCC AAGGTGCTCA AGTTCGACCA GGAAAAGAAC
CGCGTCTCGC TCGGCCTCAA GCAGCTCGGC GAAGACCCGT GGGTCGGCAT CTCGCGCCGC
TACCCGCAGG GCACCCGTCT GTTCGGCAAG GTCACCAACA TCACCGACTA CGGCGCCTTC
GTCGAGGTCG AGCAGGGCAT CGAGGGCCTG GTCCACGTGT CCGAGATGGA CTGGACCAAC
AAGAACATCC ACCCGACCAA GGTCGTGCAG CTGGGCGACG AGGTCGAGGT CATGATCCTC
GAGATCGACG AAGACCGTCG CCGCATCTCG CTGGGCATGA AGCAGTGCAT GTCCAACCCG
TGGGACGATT TTGCCATCAA CCACAAGAAG GGCGACAAGG TCCGTGGCCA GATCAAGTCG
ATCACCGACT TCGGCGTGTT CATCGGCCTG GAAGGCGGCA TCGACGGCCT GGTGCACCTG
TCCGACCTGT CGTGGAGCGA GTCGGGCGAG GATGCCGTGC GCCGCTTCAA GAAGGGCGAC
GAGGTCGAGG CCGTGGTGCT GGCGATCGAC GTCGAGCGCG AGCGCATCTC GCTCGGCATC
AAGCAGCTCG AGGGCGACCC CTACACCAAC TTCATCGCCA CCCACGAGAA GAACAGCCTC
GTGCGCGGCA CCGTGAAGTC GGTCGAGGCC CGTGGCGCGG TGATCTCGCT GGGTGACGAC
GTCGAAGGCT ACCTGCGCGC GTCCGAGGCC GCTCCGCACC GTGTCGACGA CCTCACCACC
ATGTTCAAGG AAGGGGACGA GGTCGAGGCG CTGGTGATCA ACGTGGATCG CAAGACCCGC
TCGATCAGCC TGTCGATTCG TGCCAAGGAC CAGGTCGAGC AGTCCGAAGC CATGAGCAAG
CTGGCTTCCG AGAGCTCCGC TTCCGCCGGT ACCACCAACC TGGGTGCCCT GCTCAAGGCC
AAGCTGAACG AGCAGAAGCA GTAA
 
Protein sequence
MSNATPAFEE SFAALFEESL ALQEMRAGEV ITAEVVRIDQ NFVVVNAGLK SESYVPIEEF 
RNDRGELEAN IGDFVHVAID ALEDGYGETR LSRDKAKRIA AWNDLEKALN EGTLVKGVIT
GRVKGGLTVM TNSIRAFLPG SLVDMRPVKD TTPYEGKEYE FKVIKLDRKR NNVVVSRRAV
LEESMGEERE KLLANLKEGT VIKGVVKNIT DYGAFVDLGG IDGLLHITDL AWRRVRHPSE
VLSVGDEIEA KVLKFDQEKN RVSLGLKQLG EDPWVGISRR YPQGTRLFGK VTNITDYGAF
VEVEQGIEGL VHVSEMDWTN KNIHPTKVVQ LGDEVEVMIL EIDEDRRRIS LGMKQCMSNP
WDDFAINHKK GDKVRGQIKS ITDFGVFIGL EGGIDGLVHL SDLSWSESGE DAVRRFKKGD
EVEAVVLAID VERERISLGI KQLEGDPYTN FIATHEKNSL VRGTVKSVEA RGAVISLGDD
VEGYLRASEA APHRVDDLTT MFKEGDEVEA LVINVDRKTR SISLSIRAKD QVEQSEAMSK
LASESSASAG TTNLGALLKA KLNEQKQ