Gene Information |
Locus tag | Tmz1t_3035 |
Symbol | rpsA |
ID | 7874505 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 3284192 |
End bp | 3285895 |
Gene Length | 1704 bp |
Protein Length | 567 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 643699958 |
Product | 30S ribosomal protein S1 |
Protein accession | YP_002890010 |
Protein GI | 237653696 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0539] Ribosomal protein S1 |
TIGRFAM ID | [TIGR00717] ribosomal protein S1 |
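The coordinate and length fields above can be cross-checked with a short sketch. It assumes the Start bp and End bp values are 1-based and inclusive, and that the gene length counts one terminal stop codon. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 3284192
end_bp = 3285895
gene_length_bp = 1704
protein_length_aa = 567

# Assumption: coordinates are 1-based and inclusive, so the span
# covers both endpoints.
span = end_bp - start_bp + 1

# Assumption: the coding sequence includes one stop codon, which is
# not counted in the protein length.
codons = gene_length_bp // 3
expected_protein_length = codons - 1

print(span == gene_length_bp)                        # coordinates match gene length
print(gene_length_bp % 3 == 0)                       # length is a whole number of codons
print(expected_protein_length == protein_length_aa)  # codons minus stop match protein length
```

Under these assumptions all three checks agree with the values listed in the record.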
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.00687642 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCCAACG CCACCCCTGC CTTCGAAGAA AGCTTTGCCG CCCTCTTCGA GGAAAGCCTC GCCCTCCAGG AAATGCGCGC CGGTGAAGTC ATCACCGCCG AAGTCGTGCG CATCGACCAG AACTTCGTCG TCGTCAACGC CGGCCTGAAG TCCGAAAGCT ACGTGCCCAT CGAGGAGTTC CGCAACGACC GCGGCGAACT CGAAGCCAAC ATCGGCGACT TCGTCCACGT CGCCATCGAC GCCCTCGAAG ACGGCTACGG CGAGACCCGC CTGTCGCGCG ACAAGGCCAA GCGCATCGCC GCCTGGAACG ACCTCGAGAA GGCGCTCAAC GAAGGCACCC TGGTCAAGGG CGTCATCACC GGCCGCGTCA AGGGTGGCCT GACCGTCATG ACCAACAGCA TCCGCGCCTT CCTGCCGGGT TCGCTGGTCG ACATGCGTCC GGTCAAGGAC ACCACGCCGT ACGAAGGCAA GGAATACGAG TTCAAGGTCA TCAAGCTCGA CCGCAAGCGC AACAACGTCG TGGTCTCGCG TCGCGCCGTG CTGGAAGAGT CGATGGGCGA AGAGCGCGAA AAGCTGCTCG CCAACCTCAA GGAAGGCACC GTCATCAAGG GCGTGGTCAA GAACATCACC GACTACGGTG CGTTCGTGGA CCTCGGCGGC ATCGACGGCC TGCTGCACAT CACCGACCTG GCCTGGCGCC GCGTCCGTCA CCCGTCCGAA GTGCTGTCGG TCGGCGACGA GATCGAAGCC AAGGTGCTCA AGTTCGACCA GGAAAAGAAC CGCGTCTCGC TCGGCCTCAA GCAGCTCGGC GAAGACCCGT GGGTCGGCAT CTCGCGCCGC TACCCGCAGG GCACCCGTCT GTTCGGCAAG GTCACCAACA TCACCGACTA CGGCGCCTTC GTCGAGGTCG AGCAGGGCAT CGAGGGCCTG GTCCACGTGT CCGAGATGGA CTGGACCAAC AAGAACATCC ACCCGACCAA GGTCGTGCAG CTGGGCGACG AGGTCGAGGT CATGATCCTC GAGATCGACG AAGACCGTCG CCGCATCTCG CTGGGCATGA AGCAGTGCAT GTCCAACCCG TGGGACGATT TTGCCATCAA CCACAAGAAG GGCGACAAGG TCCGTGGCCA GATCAAGTCG ATCACCGACT TCGGCGTGTT CATCGGCCTG GAAGGCGGCA TCGACGGCCT GGTGCACCTG TCCGACCTGT CGTGGAGCGA GTCGGGCGAG GATGCCGTGC GCCGCTTCAA GAAGGGCGAC GAGGTCGAGG CCGTGGTGCT GGCGATCGAC GTCGAGCGCG AGCGCATCTC GCTCGGCATC AAGCAGCTCG AGGGCGACCC CTACACCAAC TTCATCGCCA CCCACGAGAA GAACAGCCTC GTGCGCGGCA CCGTGAAGTC GGTCGAGGCC CGTGGCGCGG TGATCTCGCT GGGTGACGAC GTCGAAGGCT ACCTGCGCGC GTCCGAGGCC GCTCCGCACC GTGTCGACGA CCTCACCACC ATGTTCAAGG AAGGGGACGA GGTCGAGGCG CTGGTGATCA ACGTGGATCG CAAGACCCGC TCGATCAGCC TGTCGATTCG TGCCAAGGAC CAGGTCGAGC AGTCCGAAGC CATGAGCAAG CTGGCTTCCG AGAGCTCCGC TTCCGCCGGT ACCACCAACC TGGGTGCCCT GCTCAAGGCC AAGCTGAACG AGCAGAAGCA GTAA
|
Protein sequence | MSNATPAFEE SFAALFEESL ALQEMRAGEV ITAEVVRIDQ NFVVVNAGLK SESYVPIEEF RNDRGELEAN IGDFVHVAID ALEDGYGETR LSRDKAKRIA AWNDLEKALN EGTLVKGVIT GRVKGGLTVM TNSIRAFLPG SLVDMRPVKD TTPYEGKEYE FKVIKLDRKR NNVVVSRRAV LEESMGEERE KLLANLKEGT VIKGVVKNIT DYGAFVDLGG IDGLLHITDL AWRRVRHPSE VLSVGDEIEA KVLKFDQEKN RVSLGLKQLG EDPWVGISRR YPQGTRLFGK VTNITDYGAF VEVEQGIEGL VHVSEMDWTN KNIHPTKVVQ LGDEVEVMIL EIDEDRRRIS LGMKQCMSNP WDDFAINHKK GDKVRGQIKS ITDFGVFIGL EGGIDGLVHL SDLSWSESGE DAVRRFKKGD EVEAVVLAID VERERISLGI KQLEGDPYTN FIATHEKNSL VRGTVKSVEA RGAVISLGDD VEGYLRASEA APHRVDDLTT MFKEGDEVEA LVINVDRKTR SISLSIRAKD QVEQSEAMSK LASESSASAG TTNLGALLKA KLNEQKQ
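The gene sequence above begins with ATG and ends with TAA, so it appears to be given in coding orientation even though the gene lies on the minus strand. A minimal sketch of how the first and last codons map onto the protein sequence follows. It assumes the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code (the tables differ in permitted start codons). The `translate` helper is hypothetical and written only for this illustration.

```python
# Assumption: standard amino-acid assignments, indexed in TCAG order.
# Translation table 11 uses the same codon-to-amino-acid mapping.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string; spaces between blocks are ignored."""
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3  # drop any trailing partial codon
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First 30 and last 24 nucleotides, copied from the gene sequence above.
first_30 = "ATGTCCAACG CCACCCCTGC CTTCGAAGAA"
last_24 = "AAGCTGAACG AGCAGAAGCA GTAA"

print(translate(first_30))  # MSNATPAFEE
print(translate(last_24))   # KLNEQKQ*
```

The first ten residues and the last seven residues match the ends of the protein sequence listed above, with `*` marking the TAA stop codon.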