Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_0103 |
Symbol | rpsA |
ID | 5897815 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 112596 |
End bp | 114305 |
Gene Length | 1710 bp |
Protein Length | 569 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 641560587 |
Product | 30S ribosomal protein S1 |
Protein accession | YP_001681739 |
Protein GI | 167644076 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0539] Ribosomal protein S1 |
TIGRFAM ID | [TIGR00717] ribosomal protein S1 |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.924927 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTGACG ATATGAGCTT CAACCCGACG CGCGACGATT TCGAAGCGCT GCTGAACGAT TCCATGGGCG GCCGCGACTT CCAGGAAGGC ACCGTCGTCA AGGGCATCGT TGTCGGGATC GAGAAGGACT TCGCGATCAT CGACGTCGGT CTGAAGACCG AAGGTCGCGT TCCGCAGAAG GAATTCGGCG TCGACGAAAC CGGCAAGGCC ACCATCAAGG TCGGCGACTC GGTCGAAGTC TTCCTGGAAC GCCTGGAAAA CGCCCTGGGC GAAGCGGTCA TCAGCCGCGA AAAGGCCAAG CGCGAAGAAG CCTGGACCCG TCTGGAAGGC GTCTACGCCA AGAACGAACC CGTCATGGGC GCGATCGTCG GTCGCGTGAA GGGCGGCTTC ACCGTCGACC TGGGCGGCGC CTCGGCCTTC CTGCCGGGCT CGCAAGTCGA CATCCGCCCG GTGCGCGACG TCGGCCCGCT GATGGGCAAG GAACAGCCCT TCGCCATCCT CAAGATGGAC CGTCCGCGCG GCAACATCGT CGTCTCGCGT CGCGCCATCC TGGAAGAAGC CCGCGCCGAG CAGCGCACCG AACTGGTGTC GCAGCTGCAA GAGGGTGAAA TCCGCGAAGG CGTGGTCAAG AACATCACCG ACTACGGCGC CTTCGTGGAC CTGGGCGGCA TCGACGGCCT GCTGCACGTG ACCGACATGA GCTGGAAGCG CGTCAACCAC CCGAGCCAAG TGCTCGCCGT GGGCGACACC GTGAAGGTCC AGATCGTCAA GATCAACCCG GACACCCAGC GCATCAGCCT CGGCATGAAG CAGCTGCAGT CCGATCCGTG GGATGGCGTG GAAGCGAAGT ACCCGGTCGG CGCCAAGTTC ACGGGCCGCA TCACCAACAT CACCGACTAT GGCGCGTTCG TTGAGCTGGA AGCCGGCGTT GAAGGCCTGG TCCACGTCTC GGAAATGTCC TGGACCAAGA AGAACGTCCA CCCCGGCAAG ATCGTCTCGA CCTCGCAAGA AGTCGACGTG GTCGTGCTGG ATGTCGATCC GTCCAAGCGC CGCGTTTCGC TGGGCCTGAA GCAGGCCCTG GCCAACCCGT GGGATTCGTT CCTCGAAACC CACCCGGTCG GCTCGACGGT CGAAGGCGAA GTCAAGAACG CGACCGAGTT TGGTCTGTTC ATCGGCCTCG ACAACGACAT CGACGGCATG GTGCACCTGT CGGACATCGA CTGGAGCGCG TCGGGCGAAG AAGCCATGTC GCGTTACAAG AAGGGCGACA TGGTCAAGGC GAAGGTTCTC GACGTCGACG TCGAGAAGGA ACGCATCTCG CTGGGCATCA AGCAGCTGGG CGGCGATCCG ATGTCGGGCG ACACCTACCG CAAGAACCAG ACCGTGACCG TCACGGTGAC GGAAGTCACC ACCGGCGGCA TCGAAGTGCG CTTCGGTGAA GACGACGCCC TGATGACCGC CTTCATCCGC AAGTCGGACC TGTCGCGCGA TCGCCAGGAA CAGCGCCCCG AGCGCTTCGC CGTGGGTGAC CGCGTCGACG CTCAGATCAC CAACATCGAC AAGGCGGCTC GCCGCGTTTC GATGTCGATC AAGTCGCTGG AAATGGCCGA AGAGAAGGAA GCCATCGAGC AGTTCGGCTC GTCGGACTCC GGCGCTTCGC TGGGCGACAT TCTCGGCGCG GCCCTGCGCG AGCGGGCCAC CAAGGACTAA
|
Protein sequence | MADDMSFNPT RDDFEALLND SMGGRDFQEG TVVKGIVVGI EKDFAIIDVG LKTEGRVPQK EFGVDETGKA TIKVGDSVEV FLERLENALG EAVISREKAK REEAWTRLEG VYAKNEPVMG AIVGRVKGGF TVDLGGASAF LPGSQVDIRP VRDVGPLMGK EQPFAILKMD RPRGNIVVSR RAILEEARAE QRTELVSQLQ EGEIREGVVK NITDYGAFVD LGGIDGLLHV TDMSWKRVNH PSQVLAVGDT VKVQIVKINP DTQRISLGMK QLQSDPWDGV EAKYPVGAKF TGRITNITDY GAFVELEAGV EGLVHVSEMS WTKKNVHPGK IVSTSQEVDV VVLDVDPSKR RVSLGLKQAL ANPWDSFLET HPVGSTVEGE VKNATEFGLF IGLDNDIDGM VHLSDIDWSA SGEEAMSRYK KGDMVKAKVL DVDVEKERIS LGIKQLGGDP MSGDTYRKNQ TVTVTVTEVT TGGIEVRFGE DDALMTAFIR KSDLSRDRQE QRPERFAVGD RVDAQITNID KAARRVSMSI KSLEMAEEKE AIEQFGSSDS GASLGDILGA ALRERATKD
|
| |