Gene Caul_0103 details


Gene Information

Locus tag: Caul_0103
Symbol: rpsA
ID: 5897815
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 112596
End bp: 114305
Gene Length: 1710 bp
Protein Length: 569 aa
Translation table: 11
GC content: 64%
IMG OID: 641560587
Product: 30S ribosomal protein S1
Protein accession: YP_001681739
Protein GI: 167644076
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0539] Ribosomal protein S1
TIGRFAM ID: [TIGR00717] ribosomal protein S1
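
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive positions and that the reported gene length includes the stop codon (the gene sequence on this page ends in TAA). The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


# Values copied from the record above.
START_BP, END_BP = 112596, 114305

length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1710
print(protein_length(length_bp))  # 569
```

Both results agree with the Gene Length (1710 bp) and Protein Length (569 aa) fields of the record.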


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.924927
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCTGACG ATATGAGCTT CAACCCGACG CGCGACGATT TCGAAGCGCT GCTGAACGAT 
TCCATGGGCG GCCGCGACTT CCAGGAAGGC ACCGTCGTCA AGGGCATCGT TGTCGGGATC
GAGAAGGACT TCGCGATCAT CGACGTCGGT CTGAAGACCG AAGGTCGCGT TCCGCAGAAG
GAATTCGGCG TCGACGAAAC CGGCAAGGCC ACCATCAAGG TCGGCGACTC GGTCGAAGTC
TTCCTGGAAC GCCTGGAAAA CGCCCTGGGC GAAGCGGTCA TCAGCCGCGA AAAGGCCAAG
CGCGAAGAAG CCTGGACCCG TCTGGAAGGC GTCTACGCCA AGAACGAACC CGTCATGGGC
GCGATCGTCG GTCGCGTGAA GGGCGGCTTC ACCGTCGACC TGGGCGGCGC CTCGGCCTTC
CTGCCGGGCT CGCAAGTCGA CATCCGCCCG GTGCGCGACG TCGGCCCGCT GATGGGCAAG
GAACAGCCCT TCGCCATCCT CAAGATGGAC CGTCCGCGCG GCAACATCGT CGTCTCGCGT
CGCGCCATCC TGGAAGAAGC CCGCGCCGAG CAGCGCACCG AACTGGTGTC GCAGCTGCAA
GAGGGTGAAA TCCGCGAAGG CGTGGTCAAG AACATCACCG ACTACGGCGC CTTCGTGGAC
CTGGGCGGCA TCGACGGCCT GCTGCACGTG ACCGACATGA GCTGGAAGCG CGTCAACCAC
CCGAGCCAAG TGCTCGCCGT GGGCGACACC GTGAAGGTCC AGATCGTCAA GATCAACCCG
GACACCCAGC GCATCAGCCT CGGCATGAAG CAGCTGCAGT CCGATCCGTG GGATGGCGTG
GAAGCGAAGT ACCCGGTCGG CGCCAAGTTC ACGGGCCGCA TCACCAACAT CACCGACTAT
GGCGCGTTCG TTGAGCTGGA AGCCGGCGTT GAAGGCCTGG TCCACGTCTC GGAAATGTCC
TGGACCAAGA AGAACGTCCA CCCCGGCAAG ATCGTCTCGA CCTCGCAAGA AGTCGACGTG
GTCGTGCTGG ATGTCGATCC GTCCAAGCGC CGCGTTTCGC TGGGCCTGAA GCAGGCCCTG
GCCAACCCGT GGGATTCGTT CCTCGAAACC CACCCGGTCG GCTCGACGGT CGAAGGCGAA
GTCAAGAACG CGACCGAGTT TGGTCTGTTC ATCGGCCTCG ACAACGACAT CGACGGCATG
GTGCACCTGT CGGACATCGA CTGGAGCGCG TCGGGCGAAG AAGCCATGTC GCGTTACAAG
AAGGGCGACA TGGTCAAGGC GAAGGTTCTC GACGTCGACG TCGAGAAGGA ACGCATCTCG
CTGGGCATCA AGCAGCTGGG CGGCGATCCG ATGTCGGGCG ACACCTACCG CAAGAACCAG
ACCGTGACCG TCACGGTGAC GGAAGTCACC ACCGGCGGCA TCGAAGTGCG CTTCGGTGAA
GACGACGCCC TGATGACCGC CTTCATCCGC AAGTCGGACC TGTCGCGCGA TCGCCAGGAA
CAGCGCCCCG AGCGCTTCGC CGTGGGTGAC CGCGTCGACG CTCAGATCAC CAACATCGAC
AAGGCGGCTC GCCGCGTTTC GATGTCGATC AAGTCGCTGG AAATGGCCGA AGAGAAGGAA
GCCATCGAGC AGTTCGGCTC GTCGGACTCC GGCGCTTCGC TGGGCGACAT TCTCGGCGCG
GCCCTGCGCG AGCGGGCCAC CAAGGACTAA
 
Protein sequence
MADDMSFNPT RDDFEALLND SMGGRDFQEG TVVKGIVVGI EKDFAIIDVG LKTEGRVPQK 
EFGVDETGKA TIKVGDSVEV FLERLENALG EAVISREKAK REEAWTRLEG VYAKNEPVMG
AIVGRVKGGF TVDLGGASAF LPGSQVDIRP VRDVGPLMGK EQPFAILKMD RPRGNIVVSR
RAILEEARAE QRTELVSQLQ EGEIREGVVK NITDYGAFVD LGGIDGLLHV TDMSWKRVNH
PSQVLAVGDT VKVQIVKINP DTQRISLGMK QLQSDPWDGV EAKYPVGAKF TGRITNITDY
GAFVELEAGV EGLVHVSEMS WTKKNVHPGK IVSTSQEVDV VVLDVDPSKR RVSLGLKQAL
ANPWDSFLET HPVGSTVEGE VKNATEFGLF IGLDNDIDGM VHLSDIDWSA SGEEAMSRYK
KGDMVKAKVL DVDVEKERIS LGIKQLGGDP MSGDTYRKNQ TVTVTVTEVT TGGIEVRFGE
DDALMTAFIR KSDLSRDRQE QRPERFAVGD RVDAQITNID KAARRVSMSI KSLEMAEEKE
AIEQFGSSDS GASLGDILGA ALRERATKD
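
The relationship between the two sequences above can be illustrated by translating the start of the gene sequence and comparing it with the start of the protein sequence. This is a minimal sketch, not the database's own pipeline. It assumes that translation table 11 assigns internal codons the same amino acids as the standard genetic code (the two tables differ in permitted start codons), and it uses only the first 60 bases and first 20 residues copied from this record. The helper names `clean`, `translate`, and `gc_percent` are hypothetical.

```python
from itertools import product

# Standard codon-to-amino-acid assignments, listed in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spacing used in the 10-character display blocks."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence and first two blocks of the protein sequence.
GENE_PREFIX = "ATGGCTGACG ATATGAGCTT CAACCCGACG CGCGACGATT TCGAAGCGCT GCTGAACGAT"
PROTEIN_PREFIX = "MADDMSFNPT RDDFEALLND"

print(translate(GENE_PREFIX))                           # MADDMSFNPTRDDFEALLND
print(translate(GENE_PREFIX) == clean(PROTEIN_PREFIX))  # True
```

Passing the full 1710 bp gene sequence to `translate` and `gc_percent` would allow the same comparison against the complete protein sequence and the GC content field of the record.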