Gene Acid345_4661 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_4661 
Symbol 
ID4070706 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp5518376 
End bp5520205 
Gene Length1830 bp 
Protein Length609 aa 
Translation table11 
GC content60% 
IMG OID637986701 
ProductSSU ribosomal protein S1P 
Protein accessionYP_593735 
Protein GI94971687 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0539] Ribosomal protein S1 
TIGRFAM ID[TIGR00717] ribosomal protein S1 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0143658 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCTTTCG ACGATCCAAA CGTTACCTCG TCCACCGAAC AAAGCGAAGA ACACGGTGCT 
GCAGCGTCGC AGCAGCCGGT TGCTGTGCAG GCCCATAATC CACCCGAGGC AAAGCCCACT
GCGGGCCGTC CTCGCAACGA AGAGAACATG ACGGAAGATT TCGCAACCGC ACTCGAATCC
TTTGAACAAG AACAGTCTGA GCAGGCATTG AATGAGGACC GCGTCCTCGT TGGCCGAGTG
CTCAGCATAA CCCCCCAGTA CGTCGTCGTA GACGTGGGCT TGAAGTCCGA GGGCGTAGTG
CCCATCGAGG AAGTCAAGGA CCACGACGGC AACGTTTCCT TCCAGCCGGG CGAAGAAATC
GCCGTGATGC AGGAAAAGGG ACACACCGAA GAGGGGTACG TGCACCTCTC CCACCAGAAG
GCACAGCGCC TGAAGGCATG GGACGAGATC GAGAAAGCGT ACAACGATAA ATCTTCCATC
AAGGCGCGGG CGATTGACCG CATCAAGGGT GGCCTCACCG TCGACATCAT GGGAGCGCGC
GCGTTCCTGC CAGGTTCCCA GGTGGACCTG CGGCCGGTGC GCAATCTTGA CGCACTGAAG
GGCCATGAGC TCGAAGTCCG GATCATCAAG CTGAACAAGA AGCGCGGCAA CATCGTAGTT
TCGCGCAAGC AGATCCTGGA AGAAGAGCAG AACGACAAGA AGTCGAAGAC GCTCGAGCAC
CTCAACGAAG ACGCGGTTCT CACCGGCACG GTGAAGAACC TGACCGACTA CGGTGCGTTC
GTTGACCTCG GCGGCATCGA TGGCCTGCTG CACATCACCG ACATGTCGTG GGGACGCCTG
ACTCATCCGC GCGACCTCGT TCAAGTCGGC GACCAGATCC AGGTAAAGGT GCTGAAGTTC
GACCGAGATA AGCAGCGTGT CTCGCTGGGC TTCAAGCAGC TCACGCCTGA CCCGTGGCTC
GACGCATCCG AACGGTACCC GATTGGCGCG CGCGTACACG GCCGCGTGAT CAGCGTGACC
GACTACGGTG CGTTCATCGA ACTCGAACAG GGGATTGAAG GTCTCGTGCA CGTGAGCGAG
ATGACCTGGT CGAAGCGGAT GAAGCATCCG TCGAAAATCG TCAACGTTGG CGATCAAGTC
GACGCAGTGG TGCTGAACGT GAATCCGCAG GAACGTCGCA TCAGCCTCGG CCTGAAGCAG
CTCGAAACTA ACCCGTGGGA GTCGCTGCAT GAGAAGTTCC CGGTGGGCGG CGTGGTTGAG
GGCAAGGTCC GCAACCTGAC CGACTTCGGC GCGTTCATCG AGATTGAAGA CGGCATCGAC
GGCCTCGTCC ACGTCAGCAA CCTGAGCTGG ACGAAGCGCG TGAAGCATCC TTCGGAAGTG
CTGAAGAAGG GCGATAAGGT CAAGGCTGTG GTGCTCGCAA TCGAGCCCGA CAACCGCCGC
CTCTCGCTCG GCGTGAAGCA GTTACAGCCC GATGTCTGGG AGACGTTCTT CGAAACGCAT
CGCGTTGGCG ACATCATCCA CGGCAAGGTG CTGCGCCTCG CGAGCTTCGG TGCATTCATC
GAGATCGCGG ACGGAGTGGA GGGCCTGTGC CACAACTCCG AAGCGAGCGA TGAGCACGGC
GCTCCGCTCA AGCTGGAACC CGGACAAGAG TTCGACTTCA AGATCATCAA GATGAATCCT
GATGAAAAGA AGGTCGGCCT CAGCCTCCGC GCAGTCGGCG AAGAAGCCAG CCGCGTAGAG
ATCGAGAACT ACAAGGCTCC GGCCTCGAGT TCTCCGGGCG CTGCGACCAT CGGCGAACTG
CTAAGCTGGA AGCGAGAGCA GCAAGACTAA
 
Protein sequence
MPFDDPNVTS STEQSEEHGA AASQQPVAVQ AHNPPEAKPT AGRPRNEENM TEDFATALES 
FEQEQSEQAL NEDRVLVGRV LSITPQYVVV DVGLKSEGVV PIEEVKDHDG NVSFQPGEEI
AVMQEKGHTE EGYVHLSHQK AQRLKAWDEI EKAYNDKSSI KARAIDRIKG GLTVDIMGAR
AFLPGSQVDL RPVRNLDALK GHELEVRIIK LNKKRGNIVV SRKQILEEEQ NDKKSKTLEH
LNEDAVLTGT VKNLTDYGAF VDLGGIDGLL HITDMSWGRL THPRDLVQVG DQIQVKVLKF
DRDKQRVSLG FKQLTPDPWL DASERYPIGA RVHGRVISVT DYGAFIELEQ GIEGLVHVSE
MTWSKRMKHP SKIVNVGDQV DAVVLNVNPQ ERRISLGLKQ LETNPWESLH EKFPVGGVVE
GKVRNLTDFG AFIEIEDGID GLVHVSNLSW TKRVKHPSEV LKKGDKVKAV VLAIEPDNRR
LSLGVKQLQP DVWETFFETH RVGDIIHGKV LRLASFGAFI EIADGVEGLC HNSEASDEHG
APLKLEPGQE FDFKIIKMNP DEKKVGLSLR AVGEEASRVE IENYKAPASS SPGAATIGEL
LSWKREQQD