Gene Acid345_1795 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_1795 
Symbol 
ID4071985 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp2175569 
End bp2177188 
Gene Length1620 bp 
Protein Length539 aa 
Translation table11 
GC content59% 
IMG OID637983803 
ProductSSU ribosomal protein S1P 
Protein accessionYP_590870 
Protein GI94968822 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0539] Ribosomal protein S1 
TIGRFAM ID[TIGR00717] ribosomal protein S1 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTGATT CCGATAACTT CAGCGACATC CTCAAGCAAT ACGACCACGC GCGTAACAGC 
AAAGGCCAAA TCGAAGGCAC CGTCGTCTCG GTCAACGATG AATTTGTTTT CGTGGACATC
GGTTACAAGA CCGAAGGCAC CCTGCCAGTC TCGGTTTTCA CCAAGCCCGT AAATCCCGGC
GACAAGCTTC TGGTTTCCAT CGCCGGACGC GATCCGGAGG GCGGCTATTA TCTGCTCTCG
CGCACCCGGG TGCAGATTCC CACCGACTGG TCCGCGCTTG AGAAGGCCTT CGCTGACGAA
GCCACCATCA TGGGCACCGT AACCGGCGTT ATAAAAGGCG GCGTGACCGT TGACGTGGGC
GTACGTGCAT TCATGCCCGC CTCGCGTACC GGCACCCGCG ATGCCGCCGA GATGGAAAAG
CTCGTCGGCA GTGAAATTCG TTGCCGCATC ACCAAGATCG ATGTAGCCGA TGAAGACGTT
GTTGTTGACC GTCGCGCCGT TCTTGAAGAA GAAACCCGCG CGCAGGAAGG CCGACGCTAC
GAAGAGTTGC AGGAAGGCGC GACTGTTCAC GGCACCGTCC GAAGCCTCGC CGATTACGGC
GCGTTCGTAG ACATTGGCGG CGTGGATGCC CTTCTGCACG TGGCTGAAAT CTCGTGGTCG
CGCGTCAACA GCCCGGCTGA TGTCCTGACC GTTGGCCAGG AAGTCGAAGC CAAGGTCATA
AAAGTCGACC CCGAAAAGCG GCGCATTTCA CTGAGCATGA AACAGCTTCA GCCGCATCCG
TGGGACTCAG TGCCGTCGAA ATACAAAGTC GGCGACCGCG TGCGCGGAAC GGTCTCTCGC
CTGATGGATT TCGGCGCATT CGTTGAGCTT GAGCCCGGTA TCGAGGGAAT GATTCACGTC
TCCGAAATGT CATGGGCGAA AAAGGTCCGC AAGCCCAGCG ATCTCCTGAA AACTGGCGAC
AGCGTCGAAG CTGTCATCCT CGGCATCAAT CCGGCAGAAA AGCGCATTGC TCTCGGACTG
AAGCAGGCGC TCGGCGATCC CTGGAAAGAC GCGTCACAGA AGTTCGCCGC CGGAACCGTA
ATTGAAGGCC CAGTCACCAG CGTTCAAAAG TTCGGCGCAT TCGTGCAGTT GACCGAAGGC
GTGGAAGGCA TGGTGCACGT CAGTGAACTC AGCGACAAGC GCGTAGATCA TCCGCAAGAT
GTCGTGAAGC TCGGCCAGCG CGTGCAGGCG ATGGTCCTAG CGATCGATCC CGAGAAGCGC
CAGATCAAGC TGAGCATGAA GCAGCTCATC CCCACCGGCC TCGACGAATA CATCGCCGAG
CACAAACTTG GCGACATCGT GAGCGGACGT GTGCTCGAGG TCAATGGCGA GCGCGGACGC
GCGGAACTCG GCCAAGGCAT CCAGGCCGAA GCCAAGTTCA CGCAAAAGGC AGCTCAACCC
GCAGCGGCGG CCACCGCGAA AGCCGATCTC TCTTCCCTTA CCTCCATGCT TCAAAACAAG
TGGAAGAGCG GTGCGTCCGC GAGTTCGAAG TCTGAAGATC TCCGTGCCGG CCAAATCCGC
AGCTTCAAAA TTACGCGCCT CGACGCCGAC GCAAAGAAGA TCGAAGTCGA ACTCGAGTAA
 
Protein sequence
MSDSDNFSDI LKQYDHARNS KGQIEGTVVS VNDEFVFVDI GYKTEGTLPV SVFTKPVNPG 
DKLLVSIAGR DPEGGYYLLS RTRVQIPTDW SALEKAFADE ATIMGTVTGV IKGGVTVDVG
VRAFMPASRT GTRDAAEMEK LVGSEIRCRI TKIDVADEDV VVDRRAVLEE ETRAQEGRRY
EELQEGATVH GTVRSLADYG AFVDIGGVDA LLHVAEISWS RVNSPADVLT VGQEVEAKVI
KVDPEKRRIS LSMKQLQPHP WDSVPSKYKV GDRVRGTVSR LMDFGAFVEL EPGIEGMIHV
SEMSWAKKVR KPSDLLKTGD SVEAVILGIN PAEKRIALGL KQALGDPWKD ASQKFAAGTV
IEGPVTSVQK FGAFVQLTEG VEGMVHVSEL SDKRVDHPQD VVKLGQRVQA MVLAIDPEKR
QIKLSMKQLI PTGLDEYIAE HKLGDIVSGR VLEVNGERGR AELGQGIQAE AKFTQKAAQP
AAAATAKADL SSLTSMLQNK WKSGASASSK SEDLRAGQIR SFKITRLDAD AKKIEVELE