Gene Information |
Locus tag | Acid345_4229 |
Symbol | |
ID | 4073155 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | + |
Start bp | 5011960 |
End bp | 5013258 |
Gene Length | 1299 bp |
Protein Length | 432 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 637986260 |
Product | sun protein |
Protein accession | YP_593303 |
Protein GI | 94971255 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0144] tRNA and rRNA cytosine-C5-methylases |
TIGRFAM ID | [TIGR00563] ribosomal RNA small subunit methyltransferase RsmB |
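
The length fields above follow from the coordinates. A minimal sketch of that arithmetic, assuming 1-based inclusive coordinates and a stop codon that is counted in the gene length but not in the protein length (the function names are illustrative, not part of any database API):

```python
# Values copied from the record above.
START_BP = 5011960
END_BP = 5013258


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1299 432
```

Both results agree with the "Gene Length" and "Protein Length" rows of the record.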
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.494405 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGTCCAAGA ATCCCTCCCG CGCCACCGCC TTCGATATCC TTCTCCGCGT CGAGCGCGAC CAGGCCTTTG CCTCGGAACT GCTCCACTCC GATCGTCTCA ACGATCTCTC CGCACCAGAT CGCGGCCTTG CCACCGAACT CGTCATGGGC ACGTTGCGCT GGCAATCCAC ACTCGACGCA CTCGTCGCCA CGCAATCCTC ACAGCCACTC CGCAAACTCG ACATCGAGGT CCTGATCGCA CTTCGTCTCG CGGCCTACCA ACTACAGTTT CTCGACCGCA TCCCCGCCAA CGCCGCCGTA AACGAGAGCG TGGAATTGGT GAAGCGCGCG CGCAAACGCT CCGCCGTGCC GTTCGCCAAC GCAGTTCTGC GCAAAATCTC CAAGCTTCCA CGCGAAATCC ATGGTGATCT CGCCCATCCT GCGTGGCTAG TGGCGCGCTG GCGCGACAAT TACGGCGGGG ATGCCGCAGA ATCCATCTCC AAATACGGTC AAACTACGCC GGAAACCGCG CTACGGCTCC CATTCGACGC CGAAAAACGC GCCAAAGTTG AGGCAGAACT CCAGGAAAAC GGCGTAGAAC TCGCTCCCGG AAGGCTCCTG AACGCCGCAC GGCGCCTCGT CAGCGGCGAC CTCAGCGGCA CCGCGGCCTT CCAACGCGGC GACGTCTGGA TCCAGGATGA AGCCTCCCAA CTCGTCGCCC TTCTCACGGG CCACGGCGAT CGCATTCTCG ACTGCTGCGC CGCTCCCGGC GGAAAAACTT CCGTCTTGGC CGAGCGCAAT CCTTCGTCGA AAATTGTCGC CCTAGAACTC CACGAACAAC GTGCACGCCT ACTTCGAGAA CGCGTTCGCG CGTCAAACGT GGATGTGCAA ACCGCCGATG CCACGAATTT CCGCGCCGAA ACCGCGTTTG ACTGCGTCCT AGCCGACGTC CCTTGCTCCG GTACCGGCAC CCTCGCTCGC AATCCCGAAA TCAAGTGGCG CCTAAAGCCC GAGGACCTCG CCGATCTCCA GCAACGCCAG ATCGCCATCC TCCGCGCCGC CCTAAGCCAA CTCGCGCCCG GCGGCCGCCT CGTCTACTCG ACATGCTCTC TCGAACCAGA AGAAGGTGAA GCCGTAGTCG AAGCCTCGCT GACCGACGAG TTCGAACTCC AACCCGCAGC GCCCGAACTT GAGCAATTCG CACCCGCATT CGCCATCCCC GACCCGCAGA CTCTCGTCCG CGGCCCCTAC CTTCGCACCA TCCCTGGCAT TCACCCCTGC GAAGGATTCT TCGCCGCCGT AATCACACGT CGCCAGTGA
Protein sequence | MSKNPSRATA FDILLRVERD QAFASELLHS DRLNDLSAPD RGLATELVMG TLRWQSTLDA LVATQSSQPL RKLDIEVLIA LRLAAYQLQF LDRIPANAAV NESVELVKRA RKRSAVPFAN AVLRKISKLP REIHGDLAHP AWLVARWRDN YGGDAAESIS KYGQTTPETA LRLPFDAEKR AKVEAELQEN GVELAPGRLL NAARRLVSGD LSGTAAFQRG DVWIQDEASQ LVALLTGHGD RILDCCAAPG GKTSVLAERN PSSKIVALEL HEQRARLLRE RVRASNVDVQ TADATNFRAE TAFDCVLADV PCSGTGTLAR NPEIKWRLKP EDLADLQQRQ IAILRAALSQ LAPGGRLVYS TCSLEPEEGE AVVEASLTDE FELQPAAPEL EQFAPAFAIP DPQTLVRGPY LRTIPGIHPC EGFFAAVITR RQ
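
The protein sequence can be related to the gene sequence by translating it codon by codon. The sketch below is a hand-written illustration, not the database's own pipeline: it assumes the standard codon assignments (which translation table 11 uses for elongation), treats only ATG, GTG and TTG as start codons (an assumed subset of the table 11 starts), and strips the whitespace that separates the 10-base blocks. The name `translate_cds` is hypothetical; in practice a library such as Biopython would normally be used instead.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: a subset of the start codons allowed by translation table 11.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(gene_sequence: str) -> str:
    """Translate an unspliced CDS; spaces between 10-base blocks are ignored."""
    dna = "".join(gene_sequence.split()).upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    codons = [dna[i:i + 3] for i in range(0, len(dna), 3)]
    residues = [CODON_TABLE[codon] for codon in codons]
    # An alternative start codon such as GTG is read as methionine.
    if codons and codons[0] in START_CODONS:
        residues[0] = "M"
    # The terminal stop codon is not part of the protein.
    if residues and residues[-1] == "*":
        residues.pop()
    return "".join(residues)


# First 15 and last 9 bases of the gene sequence above, joined for brevity.
print(translate_cds("GTGTCCAAGA ATCCC CGCCAGTGA"))  # MSKNPRQ
```

The excerpt reproduces the first five residues (MSKNP) and the last two (RQ) of the protein sequence above; passing the full "Gene sequence" field would translate the whole CDS the same way.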