Gene Information |
Locus tag | Hhal_2321 |
Symbol | |
ID | 4709376 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhodospira halophila SL1 |
Kingdom | Bacteria |
Replicon accession | NC_008789 |
Strand | - |
Start bp | 2546549 |
End bp | 2547838 |
Gene Length | 1290 bp |
Protein Length | 429 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 639856796 |
Product | sun protein |
Protein accession | YP_001003886 |
Protein GI | 121999099 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0144] tRNA and rRNA cytosine-C5-methylases |
TIGRFAM ID | [TIGR00563] ribosomal RNA small subunit methyltransferase RsmB |
|
|
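The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that Gene Length counts the stop codon while Protein Length does not. Neither convention is stated in the record, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a CDS, assuming its final codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(2546549, 2547838)
    print(length)                           # 1290, matches "Gene Length"
    print(expected_protein_length(length))  # 429, matches "Protein Length"
```

The strand field does not affect this calculation, because Start bp is smaller than End bp regardless of orientation in this record.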
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.0104634 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAGTACCC CCCCGCGGGT CGCCGCGGTG CACGTCCTTG AACGGGTGCT GGAGCGCGGC GAGACCCTGG ATGAGGCCCT GGAGGCCCAG TTTTCCCGGG TATCCGAGCG CAATCGGTCG TTGCTTCAGG CCCTGGTCTA CGGCGCGCTG CGCTGGCTGA CCCGGCTCGA GGCCCAGGTG GCGACCCTGA CCCCGCGCGA CGACTGGCGC GCCGATCCGC TGCTACGTGG TCTGCTGGTG ATCGGGGCCT GGGAGGCCCA GGGGCTTGCC ACCCCAGCCC ATGCGGCGGT CTCCGAGGCC GTCGATGCGG CCCGACGGCT GCGTCGGGCG CGCGCGGCCG GCATGGTGAA CGCGGTGCTG CGCAAGCTGC ACAAGGCCAC CCCACCCGGG CCGGCAGACG AAGCCGCCCG CTACGCCCTG CCGCCGTGGC TGCTGGACCA TCTACGCCGC GCCTGGCCGG AGGATTGGCC GGCGGTGGCC GAGGCCGGGA ACGCTCACCC GCCAATGACC CTGCGCTTCG ACCGGAACCG CATCTCCCGC GAGGCGTGCC TGCAATCGCT GGCCGAGCAG GAGATCCCGG CGCACCCCGG CGAGGTCGCG CCGAGCGCCG CGACCCTCCA TGCCCCGGTA CCCGTGGCCC GCCTGCCCGG CTTCGCCGAG GGGTGGTTGT CGGTGCAGGA TGAGGCCGCG CAGCTCGCCG CCCCGCTTCT CGACCCGCAA CCCGGCGACC GGGTGCTCGA TGCCTGTGCC GCACCGGGTG GTAAGACCCT GCACCTGCTC GAGCACACGC CGACGGCGGC GGTGACCGCC CTCGACCGTT CGTCACGCCG GCTGCGCCAG GTACGCGATA ACCTCGCGCG TGGCGGCTAC GAGGCCCAAT GCCTGGCCGC CGATGCCGCC GACCCCGAAG CGTGGTGGGA CGGTGAACCG TTTCAGCGCA TCCTCCTCGA CGCCCCGTGC ACGGGCAGCG GCGTGATCCG CCGACACCCC GATATCAAGT GGCTGCGCGG GGTCGACGAC CCGGCGCGCA TGGCGGCGGC GCAGCGCCAC CTGCTCGCCG CCCTGTGGCG CGTGCTGGCG CCGGGCGGGC GGTTGCTCTA CGCCACCTGC TCGATCTTCC CCGAGGAGAA CGAGCAGGTG GTGGCCGGTT TCCTGGCCGA GCACGCCGAT GCCAAACCGG GGCCGTTGGC GCCGGTGGGC CGCTGCACCG GGTCCGGGTG CCAGATCCTG CCCGGCGAGC ACGGGATGGA TGGGTTCTTC TACGCCTGCC TCGAGCGGAG TGCCGCATGA
|
Protein sequence | MSTPPRVAAV HVLERVLERG ETLDEALEAQ FSRVSERNRS LLQALVYGAL RWLTRLEAQV ATLTPRDDWR ADPLLRGLLV IGAWEAQGLA TPAHAAVSEA VDAARRLRRA RAAGMVNAVL RKLHKATPPG PADEAARYAL PPWLLDHLRR AWPEDWPAVA EAGNAHPPMT LRFDRNRISR EACLQSLAEQ EIPAHPGEVA PSAATLHAPV PVARLPGFAE GWLSVQDEAA QLAAPLLDPQ PGDRVLDACA APGGKTLHLL EHTPTAAVTA LDRSSRRLRQ VRDNLARGGY EAQCLAADAA DPEAWWDGEP FQRILLDAPC TGSGVIRRHP DIKWLRGVDD PARMAAAQRH LLAALWRVLA PGGRLLYATC SIFPEENEQV VAGFLAEHAD AKPGPLAPVG RCTGSGCQIL PGEHGMDGFF YACLERSAA
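The relationship between the two sequence fields can be sketched as follows. The gene sequence as displayed begins with GTG and ends with TGA, so it appears to be given in coding orientation already, even though the gene lies on the minus strand. The sketch therefore assumes that no reverse complement is needed. It uses the standard codon assignments, which translation table 11 shares for elongation, and reads the first codon as methionine because GTG is one of the alternative start codons in that table. The helper names `translate_cds` and `gc_fraction` are illustrative and not part of any database tool.

```python
from itertools import product

# Standard codon assignments, with bases ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the spaces that group the sequence into blocks of 10."""
    return "".join(seq.split()).upper()


def translate_cds(dna: str) -> str:
    """Translate a CDS given in coding orientation, stopping at the first stop.

    The first codon is always read as M (assumption: it is a valid start codon).
    """
    seq = clean(dna)
    usable = len(seq) - len(seq) % 3  # ignore a trailing partial codon
    protein = []
    for i in range(0, usable, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append("M" if i == 0 else aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(dna)
    return (seq.count("G") + seq.count("C")) / len(seq)


if __name__ == "__main__":
    # First 21 bases of the gene sequence field.
    prefix = "GTGAGTACCC CCCCGCGGGT C"
    print(translate_cds(prefix))  # MSTPPRV, the start of the protein field
```

Passing the complete Gene sequence field to `translate_cds` and `gc_fraction` should allow a comparison against the Protein sequence and GC content fields of the record.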
|
| |