Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Clim_1826 |
Symbol | |
ID | 6355166 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium limicola DSM 245 |
Kingdom | Bacteria |
Replicon accession | NC_010803 |
Strand | + |
Start bp | 2001981 |
End bp | 2003336 |
Gene Length | 1356 bp |
Protein Length | 451 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 642669429 |
Product | sun protein |
Protein accession | YP_001943844 |
Protein GI | 189347315 |
COG category | [J] Translation, ribosomal structure and biogenesis [K] Transcription |
COG ID | [COG0144] tRNA and rRNA cytosine-C5-methylases [COG0781] Transcription termination factor |
TIGRFAM ID | [TIGR00563] ribosomal RNA small subunit methyltransferase RsmB [TIGR01951] transcription antitermination factor NusB |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 39 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGAAAA AAGAATCCAT GACATCCAGA GAACTTGCCC TGAAAGTCCT CCAGAACATT GAAAGCGGAG AAAAAAAATC GGGAACTGCC CTGCACGAAC AGCTTTCGCA CACATCGCTC AACCAGAATG ACCGTGCACT CGCAACGGAA CTCGTCAACG GCGTCCTGCG CCGCCGCCTG ACTCTCGACA CCTTCATCGG CAGATACTAC CATCACCGCT ATGAAAAAGC CGCTCCTGTA CTGAAAAACA TTCTTCGGAT CGGAGCCTAC CAGCTGATCT ATCTCGACCG TATTCCCCAC TGGGCAGCAG TCAACGAATC AGTCTCACTG GCACGCAGAT ACAAAGGAGA GCACATGGCA AAACTGGTCA ATGCCGTCCT CAGAAACATC ACTCCCGAAA CCATATCGGC TGACCTCGAC CCTTCAGGAA AAATGAAGGA AATCCAGCGT CTTTCAATCA CCGCCTCTCA TCCCGAATGG CTGCTGGAAC GATGGATCGC AAGATATGGA AAAGAACGCG CCGAAGCCAT CCTCGCATAC AACAACCGCA CGCCGCTCAC CGGCTTCAGA ATCAATCCCC TGAAAACGAC CCCTCAGGAA TTTCTCGCGA CCGAAAACTC GAAAACCGCC CGGATCGGAA AAAGCGGACT TGAGAACTTC TTCCTCTCAA CGGAGTTCTC CTGTTATGAA TCCGCCATCA AACAGGGCCT GCTCACCGTA CAGAACCCGA CGCAGGGCCT TGCCTGCCTG CTCTGCGATC CGCAGCCCGG AAGCACCCTC CTCGACCTCT GCGCTGCACC CGGAGGCAAA TCCACATTCT GCGGCGAACT CATGCAAAAC ACGGGAAACA TAACCGCCGT TGACCTTTAC AGCCAGAAAC TGCAGAAACT ATCCGAACAC GCAGGCGAAC TCGGCATCAC CATCATCTCG ACCGCCGAAG CCGATGCCCG TACATATTTG CCCGAACAAC CTCCTGCGGT CATTCTGCTC GATGCACCCT GCAGCGGCAC CGGCGTCCTT GCAAGACGAG CCGAACTGCG CTGGAAACTC ACGTCTGAAA CAATAACGGA ACTCGCACTC CTGCAGGTCG AACTGCTCGA CCATGCCGCA TCGCTGCTTG CCGAAAACGG TACGCTTCTC TACGCCACCT GCTCCATAGA ACCCAAAGAG AACGAGCAGC AGATAGAAGC ATTTCTCATC CGTCACCCGG AATTCATGCG GGACCCGGCC AGCGGTGCCC TTCCCGAACC CTTCGCATCG AAAGCCGTGC CGAACGGAAC CCTCCTGACG CTGCCGGGCG AACACGAAGG CTTTGACGGA GGCTTTGCCC AACGGTTGAA AAAAACCGCT CCATGA
|
Protein sequence | MTKKESMTSR ELALKVLQNI ESGEKKSGTA LHEQLSHTSL NQNDRALATE LVNGVLRRRL TLDTFIGRYY HHRYEKAAPV LKNILRIGAY QLIYLDRIPH WAAVNESVSL ARRYKGEHMA KLVNAVLRNI TPETISADLD PSGKMKEIQR LSITASHPEW LLERWIARYG KERAEAILAY NNRTPLTGFR INPLKTTPQE FLATENSKTA RIGKSGLENF FLSTEFSCYE SAIKQGLLTV QNPTQGLACL LCDPQPGSTL LDLCAAPGGK STFCGELMQN TGNITAVDLY SQKLQKLSEH AGELGITIIS TAEADARTYL PEQPPAVILL DAPCSGTGVL ARRAELRWKL TSETITELAL LQVELLDHAA SLLAENGTLL YATCSIEPKE NEQQIEAFLI RHPEFMRDPA SGALPEPFAS KAVPNGTLLT LPGEHEGFDG GFAQRLKKTA P
|
| |