Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cpha266_1768 |
Symbol | |
ID | 4570112 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides DSM 266 |
Kingdom | Bacteria |
Replicon accession | NC_008639 |
Strand | + |
Start bp | 2010117 |
End bp | 2011451 |
Gene Length | 1335 bp |
Protein Length | 444 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 639766351 |
Product | sun protein |
Protein accession | YP_912209 |
Protein GI | 119357565 |
COG category | [J] Translation, ribosomal structure and biogenesis [K] Transcription |
COG ID | [COG0144] tRNA and rRNA cytosine-C5-methylases [COG0781] Transcription termination factor |
TIGRFAM ID | [TIGR00563] ribosomal RNA small subunit methyltransferase RsmB [TIGR01951] transcription antitermination factor NusB |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATGCAA GAGAAGCTGC CCTCAAGGCG CTGCAGGCTA TTGAACCCGG CAAGGAAAAA TCAGACCGGA TAGTACACGC AATACTCGAC AGGGCAACAA TGAACAGGCA AGACCGGGCT CTGACAACTG AACTGGTCAA CGGTGTCCTG CGAATGCGCA AAAAAATCGA TTTTATTATT TCGAAATTCT ATCATCATCG CTTTGAAAAA GCAGCACCTG TTCTACAGAA TATCCTGCGA CTTGGCGTCT ATCAACTGCT CTTTCTGGAA AAAATCCCGG AATGGGCCGC AGTCAGTGAA TGTGTGGATC TTGCAAGAAA GTACAAAGGC GAGCGGATGG CAAAACTTGT CAACGGTGTC TTGAGAAAAA TCACCCCTGA TAACGTCGTT ATGGACGAGT GGCTGAAAGG TTGTGAGGAT ATGGAACGAC TATCGGTGCA ATACTCCCAT CCAGAGTGGC TCATCAATCG ATGGAACGCT GTTTATGGCA GAGAAACCAC GCTTGCTTCC ATGACCTATA ACAACCATGC CCCCCTTTTT GGATTCAGAA TCAACACACT GAAACAAACG CCTGATGAAT TCCTTGCCGA TCCAGCACAT TCGTCCTTTC CTCAAGAGCG GTGCTTGACA GGTAATTTTT TCCTTTCAAA GGAGTTTGCC GGATTTGAGG CCTGCCTGAA ATCGGGAAAA CTGACCGTTC AGAATCCGAC GCAGGGAGTT GCATGCCTGC TGCTCAACCC CGTACCAGAA AGCAGGGTCC TTGATCTGTG TGCGGCACCG GGCGGCAAAG CCACGTTCAT GGCTGAGCTC ATGCAGAACA AAGGTTCAAT CACAGCAGTT GACCGATCGA GCGAAAAACT CGAAAAAACC AGGCAACATG CTGTTGAACT CGGCATAACA ATCATCAAAA CAATCTGCGC CGATGCCCGG TCATTTGTCC CGGAAGAGAC ACCACAAGCC GTTCTTCTCG ATGCGCCCTG TACAGGAACC GGCGTTTTGC AGAAACGTGC CGAACTCCGA TGGAAACTCT CAATGGAGAT GCTACAGGAA CTGGTAACGC TCCAGAGGGA ATTGCTTGAC CATGCAGCTT CGATATTGCC GGTAAACGGT ATTCTGCTCT ATGCAACCTG TTCGATAGAA CCGGAAGAAA ATGAGTTGCA AATAGAGGCT TTTCTGCGTC GTCATCCTGA ATTTTCAAGA GACACCTCTT GCGGCTCTCT CCCTGAACCG TTCAGGATGA GCGCGGCTGA AAAGGGTTCC ATCCTTACCC TTCCGGGCGA GCTTCCAGGA TTTGACGGGG GATTTGCTCA ACGGTTACGA AAAAACGCAC GGTAA
|
Protein sequence | MNAREAALKA LQAIEPGKEK SDRIVHAILD RATMNRQDRA LTTELVNGVL RMRKKIDFII SKFYHHRFEK AAPVLQNILR LGVYQLLFLE KIPEWAAVSE CVDLARKYKG ERMAKLVNGV LRKITPDNVV MDEWLKGCED MERLSVQYSH PEWLINRWNA VYGRETTLAS MTYNNHAPLF GFRINTLKQT PDEFLADPAH SSFPQERCLT GNFFLSKEFA GFEACLKSGK LTVQNPTQGV ACLLLNPVPE SRVLDLCAAP GGKATFMAEL MQNKGSITAV DRSSEKLEKT RQHAVELGIT IIKTICADAR SFVPEETPQA VLLDAPCTGT GVLQKRAELR WKLSMEMLQE LVTLQRELLD HAASILPVNG ILLYATCSIE PEENELQIEA FLRRHPEFSR DTSCGSLPEP FRMSAAEKGS ILTLPGELPG FDGGFAQRLR KNAR
|
| |