Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cpha266_2001 |
Symbol | |
ID | 4570829 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides DSM 266 |
Kingdom | Bacteria |
Replicon accession | NC_008639 |
Strand | - |
Start bp | 2315251 |
End bp | 2316567 |
Gene Length | 1317 bp |
Protein Length | 438 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 639766583 |
Product | ribulose-1,5-bisphosphate carboxylase/oxygenase large subunit |
Protein accession | YP_912438 |
Protein GI | 119357794 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1850] Ribulose 1,5-bisphosphate carboxylase, large subunit |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.000765161 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGCTTATGT ACAACGAGGA TATAAAGGGT TTTTTTTCCT CTCGGGAAGA ACTGAATATG CCGGATTATT TGACGCTGGA GTACTATCTG GAGTGCGTTG GCGACATTGA GACCGCTCTT GCGCATTTCT GCAGTGAGCA ATCGACAGCC CAGTGGAAGC GGGTCGGCAT CGATGAGGAT TTCCGTCCTC GTTATGCGGC AATGGTTATC GGTCTTGACG TTCTTGGCGA ATTACCGCAA CTCAGTTATT CGGTACCGCA TTCTGAAACC GGAAAAATTC ATGCTTGCAG GGTTACCATT GCGCATCCTC ATCGCAATTT TGGTCCGAAG CTTCCTAATC TCATTTCAGC GGTATGTGGT GAGGGTGTTT ATTTTACTCC AGGGGTACCT GTCGTGAAAC TTCTCGATAT CGGGTTTCCC CAGGAGTTCC TTTCCGCTTT TGACGGGCCT AAATTCGGTA TAGCCGGTAT TCGGGATCTT CTGAAGGCTT ATAACCGTCC GATTTTTTTC GGAGTTGTCA AGCCCAATAT TGGTCTCAGC CCCGAGCATT TCGGCGAAAT TGCTTTTCAG AGCTGGCTTG GCGGACTCGA TATAGCCAAA GATGATGAAA TGCTTGCTGA TGTCGAGTGG TCAACCCTTG CGGATCGTTC TTGCGAACTT GGCAAGGCCC GTATCCGAGC TGAAAAAGAG ACCGGCGAGC CTAAAGTTTA CCTTGCCAAT ATAACCGATG AGGTTGATCA GCTTATCAAT CAGCATGACA TCGCTGTGAA AAATGGCGCC AATGCATTGC TTATTAACGC TTTACCGGTT GGTTTGAGTG CGGTCAGAAT GCTTGCGAAG CATACAAAGG TTCCGCTTAT CGGCCATTTT CCTTTTATCG CAGCCTTCAG CAGAATGGAG AAGTTTGGTG TACATTCAAG GGTTATGACC AAATTGCAGC GGCTTGCCGG TCTTGATTCC ATTATTATGC CGGGTTTTGG CAGCCGGATG ATGACGGCGG AAGAAGAGGT CAGGGATAAT ATCCAGGAGT GTTTGCAGGA GTTTGGTCAT ATCAGGCCGT CTCTTCCTGT TCCGGGAGGA AGCGATTCCG CCCTTACTCT TGAAGCTGTG TACCGCAAGG TCGGTAGTGT TGATTTCGGA TTTGTGCCCG GACGCGGTGT TTTTGGTCAT CCTATGGGGC CGAAGGCAGG CGCCAGCAGT ATTCGCCAGG CCTGGGATGC GATAGAAAAG GGTCTCTCTC TTGATGAATA TGCGATCGGG CGACCTGAGC TGATGGCGAT GGTTGCTGTC GAAAATGCTA AAAAACAGCA CTCCTGA
|
Protein sequence | MLMYNEDIKG FFSSREELNM PDYLTLEYYL ECVGDIETAL AHFCSEQSTA QWKRVGIDED FRPRYAAMVI GLDVLGELPQ LSYSVPHSET GKIHACRVTI AHPHRNFGPK LPNLISAVCG EGVYFTPGVP VVKLLDIGFP QEFLSAFDGP KFGIAGIRDL LKAYNRPIFF GVVKPNIGLS PEHFGEIAFQ SWLGGLDIAK DDEMLADVEW STLADRSCEL GKARIRAEKE TGEPKVYLAN ITDEVDQLIN QHDIAVKNGA NALLINALPV GLSAVRMLAK HTKVPLIGHF PFIAAFSRME KFGVHSRVMT KLQRLAGLDS IIMPGFGSRM MTAEEEVRDN IQECLQEFGH IRPSLPVPGG SDSALTLEAV YRKVGSVDFG FVPGRGVFGH PMGPKAGASS IRQAWDAIEK GLSLDEYAIG RPELMAMVAV ENAKKQHS
|
| |