Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cpha266_0209 |
Symbol | |
ID | 4570590 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides DSM 266 |
Kingdom | Bacteria |
Replicon accession | NC_008639 |
Strand | + |
Start bp | 233837 |
End bp | 235177 |
Gene Length | 1341 bp |
Protein Length | 446 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 639764809 |
Product | pentapeptide repeat-containing protein |
Protein accession | YP_910700 |
Protein GI | 119356056 |
COG category | [S] Function unknown |
COG ID | [COG1357] Uncharacterized low-complexity proteins |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAACAGA TTTGGCACCT TGCAGGTATC GTTCTTTTAA CCGTTTCGGG TTTTCTTCCA GTTTCATCAG CCATCGGGTT TGATCCTGAT GCCGTTAAGC TGCTGGTGAA AAGTCCGAAA GAGTGGAACG CCTTTCGCCA GCAGCATTCC ATGCAGCCTG TTGATCTTGA CAAGGCAAAA CTTGAAGATG CCGATCTTGA GGGGGCGAAC CTCAGCAACA GTTCGCTTGT CAGAGCCGAG TTGAGCGGGG CTAATCTGAA CAATGCCGAT TTACGAGGAT CAAACCTTCA GCAGGCGTTT ATTAAAAAAG CGGATCTTAA AGGGGCCGAC TTGCGTGAGG CTTACCTTGT CAAGGTGAAT CTTAAAGAAG CATTCATGGA GAAGTCCATG CTTCAAAAAG CGAATCTTCA AAGCGCTAAT CTCAGATGGA CAAGATTTCA CAGGGCAGAT CTTGCAGGAT CGAACCTTCA GGATGCCGTT CTGTTTGAAA CCAGCTTTGT TGATGCTGAT CTTCGGGGCG CCAATCTCAA GGGTGCGCTC TATGTTGGAA ATGCCAATTT CAGCGGAGCA AAAATATCCA GTAACACCAT TACGCCTTCT GGTGAAAAAG CAACAGCGTC TTGGGCTGCG ATACGCGATG CTGAATATAT CAAAGAAGCG GATGCGGCAA TGCCGGTTTA TGCTTCCCTT CCGGTTATGG TATTTGCATC CCCATCGGCA GGCCTGAAGT CCGCATCCGC TTCCTCCGGT CAGGCAGTGA TGGGCAGCAA ACAGCAGCAA GCGCTGATGG TTGATGATGT CGAAACCTGG AATACCATGC GTGCCGCCAA TCCTGAGCTG AAAATAGAGT TGAAAGAGGA AAAACTTGAA AATGCAAGGC TGAAGGGAGT GAATTTGCAG AAAGCTTCGA TGCCTGGTGC CGATTTCGAG GATGCAAATC TTGACGAGGC CATGATGGAG GGGGCCGATC TGAGTAAAGC CGATTTCCAG AAAGCGGATA TGAAAAAAGT TAAACTTCAG GGGGCAAATC TTTCCGGAGC GAATCTTGAC CGTTCATTCA TGGAAGGGGC AGATCTGCGC AATGCCAATC TCAGCGGAGC GAACCTGTTC GGAGCTATGC TTAAGGATGC CAATCTCAGC GGCGCTAATC TCAGCGGAGC ATCCCTGTTT GAGACAGATC TTGAGGGAGC AAATCTTTCC GGAGCGAATC TTAAGGGAGC AAATCTCGTG GAGCCTAACC TGAAAAATGC GATCATCTCA CCGGACACCA TTCTTCCATC CGGCAAGAAT GCGACGCAGA GTTGGGCGGT TATAAAGGGA GCGACATTTG TTAAGCCTTA G
|
Protein sequence | MKQIWHLAGI VLLTVSGFLP VSSAIGFDPD AVKLLVKSPK EWNAFRQQHS MQPVDLDKAK LEDADLEGAN LSNSSLVRAE LSGANLNNAD LRGSNLQQAF IKKADLKGAD LREAYLVKVN LKEAFMEKSM LQKANLQSAN LRWTRFHRAD LAGSNLQDAV LFETSFVDAD LRGANLKGAL YVGNANFSGA KISSNTITPS GEKATASWAA IRDAEYIKEA DAAMPVYASL PVMVFASPSA GLKSASASSG QAVMGSKQQQ ALMVDDVETW NTMRAANPEL KIELKEEKLE NARLKGVNLQ KASMPGADFE DANLDEAMME GADLSKADFQ KADMKKVKLQ GANLSGANLD RSFMEGADLR NANLSGANLF GAMLKDANLS GANLSGASLF ETDLEGANLS GANLKGANLV EPNLKNAIIS PDTILPSGKN ATQSWAVIKG ATFVKP
|
| |