Gene Information |
Locus tag | Cpha266_1326 |
Symbol | |
ID | 4570904 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides DSM 266 |
Kingdom | Bacteria |
Replicon accession | NC_008639 |
Strand | - |
Start bp | 1518585 |
End bp | 1520078 |
Gene Length | 1494 bp |
Protein Length | 497 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 639765915 |
Product | pentapeptide repeat-containing protein |
Protein accession | YP_911781 |
Protein GI | 119357137 |
COG category | [S] Function unknown |
COG ID | [COG1357] Uncharacterized low-complexity proteins |
TIGRFAM ID | [TIGR02145] Fibrobacter succinogenes major paralogous domain |
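The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes that Start bp and End bp are 1-based, inclusive positions on the replicon and that the gene length includes the stop codon. The record does not state either convention explicitly, and the function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Protein length, assuming the CDS ends with one stop codon (an assumption)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the terminal stop codon.
    return cds_length_bp // 3 - 1


length = gene_length_bp(1518585, 1520078)
print(length)                     # 1494
print(protein_length_aa(length))  # 497
```

Under these assumptions the values agree with the Gene Length (1494 bp) and Protein Length (497 aa) fields of this record.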
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCTCACC GGAGTTTCAA ACAACTCTTG CAGCAATTAC GCGACCGAAA CCGGGCGCCC CGCAAATGGT GGTATCCTGT GCCAAAAACA GATCCCCGTC ACCGGGATGT CATCGAGCAG TTGCATGAGA ACCACTCGAA AACGATCAAC AAAACCATGT TTTCGCTGCT CGGCGTCGGG CTCTACTGCC TCCTCAAGGT TCTTGGCGAA TCCGACAAAT CGCTTATCGT CGCAATCACA ACCATTCAGA CGCCGCTGGT TGGCACCTCC ATCTCGTTTC AGAGCTTTCT TCTCATAGCG CCCGTTCTGC TTGTCATCCT TGTAACATAC CTGCACATTC TCTACGGCTA CTGGCTGCAA CTCGAACGGA AACGGAAGGA GATAAACGAA GCGGCAGAGA GAGAGGGTGC GCCTACCATC GAGAGCATTC CCTCTCTTTT CAGCTTTCCC GATCGCCTGC CGCGCCTCTT CACCAACCTG ATATTTTACT GGTTCGTACC ACTCACGCTG TGGATTATGG CCAATAAAAC TTTTGCCCTC AGTGAATTAC GTTATCCTCT TTTTATGATC GCATCCGTTG TTACCCTTGC GATGCTGTTT TTACAGATCT ACCGTTGCCA ATCGAAGCGG TGGCTCCGGA ATTCGCCCCG ATGGGTAGCC TCAGGAGTAC TCTTGCTTTA TATGGTATAC GTTTCCTCCA ATCCCGAATC CCTCCGCAGA CCCCTGAATC TTCAGCGGGA GGATCTTCAT GGATCCTGGC TGCAAGGTCT CGATATGGCT GGTGCCGATA TGAATAACGC CAATCTTCAG GGAGCAAACC TTTCAGGGGC TGATTTGCGG AATGTCAATC TTCAGAATGC CAACCTTCAG GAAGCCGATC TCAGAAACTC GAAATTGCAG GGAGCTGATC TGAGATACGC AAAGTTTCAG AAATCCATTA TAGGGAATGC TGATTTCGAA GGGGCGGAGC TTGACCATGC AGACTTCAGG GATGCAAAAG AGAATGATCC CAATCGGTTC AAGGCTGCCA ATAACTATAA ATGCGCATTT TTCAGCGAGG GTTTACTTTC AGAACTATCT CTTTCGCCAA CCCACAATCA AGACCTTGAA AGAATTGGTA CTTACTATCG CGGGGGAAAA ATTGCCTATA TTTTTCAGCC GGGTGATCAG GGTTATATCG AGGGAGAACA GCATGGGGTG ATTGCTGCTA TAACGGATCT TCCGGGAGAA GACAAATACA CCTGGGATGC GGCAATAAAA GCCTGTGACG AGTTGGCAGA AAACGGTTAC AACGACTGGC GATTGCCGAG CCAGGACGAG TTGAACCAGC TCTATCTCAA CCGGAGTGCT GTTGGCGGTT TTGCTCCCGG CTTCTACTGG AGTTCTACGG AGAACGCTGC GTTCAACGCA TGGCTACAGA ACTTCGACGA TGGGTTCCAG CTCGACTTCA TCAAGAACCT CGAGTGGCGG GTGCGGCCTG TCCGGGCTTT TTAA
Protein sequence | MPHRSFKQLL QQLRDRNRAP RKWWYPVPKT DPRHRDVIEQ LHENHSKTIN KTMFSLLGVG LYCLLKVLGE SDKSLIVAIT TIQTPLVGTS ISFQSFLLIA PVLLVILVTY LHILYGYWLQ LERKRKEINE AAEREGAPTI ESIPSLFSFP DRLPRLFTNL IFYWFVPLTL WIMANKTFAL SELRYPLFMI ASVVTLAMLF LQIYRCQSKR WLRNSPRWVA SGVLLLYMVY VSSNPESLRR PLNLQREDLH GSWLQGLDMA GADMNNANLQ GANLSGADLR NVNLQNANLQ EADLRNSKLQ GADLRYAKFQ KSIIGNADFE GAELDHADFR DAKENDPNRF KAANNYKCAF FSEGLLSELS LSPTHNQDLE RIGTYYRGGK IAYIFQPGDQ GYIEGEQHGV IAAITDLPGE DKYTWDAAIK ACDELAENGY NDWRLPSQDE LNQLYLNRSA VGGFAPGFYW SSTENAAFNA WLQNFDDGFQ LDFIKNLEWR VRPVRAF
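The gene and protein sequences above are shown in space-separated blocks of ten characters. As a rough sketch of how the GC content and Translation table fields relate to these sequences, the Python below strips the spacing, computes the GC fraction, and translates a coding sequence. It assumes the displayed gene sequence is already in coding orientation (it starts with ATG and ends with TAA even though the strand is "-"). The amino acid assignments of translation table 11 are the same as those of the standard code (the tables differ in their start codons), so one codon table serves here. All function names are hypothetical and not part of any database tool.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(raw: str) -> str:
    """Remove the block spacing and normalize to upper case."""
    return "".join(raw.split()).upper()


def gc_fraction(raw: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(raw: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean_sequence(raw)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First two blocks of the gene sequence in this record.
print(translate("ATGCCTCACC GGAGTTTCAA"))  # MPHRSF
```

The printed residues match the start of the protein sequence listed above (MPHRSF...). Passing the full gene sequence to `gc_fraction` and `translate` would allow the GC content and protein sequence fields to be compared in the same way.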