Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cpha266_1580 |
Symbol | |
ID | 4569762 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides DSM 266 |
Kingdom | Bacteria |
Replicon accession | NC_008639 |
Strand | - |
Start bp | 1796058 |
End bp | 1796933 |
Gene Length | 876 bp |
Protein Length | 291 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 639766162 |
Product | heat shock protein HtpX |
Protein accession | YP_912026 |
Protein GI | 119357382 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0501] Zn-dependent protease with chaperone function |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.00000235294 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAACGGG TGGTTCTTTT TTTGTTTACC AACCTTGCGG TGATGCTGGT GTTGTCGGTC AGTGCCCGTG TTCTGGGCGT AGACCGATTT TTGACCGGCA ACGGTCTGGA TATGGGCATG CTGCTTCTGT TTGCTGCTTT AATCGGTTTT GGCGGATCCT TTATTTCTCT TCTGATGTCC AAAACCATGG CGAAATGGAG TACCGGCGCA CGGGTTATCC AGCAACCCGC CAACCAGAAC GAGGTATGGC TCGTTGATAC CGTGAGTCAG CTTTCCAAAA AAGCCGGTTT GGCGATGCCC GAGGTGGCCA TCTACGACGG TGCTCCGAAT GCCTTCGCCA CAGGCCCCAG CAAGTCGAGA TCGCTGGTGG CGGTCTCGAC CGGACTGTTG CAGAGCATGG ATCGAAAACA GGTGGAAGCC GTGTTGGCTC ACGAGGTCGC CCACATCGAT AACGGCGACA TGGTTACCTT GACGCTGATA CAGGGTGTGC TCAATACCTT CGTGATTTTT CTGTCGCGCG TCATTGCCTA TGCTATTGAC AGCTTTCTTC GCAGCGACGA CGACGAGTCC GGCAGTCCGG GTATCGGCTA CTGGATCAGC AGCATTATTT TTGAAATCAT GTTCGGCATT CTGGCAAGCG TCGTCGTCAT GTACTTTTCT CGCAAGCGTG AGTATCGGGC CGACGCGGGA GCTGCTGTGC TGTTGGGCGA CCGGCGCCCG ATGATCGACG CCCTGCGAGC GCTGGGAGGT CTTCAGGCCG GCCAGTTGCC GAAGGAAATG GCTGCCAGCG GGATTGCGGG TGGCGGTATG ATGGCTCTTT TCAGCAGTCA CCCGCCCCTT GAATCGCGGA TTGCAGCGCT GGAATCGGCA CGCTGA
|
Protein sequence | MKRVVLFLFT NLAVMLVLSV SARVLGVDRF LTGNGLDMGM LLLFAALIGF GGSFISLLMS KTMAKWSTGA RVIQQPANQN EVWLVDTVSQ LSKKAGLAMP EVAIYDGAPN AFATGPSKSR SLVAVSTGLL QSMDRKQVEA VLAHEVAHID NGDMVTLTLI QGVLNTFVIF LSRVIAYAID SFLRSDDDES GSPGIGYWIS SIIFEIMFGI LASVVVMYFS RKREYRADAG AAVLLGDRRP MIDALRALGG LQAGQLPKEM AASGIAGGGM MALFSSHPPL ESRIAALESA R
|
| |