Gene Information |
Locus tag | Cphamn1_1169 |
Symbol | |
ID | 6374844 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides BS1 |
Kingdom | Bacteria |
Replicon accession | NC_010831 |
Strand | + |
Start bp | 1257409 |
End bp | 1258713 |
Gene Length | 1305 bp |
Protein Length | 434 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 642683667 |
Product | protein of unknown function DUF107 |
Protein accession | YP_001959584 |
Protein GI | 189500114 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG1030] Membrane-bound serine protease (ClpP class) |
TIGRFAM ID | |
|
|
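The coordinate and length fields above are mutually consistent, and the arithmetic can be checked directly. The Python sketch below assumes the start and end positions are 1-based and inclusive (which the listed values imply) and that the CDS ends with a single stop codon. The function names are illustrative only and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a complete CDS: number of codons minus the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(1257409, 1258713)
print(length_bp)                  # 1305
print(protein_length(length_bp))  # 434
```

Both results match the Gene Length (1305 bp) and Protein Length (434 aa) rows of the record.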
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 4 |
Fosmid unclonability p-value | 0.0123968 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGATCGTTT TGCTGCTTTT TGCGGCAGTG ACGGTTTTTT CATCAACTTT ACGCGCTGAA GAGGCAAAAG GCAACAAGAC AGTTCTTTTT CTCTCTCTTC AGGGTACGGT GAATCCTGGA AGCGCGGATT TTTTTGAGCG TGCGATTGAC CAGGCTGAAA AAGAGAAGGT TCACGCTATC CTTGTTGAAC TTGATACACC TGGCGGGCTT GTCTCCTCAT TGCGTGCAAT GGTACAGAGC GTGCTTGCTT CGCCTGTTCC TGTGATTGTT TATGTGGCGC CTCAGGGAGC TCAGGCTGCA TCTGCAGGGG CTCTGTTGAC ACTATCCGCA CATGTCGCTG CCATGTCTCC GGGCACGGAG ATCGGAGCGG CACATCCTGT CGGTCTCGGT GGAGGAGGTG ATGGTGATGA AACCATGAGT AAAAAGGCCG AGAATGATCT TGCCGCTTTT GCCCGGAGCA TAGCGGAAGA AAGGGGAAGA AATGCTGAGT GGGCGGAAAA CGCGGTACGG GAAAGTATTG CTTCAACCGC AAACGAAGCA CTCAAGGCTG GAGTTATCGA TTTTGTCGCC GCCGATCGTG CGGAACTTTT CAGGATGCTT GACGGCAGGA CGGTCGAAAC GATCGATGGC AGTCTGACGC TTGATTTGAC GGGAGCAGTT ATTGAAGAAT TTTCTCCGAC CTTGCAGGAA CAGATCCTTA TTAAGCTTGC CGACCCCAAT CTGGCATATA TTTTTATCAT GGTCGGGCTT GCAGGGCTCT ATTTCGAGTT AGCAAATCCG GGCTCTATTT TCCCCGGAGT ACTGGGCGCA ATATCGCTTC TTCTTGCTCT TTTCGCTCTT CAGGCTTTGC CTGTCAATGT CGTCGGTGTG TTGCTCATTG TTCTGGCGGT GGTATTTTTC GGGCTGGAAC TCTTTGTCGC TAGCGGCGGT ATACTGGCTC TGGCGGGCCT GGTAGCTCTT TTTGTCGGCT CTCTTATGCT TTTCAATACG GCTGAAACAG GGATTTCCAT TTCCATGACG GTTTTCCTTC CCGTATTTAT CATGGTGTCA GTATCCCTTT TGGCTATTGT CTGGCTCGTT ACCAAATCCT CAAGGCTGAA GCTTTCTTCC GGACCCGAAC AGCTGATCGG GGAGGAGGGC AGTGTGATTC ATGCCATTTT GCCCGGTCAG CCCGGAAAGG TGTTTGTTCA TGGCGAGCTT TGGGACGCGG AAAGCGGCGA AGAGATCCCT GAAAAGGGAG TCGCGATCGT GAAAGGTTTG AAAGGACTTA TTTTGCAGGT AACCAAAAAA CAGGAGAACG TATAA
|
Protein sequence | MIVLLLFAAV TVFSSTLRAE EAKGNKTVLF LSLQGTVNPG SADFFERAID QAEKEKVHAI LVELDTPGGL VSSLRAMVQS VLASPVPVIV YVAPQGAQAA SAGALLTLSA HVAAMSPGTE IGAAHPVGLG GGGDGDETMS KKAENDLAAF ARSIAEERGR NAEWAENAVR ESIASTANEA LKAGVIDFVA ADRAELFRML DGRTVETIDG SLTLDLTGAV IEEFSPTLQE QILIKLADPN LAYIFIMVGL AGLYFELANP GSIFPGVLGA ISLLLALFAL QALPVNVVGV LLIVLAVVFF GLELFVASGG ILALAGLVAL FVGSLMLFNT AETGISISMT VFLPVFIMVS VSLLAIVWLV TKSSRLKLSS GPEQLIGEEG SVIHAILPGQ PGKVFVHGEL WDAESGEEIP EKGVAIVKGL KGLILQVTKK QENV
|
| |
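The protein sequence can be reproduced from the gene sequence with translation table 11, and the GC content can be recomputed from the same string. The Python sketch below is a minimal illustration rather than the database's own method. It assumes that the first codon is a valid start codon (GTG here) and is therefore read as methionine, and that the spaces in the sequence are only block separators. All names in it are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order. NCBI translation tables 1 and 11
# share these assignments; they differ in which codons may act as start codons.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate_cds(seq: str) -> str:
    """Translate a CDS up to (not including) the first stop codon.

    Assumption: the first codon is a start codon and is emitted as 'M',
    whatever amino acid it would encode internally.
    """
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append("M" if i == 0 else aa)
    return "".join(protein)


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return sum(base in "GC" for base in dna) / len(dna)


# First three 10-base blocks of the gene sequence listed above.
print(translate_cds("GTGATCGTTT TGCTGCTTTT TGCGGCAGTG"))  # MIVLLLFAAV
```

The output matches the first ten residues of the listed protein sequence. Passing the full gene sequence to `translate_cds` and `gc_content` allows the Protein sequence and GC content fields to be checked in the same way.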