Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cphamn1_0905 |
Symbol | |
ID | 6374572 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides BS1 |
Kingdom | Bacteria |
Replicon accession | NC_010831 |
Strand | + |
Start bp | 979894 |
End bp | 981402 |
Gene Length | 1509 bp |
Protein Length | 502 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 642683407 |
Product | protease Do |
Protein accession | YP_001959331 |
Protein GI | 189499861 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | [TIGR02037] periplasmic serine protease, Do/DeqQ family |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGAAGA ACAAACTCAC CGCTATTTTT TTGATTCTGG CAGGTGTTGC CATAGGAGGG CTCGTTTTTT CCAATCTGGA GTTTACCTTT CCGGGACAGC AGTATGAGGT GGCGGTAACC AATCATGCCG GTGTGGCGAC CGCAGCAAGC AAGGTCAAGG AGAGGCCGAT TCATACGCTC AGAGACTTCA ACGAAGCTTT TGTCGATATC GCTGAATCGG CAACACCTTC GGTCGTGACT ATTTTTACCG AAAAAACAGT CAACCGAAGG TTTGTCTCTC CCTTTGACCT TTTTGGCAGC CCTTTTGACG GTTTTTTTGA TCGTCCGCGC GGGAACCAAA CCCCCGAAAG TCGCAAGGAG GTGCTGCGTG GTTTAGGTTC AGGGGTTATT ATCAGCAAGG ACGGTTACAT TCTCACGAAC AACCATGTTA TTGAAAATGC GGATACTATC TACATCAGGA CCTATGATAA CAAGAGGCAC GAAGCCAAAA TTATCGGTTC AGACCCGAAA ACAGATATCG CGGTGATTAA AACCGATGCG AAAAATCTCA ACCCTATTGC AATCGGCGAC AGTGACGCAC TGCGGGTCGG GGAATGGGTG ATCGCTATCG GCAGCCCTCT GGGCGAAAAC CTTGCACGCA CGGTAACTCA GGGGATCGTC AGCGCCAAAG GCCGCGCGAA TGTCGGACTT GCCGATTATG AGGACTTTAT CCAGACTGAC GCGGCGATTA ATCCGGGAAA TTCAGGGGGC CCCCTGGTAA ATATCGATGG AGAACTTGTC GGTATCAATA CGGCTATAGC CAGCAGGACA GGAGGGTTTC AGGGCATTGG CTTTGCAGTG CCCTCCAATA TGGCACGTCA GATTATGCAG TCACTTGTCA GGAGCGGCAA GGTTACCAGA GGCTGGCTTG GCGTTACCAT ACAGGATGTC GATGAGAATA TCGCCAAAGG GTTGAAACTC GATAGAGCTG ATGGCGTTCT GGTCGGTACG GTTCTGGAAA ACAGTCCGGC AAAAGCAGGC GGGCTGAAGA CCGGAGATGT TATTCTTGAA ATAAATGGTA AAAAACTCAG GGATACCGTT GAACTTCGTA ATACCATAGC CAGGACATCC CCGGGAACGA CTGTTCAGCT GACTCTATGG AGGGACGGCG CTCTTAAAAA AGTATCTGTC AAGCTGAACG AGATACCCGA TCAGCCGGTG GCTGCCGAGC AGCAGCAGGA GATGGACGAG CTGCTTGGGT TCAATGCCGC TCCGCTATCA CCTGAGCTTG CCGCACAGTA CAGGTTACAG GCTGACGCGG GAAAGGTCGT GGTAACGGAA GTCACTCAAG GGAGCAACGC TTATCGCGCA GGACTGCGTA ACGGTGATAG CATAAAAGCG GTCAACAGAA AGAATATTTC CTCCTATAAG CAGTTTTTAT CCCTTGTCGG CAAGATGAAG CAGGGAGACC TTCTGTTTCT GCTCGTCGAG CGTGGCGGAA GTAAGGTCTA TTTTGCGTTT AACCTGTAA
|
Protein sequence | MKKNKLTAIF LILAGVAIGG LVFSNLEFTF PGQQYEVAVT NHAGVATAAS KVKERPIHTL RDFNEAFVDI AESATPSVVT IFTEKTVNRR FVSPFDLFGS PFDGFFDRPR GNQTPESRKE VLRGLGSGVI ISKDGYILTN NHVIENADTI YIRTYDNKRH EAKIIGSDPK TDIAVIKTDA KNLNPIAIGD SDALRVGEWV IAIGSPLGEN LARTVTQGIV SAKGRANVGL ADYEDFIQTD AAINPGNSGG PLVNIDGELV GINTAIASRT GGFQGIGFAV PSNMARQIMQ SLVRSGKVTR GWLGVTIQDV DENIAKGLKL DRADGVLVGT VLENSPAKAG GLKTGDVILE INGKKLRDTV ELRNTIARTS PGTTVQLTLW RDGALKKVSV KLNEIPDQPV AAEQQQEMDE LLGFNAAPLS PELAAQYRLQ ADAGKVVVTE VTQGSNAYRA GLRNGDSIKA VNRKNISSYK QFLSLVGKMK QGDLLFLLVE RGGSKVYFAF NL
|
| |