Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cpha266_1855 |
Symbol | |
ID | 4571197 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides DSM 266 |
Kingdom | Bacteria |
Replicon accession | NC_008639 |
Strand | - |
Start bp | 2148931 |
End bp | 2150445 |
Gene Length | 1515 bp |
Protein Length | 504 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 639766437 |
Product | protease Do |
Protein accession | YP_912295 |
Protein GI | 119357651 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | [TIGR02037] periplasmic serine protease, Do/DeqQ family |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.19147 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAAGA AAAAAAGCAT GTTGAAATAC CTGCTGCTTG TCTTTGCGGG TATTCTTGTT GGCGGCCTTG TTTTTGCCAA TGTCGAATTC AGCATTCCGG GCCAGGGGAA AGTTGCGATA GTAAAAAATA ATCCCAACTA TGCCAACGCG AAAAACAATT TCGAGAACTA TCCCATACAC TCGCTCAGGG ATTTCAACGA AGCCTTTGTC CAGATCGCCG AATCAGCGAC GCCCTCTGTT GTTACCGTGT TTACCGAAAA AACCGTGAGC CGCAGGCTCA TCAGCCCCTT TGACTTCTTC GGAAGATCAT TCGACGATTT TTTCGGAACG CCAGGAGGCG CGCGATCCCC GAATGAGCGA AAAGAGGTGC GTCACGGTCT CGGTTCGGGG GTAATCGTGA CGGACGACGG TTATATTCTT ACCAATAACC ATGTTATCGA GAATGCCGAT GCGATTTATA TCCGCACAAG CGATAACAAG AAAATAGATG CCACAATTAT CGGCAAGGAT CCCAAGACTG ATCTTGCTGT GATCAAGGTC AATGCTCGCG GCCTGAAACC CATCATGATC GGCAACAGCG ACAACCTGAG GGTTGGTGAA TGGGTGATTG CCATAGGCAG TCCGCTCGGC GAAAATCTTG CGCGAACGGT TACCCAGGGA ATCGTAAGCG CAATAGGCCG CGCAAATGTA GGCCTTGCCG ACTATGAGGA TTTCATTCAG ACCGATGCGG CCATCAATCC GGGCAATTCA GGTGGTCCGC TTGTCAACAT CAATGGAGAG CTTGTTGGTA TTAACACAGC TATTGCAAGC CGCACAGGGG GTTTTGAAGG CATAGGTTTT GCGGTACCCT CCAATATGGC TCAACAGGTT CTGACCGCTC TTATTACAAA AGGAAAGGTG AGCAGGGGAT ATCTTGGCAT CAGTATCCAG GATATCGATG AAAATATTGC CAAAGGTCTT CAGCTCCCTA AAGCTGAAGG AGTTATTGTT GGAACGGTGG TCGCCGGAAG TCCGGCTGCA CGAAGCGGAA TGAAAACCGG TGACATCATT ACGGAGTTCA ACGACAAAAA AGTCACGGGC AGCGCAGAGC TTCGCAATAC TATTGCAGCA ATGCAGCCCG GTTCGACGGC CCGTCTTCGC ATTCTTCGCG ATGGTCAGAT CAGGATGTAT GCAGTCAAGC TTGAAGAGCA GCCGTTACAA GAGGTTGCTT CCAGAGAGGT CGCTCGTTCC AGCGAAGTCC TTGGATTCAG GTCCCAGGAG CTTACGCCCG AACTTGCCCG GCAGTATCAG TTAAAAGAGG CATCAGGGAA GATGATCGTC ACCGGCGTCG ACCAGTCCAG CAACGCATTC CGTGCAGGGC TTCGCGCAGG CGATGTGATC GTATCTGTCA ATAAACAGCC GATAACCACC TCAGCGCAGT ACAGCGAGAT ACTGAGCAAG GTTAAAAGCG GCGATCTGCT TTTTCTTCTG GTTGAGCGGG GTGGCAACAA ACTTTATCTG GCCTTTAATG TTTAG
|
Protein sequence | MKKKKSMLKY LLLVFAGILV GGLVFANVEF SIPGQGKVAI VKNNPNYANA KNNFENYPIH SLRDFNEAFV QIAESATPSV VTVFTEKTVS RRLISPFDFF GRSFDDFFGT PGGARSPNER KEVRHGLGSG VIVTDDGYIL TNNHVIENAD AIYIRTSDNK KIDATIIGKD PKTDLAVIKV NARGLKPIMI GNSDNLRVGE WVIAIGSPLG ENLARTVTQG IVSAIGRANV GLADYEDFIQ TDAAINPGNS GGPLVNINGE LVGINTAIAS RTGGFEGIGF AVPSNMAQQV LTALITKGKV SRGYLGISIQ DIDENIAKGL QLPKAEGVIV GTVVAGSPAA RSGMKTGDII TEFNDKKVTG SAELRNTIAA MQPGSTARLR ILRDGQIRMY AVKLEEQPLQ EVASREVARS SEVLGFRSQE LTPELARQYQ LKEASGKMIV TGVDQSSNAF RAGLRAGDVI VSVNKQPITT SAQYSEILSK VKSGDLLFLL VERGGNKLYL AFNV
|
| |