Gene Cphamn1_0905 details


Gene Information

Locus tag: Cphamn1_0905
Symbol:
ID: 6374572
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium phaeobacteroides BS1
Kingdom: Bacteria
Replicon accession: NC_010831
Strand:
Start bp: 979894
End bp: 981402
Gene Length: 1509 bp
Protein Length: 502 aa
Translation table: 11
GC content: 51%
IMG OID: 642683407
Product: protease Do
Protein accession: YP_001959331
Protein GI: 189499861
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain
TIGRFAM ID: [TIGR02037] periplasmic serine protease, Do/DeqQ family
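
As a quick consistency check on the fields above, the listed gene and protein lengths can be recomputed from the start and end coordinates. This is only a sketch: it assumes the coordinates are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Coordinates as listed in the record (assumed 1-based, inclusive).
start_bp = 979894
end_bp = 981402

# Inclusive span: both the first and the last base are counted.
gene_length = end_bp - start_bp + 1

# A coding sequence is read in triplets; the last triplet is assumed
# to be a stop codon, which does not contribute an amino acid.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1509
print(codon_count)     # 503
print(protein_length)  # 502
```

Under these assumptions the results agree with the Gene Length (1509 bp) and Protein Length (502 aa) fields.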


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGAAGA ACAAACTCAC CGCTATTTTT TTGATTCTGG CAGGTGTTGC CATAGGAGGG 
CTCGTTTTTT CCAATCTGGA GTTTACCTTT CCGGGACAGC AGTATGAGGT GGCGGTAACC
AATCATGCCG GTGTGGCGAC CGCAGCAAGC AAGGTCAAGG AGAGGCCGAT TCATACGCTC
AGAGACTTCA ACGAAGCTTT TGTCGATATC GCTGAATCGG CAACACCTTC GGTCGTGACT
ATTTTTACCG AAAAAACAGT CAACCGAAGG TTTGTCTCTC CCTTTGACCT TTTTGGCAGC
CCTTTTGACG GTTTTTTTGA TCGTCCGCGC GGGAACCAAA CCCCCGAAAG TCGCAAGGAG
GTGCTGCGTG GTTTAGGTTC AGGGGTTATT ATCAGCAAGG ACGGTTACAT TCTCACGAAC
AACCATGTTA TTGAAAATGC GGATACTATC TACATCAGGA CCTATGATAA CAAGAGGCAC
GAAGCCAAAA TTATCGGTTC AGACCCGAAA ACAGATATCG CGGTGATTAA AACCGATGCG
AAAAATCTCA ACCCTATTGC AATCGGCGAC AGTGACGCAC TGCGGGTCGG GGAATGGGTG
ATCGCTATCG GCAGCCCTCT GGGCGAAAAC CTTGCACGCA CGGTAACTCA GGGGATCGTC
AGCGCCAAAG GCCGCGCGAA TGTCGGACTT GCCGATTATG AGGACTTTAT CCAGACTGAC
GCGGCGATTA ATCCGGGAAA TTCAGGGGGC CCCCTGGTAA ATATCGATGG AGAACTTGTC
GGTATCAATA CGGCTATAGC CAGCAGGACA GGAGGGTTTC AGGGCATTGG CTTTGCAGTG
CCCTCCAATA TGGCACGTCA GATTATGCAG TCACTTGTCA GGAGCGGCAA GGTTACCAGA
GGCTGGCTTG GCGTTACCAT ACAGGATGTC GATGAGAATA TCGCCAAAGG GTTGAAACTC
GATAGAGCTG ATGGCGTTCT GGTCGGTACG GTTCTGGAAA ACAGTCCGGC AAAAGCAGGC
GGGCTGAAGA CCGGAGATGT TATTCTTGAA ATAAATGGTA AAAAACTCAG GGATACCGTT
GAACTTCGTA ATACCATAGC CAGGACATCC CCGGGAACGA CTGTTCAGCT GACTCTATGG
AGGGACGGCG CTCTTAAAAA AGTATCTGTC AAGCTGAACG AGATACCCGA TCAGCCGGTG
GCTGCCGAGC AGCAGCAGGA GATGGACGAG CTGCTTGGGT TCAATGCCGC TCCGCTATCA
CCTGAGCTTG CCGCACAGTA CAGGTTACAG GCTGACGCGG GAAAGGTCGT GGTAACGGAA
GTCACTCAAG GGAGCAACGC TTATCGCGCA GGACTGCGTA ACGGTGATAG CATAAAAGCG
GTCAACAGAA AGAATATTTC CTCCTATAAG CAGTTTTTAT CCCTTGTCGG CAAGATGAAG
CAGGGAGACC TTCTGTTTCT GCTCGTCGAG CGTGGCGGAA GTAAGGTCTA TTTTGCGTTT
AACCTGTAA
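
The gene sequence is printed in blocks of ten bases, so the whitespace has to be removed before any length or composition check. The sketch below shows one way to do that and to compute a GC fraction, assuming GC content means the share of G and C among all bases. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first row of the sequence is used as sample input.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and upper-case it."""
    return "".join(blocked.split()).upper()


def gc_fraction(blocked: str) -> float:
    """Return the fraction of G and C bases (assumed definition of GC content)."""
    seq = clean_sequence(blocked)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = seq.count("G") + seq.count("C")
    return gc_count / len(seq)


# First row of the gene sequence, copied from the record.
first_row = "ATGAAGAAGA ACAAACTCAC CGCTATTTTT TTGATTCTGG CAGGTGTTGC CATAGGAGGG"

print(len(clean_sequence(first_row)))  # 60
print(f"{gc_fraction(first_row):.1%}")
```

Passing the full gene sequence instead of `first_row` would give a value to compare against the GC content field.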
 
Protein sequence
MKKNKLTAIF LILAGVAIGG LVFSNLEFTF PGQQYEVAVT NHAGVATAAS KVKERPIHTL 
RDFNEAFVDI AESATPSVVT IFTEKTVNRR FVSPFDLFGS PFDGFFDRPR GNQTPESRKE
VLRGLGSGVI ISKDGYILTN NHVIENADTI YIRTYDNKRH EAKIIGSDPK TDIAVIKTDA
KNLNPIAIGD SDALRVGEWV IAIGSPLGEN LARTVTQGIV SAKGRANVGL ADYEDFIQTD
AAINPGNSGG PLVNIDGELV GINTAIASRT GGFQGIGFAV PSNMARQIMQ SLVRSGKVTR
GWLGVTIQDV DENIAKGLKL DRADGVLVGT VLENSPAKAG GLKTGDVILE INGKKLRDTV
ELRNTIARTS PGTTVQLTLW RDGALKKVSV KLNEIPDQPV AAEQQQEMDE LLGFNAAPLS
PELAAQYRLQ ADAGKVVVTE VTQGSNAYRA GLRNGDSIKA VNRKNISSYK QFLSLVGKMK
QGDLLFLLVE RGGSKVYFAF NL
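
The protein sequence can be cross-checked against the gene sequence by translating codon by codon. The sketch below is a minimal illustration, not the database's own method. It builds the codon table from the standard amino acid assignments, which translation table 11 shares with the standard code (the two differ in their permitted start codons, not in these assignments). The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codons enumerated in T, C, A, G order, paired with the standard
# amino acid assignments ('*' marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, ignoring whitespace and any trailing partial codon."""
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First 30 bases and last 9 bases of the gene sequence in the record.
print(translate("ATGAAGAAGA ACAAACTCAC CGCTATTTTT"))  # MKKNKLTAIF
print(translate("AACCTGTAA"))                         # NL*
```

The first output matches the opening block of the protein sequence, and the second matches its final two residues followed by the stop codon.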