Gene Cpha266_1855 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCpha266_1855 
Symbol 
ID4571197 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium phaeobacteroides DSM 266 
KingdomBacteria 
Replicon accessionNC_008639 
Strand
Start bp2148931 
End bp2150445 
Gene Length1515 bp 
Protein Length504 aa 
Translation table11 
GC content50% 
IMG OID639766437 
Productprotease Do 
Protein accessionYP_912295 
Protein GI119357651 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain 
TIGRFAM ID[TIGR02037] periplasmic serine protease, Do/DeqQ family 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.19147 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAAGA AAAAAAGCAT GTTGAAATAC CTGCTGCTTG TCTTTGCGGG TATTCTTGTT 
GGCGGCCTTG TTTTTGCCAA TGTCGAATTC AGCATTCCGG GCCAGGGGAA AGTTGCGATA
GTAAAAAATA ATCCCAACTA TGCCAACGCG AAAAACAATT TCGAGAACTA TCCCATACAC
TCGCTCAGGG ATTTCAACGA AGCCTTTGTC CAGATCGCCG AATCAGCGAC GCCCTCTGTT
GTTACCGTGT TTACCGAAAA AACCGTGAGC CGCAGGCTCA TCAGCCCCTT TGACTTCTTC
GGAAGATCAT TCGACGATTT TTTCGGAACG CCAGGAGGCG CGCGATCCCC GAATGAGCGA
AAAGAGGTGC GTCACGGTCT CGGTTCGGGG GTAATCGTGA CGGACGACGG TTATATTCTT
ACCAATAACC ATGTTATCGA GAATGCCGAT GCGATTTATA TCCGCACAAG CGATAACAAG
AAAATAGATG CCACAATTAT CGGCAAGGAT CCCAAGACTG ATCTTGCTGT GATCAAGGTC
AATGCTCGCG GCCTGAAACC CATCATGATC GGCAACAGCG ACAACCTGAG GGTTGGTGAA
TGGGTGATTG CCATAGGCAG TCCGCTCGGC GAAAATCTTG CGCGAACGGT TACCCAGGGA
ATCGTAAGCG CAATAGGCCG CGCAAATGTA GGCCTTGCCG ACTATGAGGA TTTCATTCAG
ACCGATGCGG CCATCAATCC GGGCAATTCA GGTGGTCCGC TTGTCAACAT CAATGGAGAG
CTTGTTGGTA TTAACACAGC TATTGCAAGC CGCACAGGGG GTTTTGAAGG CATAGGTTTT
GCGGTACCCT CCAATATGGC TCAACAGGTT CTGACCGCTC TTATTACAAA AGGAAAGGTG
AGCAGGGGAT ATCTTGGCAT CAGTATCCAG GATATCGATG AAAATATTGC CAAAGGTCTT
CAGCTCCCTA AAGCTGAAGG AGTTATTGTT GGAACGGTGG TCGCCGGAAG TCCGGCTGCA
CGAAGCGGAA TGAAAACCGG TGACATCATT ACGGAGTTCA ACGACAAAAA AGTCACGGGC
AGCGCAGAGC TTCGCAATAC TATTGCAGCA ATGCAGCCCG GTTCGACGGC CCGTCTTCGC
ATTCTTCGCG ATGGTCAGAT CAGGATGTAT GCAGTCAAGC TTGAAGAGCA GCCGTTACAA
GAGGTTGCTT CCAGAGAGGT CGCTCGTTCC AGCGAAGTCC TTGGATTCAG GTCCCAGGAG
CTTACGCCCG AACTTGCCCG GCAGTATCAG TTAAAAGAGG CATCAGGGAA GATGATCGTC
ACCGGCGTCG ACCAGTCCAG CAACGCATTC CGTGCAGGGC TTCGCGCAGG CGATGTGATC
GTATCTGTCA ATAAACAGCC GATAACCACC TCAGCGCAGT ACAGCGAGAT ACTGAGCAAG
GTTAAAAGCG GCGATCTGCT TTTTCTTCTG GTTGAGCGGG GTGGCAACAA ACTTTATCTG
GCCTTTAATG TTTAG
 
Protein sequence
MKKKKSMLKY LLLVFAGILV GGLVFANVEF SIPGQGKVAI VKNNPNYANA KNNFENYPIH 
SLRDFNEAFV QIAESATPSV VTVFTEKTVS RRLISPFDFF GRSFDDFFGT PGGARSPNER
KEVRHGLGSG VIVTDDGYIL TNNHVIENAD AIYIRTSDNK KIDATIIGKD PKTDLAVIKV
NARGLKPIMI GNSDNLRVGE WVIAIGSPLG ENLARTVTQG IVSAIGRANV GLADYEDFIQ
TDAAINPGNS GGPLVNINGE LVGINTAIAS RTGGFEGIGF AVPSNMAQQV LTALITKGKV
SRGYLGISIQ DIDENIAKGL QLPKAEGVIV GTVVAGSPAA RSGMKTGDII TEFNDKKVTG
SAELRNTIAA MQPGSTARLR ILRDGQIRMY AVKLEEQPLQ EVASREVARS SEVLGFRSQE
LTPELARQYQ LKEASGKMIV TGVDQSSNAF RAGLRAGDVI VSVNKQPITT SAQYSEILSK
VKSGDLLFLL VERGGNKLYL AFNV