Gene Cphamn1_1287 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCphamn1_1287 
Symbol 
ID6374964 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium phaeobacteroides BS1 
KingdomBacteria 
Replicon accessionNC_010831 
Strand
Start bp1399293 
End bp1400900 
Gene Length1608 bp 
Protein Length535 aa 
Translation table11 
GC content48% 
IMG OID642683784 
Productcarboxyl-terminal protease 
Protein accessionYP_001959699 
Protein GI189500229 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0793] Periplasmic protease 
TIGRFAM ID[TIGR00225] C-terminal peptidase (prc) 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.863729 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCACGAC TATCTCAAGT GTTCCTGATT ATTGCGTGTT TATTGCTGGG TGGCATCATC 
GGTTATATGG CGCGGAATAA TTCTTCAGGT TCTGTTTTCG GTCAACAGAA AAAGGTGCTT
GATGTGTTGA ATATGATTTC AGCGCTATAC GTTGATGAGG TGGATGATCA GCATGTGATC
GATGCAGGAA TCGAGGGCAT GACAGAGGCT CTTGATCCGC ATACGACATA TCTTACCGCT
GAACAGGTGG GCTTTTCAGA ATCGGAATTT CAGGGCAACT TTGAGGGCAT AGGTATCGAA
TTCGATATTG TTCATGATAC GCTTCTCGTG GTGACTCCTC TGGCGAGCGG ACCAAGTAGT
TCGGCAGGAA TTATGCCGGG TGACAGGATC ATCGCTATCG ATACGCTTTC CGCTGTTGGT
ATTTCCCCTA TGGAGGTTGT CCGCCGGCTT CGCGGAAAAA AGGGTTCTGT TGTGAACCTT
CAAATTTACA GGCCCTATTC ATCCAGGACA TTTTCCGTTG CGGTTGTTCG AGACAAGATT
CCTACGTATA GTGTGGATGC TTTTTTTATG CTCGACAGCT TTACCGGGTA TATCCGGATG
AGCAGGTTTG TTGCAACTAC GGCAGACGAG TTCAAAGAGG CGATGAAAAA ACTTCTTGAG
CAGGGTATGA GGAAACTTAT TATCGATGTT CGGGGAAATC CGGGCGGGTA TCTCGATCAG
GCGGTAGAGA TCGCTGACGA GTTGCTTCAG GAGGGAAAGC TTATTGTCTA TACAAAGAGC
AGGAATGGCG GTATTGATGA AATGAGCTAT ACGTCGACTT CCGGAGGTGT TTTCGAGAAC
CGTGAGGTTA TGGTGCTTGT TGACAGGGGG AGTGCTTCTG CTGCCGAGAT TCTTGCCGGC
GCACTTCAGG ATAACGGGCG GGCAAAGATA GCAGGAGAAC TGACTTTCGG CAAGGGACTT
GTTCAGCGTC AGTTTGATCT TGGCGACGGA TCGGCATTGC GTTTGACGAT CGCGAGGTAT
TATACGCCTT CAGGCAGAAG GATACAGCGG GATTTCAGTC CGGGAAGCGA AGGGAGGGAA
GAGTATTATC ATGAGAGCGC AGAAGGGAGA GATGGAGAGA AACTGTTTAA GGACAGCGGC
AGTTTAGAGG TTGGCAGCGA TGTCGATGGT GTTACGGTAT ACAGGCCAAA AGAAGGCCTT
TTTTCTGAAT CAGGAGGTAT TGTGCCTGAT TTCTGGGTTG TTGAGCGTCC TGTTGATGAT
TTTTACAGCA GATTGCTGAC AGAAGGGGTT TTTGATGAAA CCGCGTTACG AGTCATCGAT
GATCCATCAA GCGCGGTAAG GACATTCAGT GAAGATGAAA AGAGGTTTAT TAGCGCTTAT
TCAGAAGATG CAATGGCTGA AGACTACCTG AGGAAAATAG CTGCAGAAAA GGATATTCCT
TTCGATGAGA CTGTGTTCAG GCGTGAAAAG CCCGCTATGC TGACGGCGGT GAAATCACGT
ATTGCCCGCC AGCTTTTCGG CATAGAGGCG CAGATAAGGG TTCTTGCCGA TGAGTCTGAC
AAGATGCTGC AGTTTGCCAG GTATTACTTG CGTCGTGAAG CTTCTTAA
 
Protein sequence
MSRLSQVFLI IACLLLGGII GYMARNNSSG SVFGQQKKVL DVLNMISALY VDEVDDQHVI 
DAGIEGMTEA LDPHTTYLTA EQVGFSESEF QGNFEGIGIE FDIVHDTLLV VTPLASGPSS
SAGIMPGDRI IAIDTLSAVG ISPMEVVRRL RGKKGSVVNL QIYRPYSSRT FSVAVVRDKI
PTYSVDAFFM LDSFTGYIRM SRFVATTADE FKEAMKKLLE QGMRKLIIDV RGNPGGYLDQ
AVEIADELLQ EGKLIVYTKS RNGGIDEMSY TSTSGGVFEN REVMVLVDRG SASAAEILAG
ALQDNGRAKI AGELTFGKGL VQRQFDLGDG SALRLTIARY YTPSGRRIQR DFSPGSEGRE
EYYHESAEGR DGEKLFKDSG SLEVGSDVDG VTVYRPKEGL FSESGGIVPD FWVVERPVDD
FYSRLLTEGV FDETALRVID DPSSAVRTFS EDEKRFISAY SEDAMAEDYL RKIAAEKDIP
FDETVFRREK PAMLTAVKSR IARQLFGIEA QIRVLADESD KMLQFARYYL RREAS