Gene Cphamn1_0021 details

Gene Information

Locus tag: Cphamn1_0021
Symbol:
ID: 6373663
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium phaeobacteroides BS1
Kingdom: Bacteria
Replicon accession: NC_010831
Strand:
Start bp: 25214
End bp: 26905
Gene Length: 1692 bp
Protein Length: 563 aa
Translation table: 11
GC content: 52%
IMG OID: 642682542
Product: carboxyl-terminal protease
Protein accession: YP_001958491
Protein GI: 189499021
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0793] Periplasmic protease
TIGRFAM ID: [TIGR00225] C-terminal peptidase (prc)
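
The coordinate and length fields listed above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The function and variable names are illustrative and are not part of the record.

```python
# Values copied from the record; the names are illustrative only.
start_bp = 25214
end_bp = 26905


def gene_length(start: int, end: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end - start + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(start_bp, end_bp)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1692 563
```

Both results agree with the Gene Length and Protein Length fields of the record.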


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000380746
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.822612
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAACGTT ATTCGACAAG CGGTATCGGC AGCATGGTGA TGCGGGTTTT TGTTTCTCTT 
CTCTTCTTTT CGGGAACGTT TCTGAAAACG GCGGAAAGCC GTGAGAGCGA CTCTTTTTAC
ATCGCAAAAA ACATTGAGCT GCTCGGTGAG GTTTACCGGA ACGTTTCTGA AAATTATGTT
GATAGTATAG ACACGGCGGA GTTCATGTAT GCCGGTATAG ATGGTATGCT TGAAACGCTT
GATCCCTATA CGGTTTTTCT TGACGAAAAA GAGTCGGATG AACTCGGAGA GCTGACAAGC
GGGCACTATG CCGGCATCGG AGTCAGGATA TCGGAAATTG CCGGGGAGGT CTATGTTCTT
TCGGTTTTTG ACGGGTCTCC GGCGGCAAAA GCCGGACTTC GTGTCGGCGA TCGTATCGAG
AAGGTTGACC GGCATATCGT TAAGGGAAAA GATCTTGATG AGGTGAAGAC TTTCATCAAA
GGGCCAGCCG GAAGCGAGGT CGTTCTCACT GTTGAAAGGT ACGGGAAGAA GAGCAGGGTT
CGGGCAAGGA TTACCCGTCG CGAAGTCAGG GTTAACAGCA TACGCTATTC GGGACTGCTT
GGAGAGATCG GATATCTCGT GATGGACTCA TTCGGGAACA GGAGTCCGGA TGAGCTCAAG
AGGGCGATCA ATGAACTGGA TGCCGCATCG AGAATCAGGA AACGCCCCAT GGCCGGCGTG
ATTCTGGATC TACGGAACAA TCCGGGCGGG TTGCTTGAGG CCGCCGTTGA TGTCAGCGGG
CTTTTTGTCA GCAAGGGCAG TCAGGTCGTT TCAACCATGG GTCGTGATCC TGAAAGCAGA
ATCAGCTACG AGACAAAGCG TGCTCCTGTC GTCGAAAAGC GACCTCTGGC CATACTGATC
AATAAAAACA GCGCGTCAGC CGCCGAGATT GTTGCAGGGG CAATCCAGGA GCTTGATCGT
GGCGTGATTG TCGGAAACCG TTCGTTCGGC AAAGGGCTTG TTCAGTCAGT AATCACCCTT
CCCTATGACT CCAAGCTGAA GATGACAACG TCCAAGTACT ATACACCTTC CGGTCGTCTG
ATCCAGCAGG AGCATGACTG GACAGGGGGG CCCCGCAAGG TTCTTGAGCG CGAAGAAAAG
CCGGCAGATG GTATGGTGTT TTATACCCGT AACAAGCGTA AGGTTTATGG AGGAGGAGGT
ATTCTTCCCG ACATCAGGCT TGACGGATAT ATTCCCGGCA GCTATGAAGC CGCTCTGCGC
AAGGACGGAA TGCTTTTCCG CTTCGCCAGT ACATACAGGG CATCACATGA TCGAATCCCG
CAGGCCGGCA TTGACAGAAA ATCGTTGATG CGCGATTTTG CCGGTTTTCT CGAAAGGGAG
TCGTTCATCT ACAAATCCAA ACCGGAGGAG CTTCTTGAGG AAGTCAGGAA ATCCCTTGCC
GGGAATCATA AAGAGGCTCA TCCTGGGATC GATTCGCTCG TCGCATCTCT TGAAAAGGAG
ATGAAACTTC TTGCCGGTAA GCAGAAATCA AGCGAATCGA AACAGGTCGC TCTTGCTCTT
GAGCAGGAAA TCCTGCGTCA TTACGATGAG GAGGCTGCTC TACGAAGCCG AATAGAGGAC
GATCCTGTCG TTAAAAAAGC GTTGGAAGTT CTGCAGGACC CGGGCAGGTA CAGCACGTTG
CTCAGCCCAT AG
 
Protein sequence
MKRYSTSGIG SMVMRVFVSL LFFSGTFLKT AESRESDSFY IAKNIELLGE VYRNVSENYV 
DSIDTAEFMY AGIDGMLETL DPYTVFLDEK ESDELGELTS GHYAGIGVRI SEIAGEVYVL
SVFDGSPAAK AGLRVGDRIE KVDRHIVKGK DLDEVKTFIK GPAGSEVVLT VERYGKKSRV
RARITRREVR VNSIRYSGLL GEIGYLVMDS FGNRSPDELK RAINELDAAS RIRKRPMAGV
ILDLRNNPGG LLEAAVDVSG LFVSKGSQVV STMGRDPESR ISYETKRAPV VEKRPLAILI
NKNSASAAEI VAGAIQELDR GVIVGNRSFG KGLVQSVITL PYDSKLKMTT SKYYTPSGRL
IQQEHDWTGG PRKVLEREEK PADGMVFYTR NKRKVYGGGG ILPDIRLDGY IPGSYEAALR
KDGMLFRFAS TYRASHDRIP QAGIDRKSLM RDFAGFLERE SFIYKSKPEE LLEEVRKSLA
GNHKEAHPGI DSLVASLEKE MKLLAGKQKS SESKQVALAL EQEILRHYDE EAALRSRIED
DPVVKKALEV LQDPGRYSTL LSP
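
The sequences above are printed in space-separated blocks of ten characters. The sketch below shows one way such blocks could be cleaned, checked for GC content, and translated. It is an illustration rather than the method used to produce this record. The helper names are hypothetical, and the codon table is written out by hand from the amino acid assignments of NCBI translation table 11, which are the same as those of the standard code.

```python
# Amino acid assignments for NCBI translation table 11, codons ordered
# TTT, TTC, TTA, TTG, TCT, ... with bases in the order T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a cleaned DNA sequence."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# The first three blocks of the gene sequence, as printed in the record.
first_blocks = clean_sequence("ATGAAACGTT ATTCGACAAG CGGTATCGGC")
print(translate(first_blocks))  # MKRYSTSGIG
```

The translated prefix matches the first ten residues of the protein sequence. Applying `gc_fraction` to the full cleaned gene sequence gives a value that can be compared with the GC content field of the record.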