Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cphamn1_0021 |
Symbol | |
ID | 6373663 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides BS1 |
Kingdom | Bacteria |
Replicon accession | NC_010831 |
Strand | + |
Start bp | 25214 |
End bp | 26905 |
Gene Length | 1692 bp |
Protein Length | 563 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 642682542 |
Product | carboxyl-terminal protease |
Protein accession | YP_001958491 |
Protein GI | 189499021 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0793] Periplasmic protease |
TIGRFAM ID | [TIGR00225] C-terminal peptidase (prc) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.000380746 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.822612 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAACGTT ATTCGACAAG CGGTATCGGC AGCATGGTGA TGCGGGTTTT TGTTTCTCTT CTCTTCTTTT CGGGAACGTT TCTGAAAACG GCGGAAAGCC GTGAGAGCGA CTCTTTTTAC ATCGCAAAAA ACATTGAGCT GCTCGGTGAG GTTTACCGGA ACGTTTCTGA AAATTATGTT GATAGTATAG ACACGGCGGA GTTCATGTAT GCCGGTATAG ATGGTATGCT TGAAACGCTT GATCCCTATA CGGTTTTTCT TGACGAAAAA GAGTCGGATG AACTCGGAGA GCTGACAAGC GGGCACTATG CCGGCATCGG AGTCAGGATA TCGGAAATTG CCGGGGAGGT CTATGTTCTT TCGGTTTTTG ACGGGTCTCC GGCGGCAAAA GCCGGACTTC GTGTCGGCGA TCGTATCGAG AAGGTTGACC GGCATATCGT TAAGGGAAAA GATCTTGATG AGGTGAAGAC TTTCATCAAA GGGCCAGCCG GAAGCGAGGT CGTTCTCACT GTTGAAAGGT ACGGGAAGAA GAGCAGGGTT CGGGCAAGGA TTACCCGTCG CGAAGTCAGG GTTAACAGCA TACGCTATTC GGGACTGCTT GGAGAGATCG GATATCTCGT GATGGACTCA TTCGGGAACA GGAGTCCGGA TGAGCTCAAG AGGGCGATCA ATGAACTGGA TGCCGCATCG AGAATCAGGA AACGCCCCAT GGCCGGCGTG ATTCTGGATC TACGGAACAA TCCGGGCGGG TTGCTTGAGG CCGCCGTTGA TGTCAGCGGG CTTTTTGTCA GCAAGGGCAG TCAGGTCGTT TCAACCATGG GTCGTGATCC TGAAAGCAGA ATCAGCTACG AGACAAAGCG TGCTCCTGTC GTCGAAAAGC GACCTCTGGC CATACTGATC AATAAAAACA GCGCGTCAGC CGCCGAGATT GTTGCAGGGG CAATCCAGGA GCTTGATCGT GGCGTGATTG TCGGAAACCG TTCGTTCGGC AAAGGGCTTG TTCAGTCAGT AATCACCCTT CCCTATGACT CCAAGCTGAA GATGACAACG TCCAAGTACT ATACACCTTC CGGTCGTCTG ATCCAGCAGG AGCATGACTG GACAGGGGGG CCCCGCAAGG TTCTTGAGCG CGAAGAAAAG CCGGCAGATG GTATGGTGTT TTATACCCGT AACAAGCGTA AGGTTTATGG AGGAGGAGGT ATTCTTCCCG ACATCAGGCT TGACGGATAT ATTCCCGGCA GCTATGAAGC CGCTCTGCGC AAGGACGGAA TGCTTTTCCG CTTCGCCAGT ACATACAGGG CATCACATGA TCGAATCCCG CAGGCCGGCA TTGACAGAAA ATCGTTGATG CGCGATTTTG CCGGTTTTCT CGAAAGGGAG TCGTTCATCT ACAAATCCAA ACCGGAGGAG CTTCTTGAGG AAGTCAGGAA ATCCCTTGCC GGGAATCATA AAGAGGCTCA TCCTGGGATC GATTCGCTCG TCGCATCTCT TGAAAAGGAG ATGAAACTTC TTGCCGGTAA GCAGAAATCA AGCGAATCGA AACAGGTCGC TCTTGCTCTT GAGCAGGAAA TCCTGCGTCA TTACGATGAG GAGGCTGCTC TACGAAGCCG AATAGAGGAC GATCCTGTCG TTAAAAAAGC GTTGGAAGTT CTGCAGGACC CGGGCAGGTA CAGCACGTTG CTCAGCCCAT AG
|
Protein sequence | MKRYSTSGIG SMVMRVFVSL LFFSGTFLKT AESRESDSFY IAKNIELLGE VYRNVSENYV DSIDTAEFMY AGIDGMLETL DPYTVFLDEK ESDELGELTS GHYAGIGVRI SEIAGEVYVL SVFDGSPAAK AGLRVGDRIE KVDRHIVKGK DLDEVKTFIK GPAGSEVVLT VERYGKKSRV RARITRREVR VNSIRYSGLL GEIGYLVMDS FGNRSPDELK RAINELDAAS RIRKRPMAGV ILDLRNNPGG LLEAAVDVSG LFVSKGSQVV STMGRDPESR ISYETKRAPV VEKRPLAILI NKNSASAAEI VAGAIQELDR GVIVGNRSFG KGLVQSVITL PYDSKLKMTT SKYYTPSGRL IQQEHDWTGG PRKVLEREEK PADGMVFYTR NKRKVYGGGG ILPDIRLDGY IPGSYEAALR KDGMLFRFAS TYRASHDRIP QAGIDRKSLM RDFAGFLERE SFIYKSKPEE LLEEVRKSLA GNHKEAHPGI DSLVASLEKE MKLLAGKQKS SESKQVALAL EQEILRHYDE EAALRSRIED DPVVKKALEV LQDPGRYSTL LSP
|
| |