Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cag_1704 |
Symbol | |
ID | 3746970 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium chlorochromatii CaD3 |
Kingdom | Bacteria |
Replicon accession | NC_007514 |
Strand | - |
Start bp | 2214894 |
End bp | 2216699 |
Gene Length | 1806 bp |
Protein Length | 601 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 637774241 |
Product | peptidase S49, protease IV |
Protein accession | YP_379998 |
Protein GI | 78189660 |
COG category | [O] Posttranslational modification, protein turnover, chaperones [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG0616] Periplasmic serine proteases (ClpP class) |
TIGRFAM ID | [TIGR00705] signal peptide peptidase SppA, 67K type [TIGR00706] signal peptide peptidase SppA, 36K type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.100824 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATAATA GTTCCATTCC ACAAAAGCGT CGCGGCTGTT TTCGCCCGGG CTGCTTATGG TTTTTAGTTG TGCCGCTCTT TATTGTGGTT GCACTTTTTT GGGCGTTTCG CTCTTCGCAC GATATGCCCG ATCGTTTTGT GTTGGTTGTT CCTCTTAGTG GCAAATTAGC CGAAGTCAAT AACGAGCGCT CCTCACTGCC CTTTATGCCA TCACAAGGCG ATTTATCGCT GCAAGAGGTG CTCTTTGTGC TGCACGAAGC TGCCAAGGAT GAGCAAGTAA GTGAAGTGCT GCTGCAGCTT GATGGCGTTG AAGCTGCACC CGCTAAAATT GCCGAAGTAC GCGCGGCTGT TGCTGACGTG CGCCGCAAAG GCAAAAAGGT GAGCGCATTT TTATACCGTG CAGAGGATAG CGATTACTTG CTTGCTACTG CGGCTGATAC CATTATTATG CAACGCGGTG CTTCGCTTTT GCTGGATGGC TTAAAAGCGG AGTCGCTTTT TTATACGGGA ACATTAAACA AGCTCGGCAT TACCGTACAA GCCGCTCAAT GGAAAGAGTA CAAAAGCGGC ATTGAGCCTT TTACCCGCAC AAGTGCCAGC AAAGAATACC GTGAGCAAAT CAACATGCTG CTTGATGATG TTTACAACAA CTACCTTTCA GCCGTAAGCG AACGGCGTAA AATAAGCCGA TCGGCATTTG AGGCTATTAT TAATAACGAG GCGTTGCTTT CGGCAGAACG TGCTAAAGCG CTTGGTCTTG TTGACCGCAT TGCAACTTTT TGGGATGTAG AGCGCTCTAT GACCAAACAG CTTACGGGCG AAGAGCTAAG TAGCGAGAAT AATGCGCTGG TTCATGCTGC CGATTACCGC AATGCAATGG ATTACCCGCA ACACTCCAGC ACAAGCGATG CCATTGCCGT TATTACCATG TCGGGTCCCA TTATGCGCTC GGTAGATAAC CTTGATGACG GCATTGATGT CGCCACTATG CAACATTCGC TTGAAGCTGC CCTTGAAAAC AAGAGCGTCA AAGCCATTGT GCTCCGCATT GATAGCCCGG GTGGCGAAGC TATTGCCTCA GCCGATATTT TGCAAATGAT TAACGCTGCT GCTACCAAAA AAACGCTTGT CGTCTCAATG TCGGGCGTTG CTGCATCAGG CGGTTACATG GTAGCGCTTG GCGGCAAAAC CATTGTAGCA CATCCGCTCA CTATTACGGG TTCCATTGGC GTTTATGCGC TCAAACCAAC CATTCAAGGA TTGGCTGAAA AGGTTGGCTT GCAACGCGAA GTTATTACAA GAGGACGTTT TGCTGATGCC ACTTCACCCT TTACTCCGCT TGAAGGAGAA GCCTACAACA AATTTGTAGC CTCAGCAGGC GACGTCTATA ACGACTTTAT CAGCAAAGTT GCAACATCAC GCCGCATGAA GGTAACAGCC GTTGACTCTG TTGCAGGCGG ACGGGTATGG ACGGGCAGCC GTGCCAAGCA AGTTGGTTTG GTTGACCGCA TGGGTGGGCT TTTTGATGCC CTTGCTTTAG CCAAAGAGCG TGCAGGCATT AGCAAAGATA AAGAGCCAAC CATTCTCCTC TATCCCCTTC AGCAAGGATG GCTACAATCG CTGCTGGGTG GCGCTACCCT CAATTCAGTA ACCAAAGCAA TTGCAACCGC GCTTCTCGGT AACGTTTTAC CAATAAACGT GGAGCAACAG CCACTTTCCG CCATGCAACC ATTTTACGAT ATGCTGATTC GTTCAGGCAA ACCGCACATG GTAGCACTTA TGCCCGCTGA AGTGGTGGTG AAGTAA
|
Protein sequence | MNNSSIPQKR RGCFRPGCLW FLVVPLFIVV ALFWAFRSSH DMPDRFVLVV PLSGKLAEVN NERSSLPFMP SQGDLSLQEV LFVLHEAAKD EQVSEVLLQL DGVEAAPAKI AEVRAAVADV RRKGKKVSAF LYRAEDSDYL LATAADTIIM QRGASLLLDG LKAESLFYTG TLNKLGITVQ AAQWKEYKSG IEPFTRTSAS KEYREQINML LDDVYNNYLS AVSERRKISR SAFEAIINNE ALLSAERAKA LGLVDRIATF WDVERSMTKQ LTGEELSSEN NALVHAADYR NAMDYPQHSS TSDAIAVITM SGPIMRSVDN LDDGIDVATM QHSLEAALEN KSVKAIVLRI DSPGGEAIAS ADILQMINAA ATKKTLVVSM SGVAASGGYM VALGGKTIVA HPLTITGSIG VYALKPTIQG LAEKVGLQRE VITRGRFADA TSPFTPLEGE AYNKFVASAG DVYNDFISKV ATSRRMKVTA VDSVAGGRVW TGSRAKQVGL VDRMGGLFDA LALAKERAGI SKDKEPTILL YPLQQGWLQS LLGGATLNSV TKAIATALLG NVLPINVEQQ PLSAMQPFYD MLIRSGKPHM VALMPAEVVV K
|
| |