Gene Cag_1704 details


Gene Information

Locus tag: Cag_1704
Symbol:
ID: 3746970
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium chlorochromatii CaD3
Kingdom: Bacteria
Replicon accession: NC_007514
Strand:
Start bp: 2214894
End bp: 2216699
Gene Length: 1806 bp
Protein Length: 601 aa
Translation table: 11
GC content: 49%
IMG OID: 637774241
Product: peptidase S49, protease IV
Protein accession: YP_379998
Protein GI: 78189660
COG category: [O] Posttranslational modification, protein turnover, chaperones
              [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG0616] Periplasmic serine proteases (ClpP class)
TIGRFAM ID: [TIGR00705] signal peptide peptidase SppA, 67K type
            [TIGR00706] signal peptide peptidase SppA, 36K type
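
The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon; the function names are hypothetical and are not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, not counting the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(2214894, 2216699)
print(length, protein_length_aa(length))  # prints: 1806 601
```

Both results agree with the Gene Length and Protein Length fields of this record.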


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.100824
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a

Sequence

Gene sequence
ATGAATAATA GTTCCATTCC ACAAAAGCGT CGCGGCTGTT TTCGCCCGGG CTGCTTATGG 
TTTTTAGTTG TGCCGCTCTT TATTGTGGTT GCACTTTTTT GGGCGTTTCG CTCTTCGCAC
GATATGCCCG ATCGTTTTGT GTTGGTTGTT CCTCTTAGTG GCAAATTAGC CGAAGTCAAT
AACGAGCGCT CCTCACTGCC CTTTATGCCA TCACAAGGCG ATTTATCGCT GCAAGAGGTG
CTCTTTGTGC TGCACGAAGC TGCCAAGGAT GAGCAAGTAA GTGAAGTGCT GCTGCAGCTT
GATGGCGTTG AAGCTGCACC CGCTAAAATT GCCGAAGTAC GCGCGGCTGT TGCTGACGTG
CGCCGCAAAG GCAAAAAGGT GAGCGCATTT TTATACCGTG CAGAGGATAG CGATTACTTG
CTTGCTACTG CGGCTGATAC CATTATTATG CAACGCGGTG CTTCGCTTTT GCTGGATGGC
TTAAAAGCGG AGTCGCTTTT TTATACGGGA ACATTAAACA AGCTCGGCAT TACCGTACAA
GCCGCTCAAT GGAAAGAGTA CAAAAGCGGC ATTGAGCCTT TTACCCGCAC AAGTGCCAGC
AAAGAATACC GTGAGCAAAT CAACATGCTG CTTGATGATG TTTACAACAA CTACCTTTCA
GCCGTAAGCG AACGGCGTAA AATAAGCCGA TCGGCATTTG AGGCTATTAT TAATAACGAG
GCGTTGCTTT CGGCAGAACG TGCTAAAGCG CTTGGTCTTG TTGACCGCAT TGCAACTTTT
TGGGATGTAG AGCGCTCTAT GACCAAACAG CTTACGGGCG AAGAGCTAAG TAGCGAGAAT
AATGCGCTGG TTCATGCTGC CGATTACCGC AATGCAATGG ATTACCCGCA ACACTCCAGC
ACAAGCGATG CCATTGCCGT TATTACCATG TCGGGTCCCA TTATGCGCTC GGTAGATAAC
CTTGATGACG GCATTGATGT CGCCACTATG CAACATTCGC TTGAAGCTGC CCTTGAAAAC
AAGAGCGTCA AAGCCATTGT GCTCCGCATT GATAGCCCGG GTGGCGAAGC TATTGCCTCA
GCCGATATTT TGCAAATGAT TAACGCTGCT GCTACCAAAA AAACGCTTGT CGTCTCAATG
TCGGGCGTTG CTGCATCAGG CGGTTACATG GTAGCGCTTG GCGGCAAAAC CATTGTAGCA
CATCCGCTCA CTATTACGGG TTCCATTGGC GTTTATGCGC TCAAACCAAC CATTCAAGGA
TTGGCTGAAA AGGTTGGCTT GCAACGCGAA GTTATTACAA GAGGACGTTT TGCTGATGCC
ACTTCACCCT TTACTCCGCT TGAAGGAGAA GCCTACAACA AATTTGTAGC CTCAGCAGGC
GACGTCTATA ACGACTTTAT CAGCAAAGTT GCAACATCAC GCCGCATGAA GGTAACAGCC
GTTGACTCTG TTGCAGGCGG ACGGGTATGG ACGGGCAGCC GTGCCAAGCA AGTTGGTTTG
GTTGACCGCA TGGGTGGGCT TTTTGATGCC CTTGCTTTAG CCAAAGAGCG TGCAGGCATT
AGCAAAGATA AAGAGCCAAC CATTCTCCTC TATCCCCTTC AGCAAGGATG GCTACAATCG
CTGCTGGGTG GCGCTACCCT CAATTCAGTA ACCAAAGCAA TTGCAACCGC GCTTCTCGGT
AACGTTTTAC CAATAAACGT GGAGCAACAG CCACTTTCCG CCATGCAACC ATTTTACGAT
ATGCTGATTC GTTCAGGCAA ACCGCACATG GTAGCACTTA TGCCCGCTGA AGTGGTGGTG
AAGTAA
 
Protein sequence
MNNSSIPQKR RGCFRPGCLW FLVVPLFIVV ALFWAFRSSH DMPDRFVLVV PLSGKLAEVN 
NERSSLPFMP SQGDLSLQEV LFVLHEAAKD EQVSEVLLQL DGVEAAPAKI AEVRAAVADV
RRKGKKVSAF LYRAEDSDYL LATAADTIIM QRGASLLLDG LKAESLFYTG TLNKLGITVQ
AAQWKEYKSG IEPFTRTSAS KEYREQINML LDDVYNNYLS AVSERRKISR SAFEAIINNE
ALLSAERAKA LGLVDRIATF WDVERSMTKQ LTGEELSSEN NALVHAADYR NAMDYPQHSS
TSDAIAVITM SGPIMRSVDN LDDGIDVATM QHSLEAALEN KSVKAIVLRI DSPGGEAIAS
ADILQMINAA ATKKTLVVSM SGVAASGGYM VALGGKTIVA HPLTITGSIG VYALKPTIQG
LAEKVGLQRE VITRGRFADA TSPFTPLEGE AYNKFVASAG DVYNDFISKV ATSRRMKVTA
VDSVAGGRVW TGSRAKQVGL VDRMGGLFDA LALAKERAGI SKDKEPTILL YPLQQGWLQS
LLGGATLNSV TKAIATALLG NVLPINVEQQ PLSAMQPFYD MLIRSGKPHM VALMPAEVVV
K
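
The two sequences above are printed in blocks of ten characters, so the spaces and line breaks have to be removed before any computation. The following is a minimal sketch of how the gene sequence could be cleaned, its GC content computed, and its translation compared with the listed protein. The helper names are hypothetical. The codon table is built from the standard NCBI ordering; translation table 11 assigns the same amino acids to codons as the standard code and differs only in which start codons are permitted, so the same table is used here.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
# Table 11 uses the same assignments; only the allowed start codons differ.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line fragment of the gene sequence listed above.
fragment = clean_sequence("ATGAATAATA GTTCCATTCC ACAAAAGCGT")
print(translate(fragment))  # prints: MNNSSIPQKR
```

Passing the full gene sequence through `clean_sequence` and `translate` gives a string that can be compared with the cleaned protein sequence, and `gc_percent` can be checked against the GC content field of the record.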