Gene Cagg_2258 details

Gene Information

Locus tag: Cagg_2258
Symbol:
ID: 7266670
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chloroflexus aggregans DSM 9485
Kingdom: Bacteria
Replicon accession: NC_011831
Strand:
Start bp: 2756484
End bp: 2757749
Gene Length: 1266 bp
Protein Length: 421 aa
Translation table: 11
GC content: 57%
IMG OID: 643567088
Product: carboxyl-terminal protease
Protein accession: YP_002463574
Protein GI: 219849141
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0793] Periplasmic protease
TIGRFAM ID: [TIGR00225] C-terminal peptidase (prc)
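
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the reported gene length includes the stop codon; neither convention is stated in the record, and the function names are illustrative only.

```python
# Values copied from the Gene Information record.
start_bp = 2756484
end_bp = 2757749
gene_length_bp = 1266
protein_length_aa = 421


def span_length(start, end):
    """Length of a 1-based, inclusive coordinate span (assumed convention)."""
    return end - start + 1


def coding_length(n_residues, has_stop=True):
    """Nucleotides needed to encode n_residues, optionally plus a stop codon."""
    return 3 * (n_residues + (1 if has_stop else 0))


# Both comparisons hold under the assumptions stated above.
print(span_length(start_bp, end_bp) == gene_length_bp)
print(coding_length(protein_length_aa) == gene_length_bp)
```

Under these assumptions, the span 2756484 to 2757749 covers 1266 bp, which matches 421 codons plus one stop codon.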


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0647499
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0000000459659
Fosmid Hitchhiker: No
Fosmid clonability: unclonable
 

Sequence

Gene sequence
ATGCCTGCCC TAATCCGCTT CTTACAATTT CGACTCCCGC TCTGGTTGAT TATGCCGGTG 
TTGCTGATCA CCCTCATCGG CGGCATGGGT AGTGGGGTCT GGTTGGGAAT CTGGCTCAAC
CGCCCTCAAA CCGTCAGCGC TTGCCCCGAA ACCACTGCGG TTTGCACCGA TTTTGCCGTC
TTTTGGGATG TGTGGCAATT GGCCCGCGAA CAGTATGTTG ATCCTGCGGC CGCCGAGCCT
AACCGCATGC TCGAAGGGGC CATTGATGGC ATGCTAGCGA CATTGGGTGA TGAAGGTCAT
ACCCGTTTCT TGACGGCTGC GGAAGCAGCG CAATGGCAAG AGTCACTTAC CGGCTCGTTT
GAGGGTATCG GTATTTACGT TGGGCAACGG AACGGCGCGT TACTTGTTTT AGACCTCATT
GAGGGATCGC CGGCGGCCAC TTCCGGTTTA CGTGCCGGTG ACCGGATCGT AGCAGTCGAT
GGCACCTCAG TTGAGGACTG GACAATTGAA CAGTTGGTCG CACGCATCCG TGGTCCAACG
GGAACATCGG TCACCCTTGA GGTCGTGCGG GAGAACGACG AAGTGTTACG CTTTACCATT
ACCCGCGCGA AGATCACCGC ACAGAGTGTA ACGTGGGCAA TGTTGCCCGA TCAGATCGCC
CTGATCCGCA TCACTTCGTT CGATGAGCAG GCGGCTAGTG GGTTGCGTAA GGCCTTAACT
GAAGCGCAGG CGGCGGGTAT TAGGGGGATT ATCCTCGATC TCCGCAATAA CCCCGGTGGA
TTGCTCAGCA CATTATTGAT GATTGCCGGT GAGTTCTTGC CCGCCGAGAC ACCGGTACTC
ATTGAGCGTA ACCGTGATGG CACACAGCAC GTCTCTAAGA CGCGCAAGGC AGGGATTGCC
CAAGATATAC CGCTCGTCGT CCTGATCAAT GGCGGGTCGG CGAGCGCTGC CGAGATTTTG
GCCGGGGCAT TACAAGATGC CGGACGGGCG GTGTTGGTTG GAGAAAAGAC GGTAGGTACC
GGTACGGTCT TGACACCGTT CCGTCTCCGT AACGGCGCCC AATTGCTCCT GGGAACACAA
GAGTGGCGCA CCCCATCGGG ACGCCAAATT CGTGGTAAGG GGATCGAACC GGATCGGGTC
GTGGCACAGC CGCTCGACGT ACCAATCCTT TGGCCATCGG AAGTGCGGAA CCTGAGTGCC
GAAGCGTTGG CTGCGAGCGG CGATGCGCAA TTGTTAGCAG CAATTGCCGC ACTCCAACGA
GAGTAA
 
Protein sequence
MPALIRFLQF RLPLWLIMPV LLITLIGGMG SGVWLGIWLN RPQTVSACPE TTAVCTDFAV 
FWDVWQLARE QYVDPAAAEP NRMLEGAIDG MLATLGDEGH TRFLTAAEAA QWQESLTGSF
EGIGIYVGQR NGALLVLDLI EGSPAATSGL RAGDRIVAVD GTSVEDWTIE QLVARIRGPT
GTSVTLEVVR ENDEVLRFTI TRAKITAQSV TWAMLPDQIA LIRITSFDEQ AASGLRKALT
EAQAAGIRGI ILDLRNNPGG LLSTLLMIAG EFLPAETPVL IERNRDGTQH VSKTRKAGIA
QDIPLVVLIN GGSASAAEIL AGALQDAGRA VLVGEKTVGT GTVLTPFRLR NGAQLLLGTQ
EWRTPSGRQI RGKGIEPDRV VAQPLDVPIL WPSEVRNLSA EALAASGDAQ LLAAIAALQR
E
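
The gene sequence above is printed in space-separated blocks of ten bases, so it has to be joined before any calculation such as GC content. The following is a minimal sketch of that step; the helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first block of the gene sequence is used as sample input.

```python
def clean_sequence(text):
    """Join a block-formatted sequence: drop all whitespace, uppercase the rest."""
    return "".join(text.split()).upper()


def gc_fraction(seq):
    """Fraction of G and C bases in a nucleotide string (0.0 for empty input)."""
    if not seq:
        return 0.0
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# Sample input: the first ten-base block of the gene sequence only.
# Paste the full block-formatted sequence here to get the whole-gene value.
sample = "ATGCCTGCCC"
print(f"{gc_fraction(clean_sequence(sample)):.0%}")
```

The value for a single block will differ from the whole-gene figure reported in the record, which is rounded to a whole percent.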