Gene Cagg_2562 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_2562 
Symbol 
ID7267151 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp3118089 
End bp3121238 
Gene Length3150 bp 
Protein Length1049 aa 
Translation table11 
GC content59% 
IMG OID643567386 
Productpeptidase S8/S53 subtilisin kexin sedolisin 
Protein accessionYP_002463867 
Protein GI219849434 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1404] Subtilisin-like serine proteases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.236427 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones41 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATTGCGA CGCCAAAACG ACAATTGATT GCACTGGCGG TAAGTCTGTT GTTAACCATG 
GCATTGGTCA CACCAAGTAC TGCTCAACCG CCCCGACCGA TTCCTAATCT TGATGCTTTA
CGTCTAGATG AACCGGTAAC CATCGATCAA GCACCGATCA AGCTATCACC GAGTCTTGTC
AATGTGACCG GACGGCACGC AGTTGTCATT CGTCTCAGTG AGAATCCGAC TGCGCGTATC
CCACAGGGTG CCGCACAAGT GGCCCAACTC AACCGCATCA GCCAGCAGCA GGAGCGAGTT
TTGGCGCGAT TGCGCGCGAT TGATCCGACA CTGACCGAGC TGGCACGGCT GCGTGTTGCG
CTGAACGCTG TCATCGTCGA AGTAGACGGT GCAGCACTGC CGGCCTTGGC CCGTGATATT
GAGGTTGTGC GGATCAATCC GGTGGTCGAT TATGAGCGTG CCGACCAACC TCCGATGGAG
ACGGTGCCGT ACATCGGAGC CACACCCGAA GTACAAGCCG CCGGCTATCG TGGTAAAGAT
GTACGGGTAG CCGTGCTCGA TAGTGGTATC GATTATACCC ATGCCGCATT TGGCGGCCCC
GGTACGCTCG AAGCGTATCA AGCGGCGTAT GGCACCAACC CTTCCGATTC GCGCAATAAG
ACGCTGGATG GTCTCTTCCC AACCGATCGG GTCAAAGGGG GTTATGACTT TGTCGGCGAA
GTGTGGCCAA ATGGCCCGCT GATGCCCGAC CCCGATCCGA TCGACTGTGG TGTATCTGGT
CTGAGCAGCG GCACCTGCGC TGGCGGGCAT GGCACTCATG TCGCCGACAT TATCGGCGGC
CAACAGGGTG TGGCGCCGGA AGTCGATCTG TTTGCGGTGA AAGTCTGTTC GGCGGTGTCG
TCGTCGTGCA GCGGCGTCGC TATTCTCCAA GGTCTCGACT GGGTAGCCGA CCCCAACGGC
GACGGTATCA CCGACGATCA CATGCACATT GTGAATATGT CGCTGGGTGC GTCGTATGGG
CAGAACTACG ACGACGACTC GGCGATTGCG GTTGACAATT TGCAGCCGCT GGGCATTTTG
GTGGTGGCTT CGGCTGGCAA TAGCTCCGAC CGCCCCTACA TCACCGGCAC CCCTGCCGGC
GCGCGCACAG CCCTCTCGGT AGCACAGACG GCGGTACCAA GTGATTCATC GTATCCGATT
ACCGTACTCA GCACAACGGT GTACGGTGTT GCCCAGCCGT GGGCGCCGAT CCCGAGCACG
GCAGTCAGTG GCATCTTGGT CTATGGTGCA TCGTTGGGCA ACGCACTCGG TTGTACCGCA
TACCCGCCCG GTTCACTGAC CGGCCGAATC CTACTGGTTG ACCGCGGTAC CTGCGCTATC
AGCATCAAGG GTTCCAATGG CGCAGCCGCC GGTGCTGTAG CCGTGATCGT GGCGAACAAC
GTGGCCGGTG CAGTCCCGCC GACGTTCAGC TTCGGTGGTG GCTCACCGAC CGTACCGGTA
CTGTCGATCA CACAGACCGC CGGCAATGCT TTGAAAGCGC GGGTAAATAA CTCGGCGACC
GTAGACTTTG CGAATCCGGT CAGTAACGCC GGTAGTGTGG TTGGCACCTC GTCGCGCGGG
CCGACCATGG GTCAGATGAC CTATGGCAAT CAGATAATGT ATGGTCAGAT CATCAAGCCG
GAAATCGGTG CACCGGGTGC CTCGATCTCG GCGGTAGCCG GTAGTGGTAC CGGTGTTGAA
CCATTCGGTG GGACCTCAGG GGCTGCTCCG ATGGTGGCCG GTGCAGCGGC ACTGCTCTAC
AACGCCTCAA ACTGGAGCCT CTCGCCGTGG GAGTTGAAGG CGCGTCTGAT CAACACGGCG
GAAACGAATA TCTACAACGG TCCGCCGGTC TTCGTTGGTC CGACCTTGGC CCCAATTACC
CGTATCGGTG GTGGTGAGGT GCGCATCAAC CGCGCAATCG CGGCTCAGGC CGGTGCGTGG
GAATTGAGCA ACGGAGCGGC AACCATCTCA TTCGGTTTGG TTGAGGTAAC GCGCAACCCG
ACCACGTTAC GTCGCACGAT TGTGGTGCGC AACTACGGCG ATACGTCGCT CACCTACACG
ATCACGCCGA CCTTCCGCTT TGCTAACGAT GCCGCGAGTG GGGCGATTAC CCCAGGCACA
TCGGTGACCA CGATCACAGT GCCACCGCGT AGCCGGCGCA CGTTCACGCT GACGTTGCGT
ATTGATCCGA CGAAACTGCC GGCGTGGGTG CTCAATTCAG GGCTAAATGG TGGTAACGGT
GCAGCACTGA CCGCCGTCGA GTACGATGGC TATCTGGTGT TGGATGCGCC TGGCACCACC
AACGACTTGA CGATGCCGTG GCATGTGTTG CCGCGAGCTG CGGGTAACGT GAGCGCACCT
GCTTCAGTAC GGGCGACGAC CAGCAGCCCG GCGATGGCGC GTCTGACTAA CACCGGTGCC
AATCCGGTGT CGGTCGATCC GTTTACGCTG ATCGGCAGCA ATAACACGGT GAACGCTTCG
CTGGGGGCCG GGATGCAGAT GCCGCCGATG GATCTGCGCT ATGTCGGTGT CCGTGCTTTC
GATGGGACCG GTGTTTGTCC GGCTAATAGC CCAATAATCC AGTTTGCAGT TACGACGCAC
CAGCGGATTA CTCATTCCAA CTATCCGGTG GAGATCGATC TGCTGTTCGA TACCAACCGT
GATGGTACTC CCGATTATGT TGGCTACACG GCTGAGGTGG GCAGCTTTGC CAGCGATGGA
CGCAATGCCT TCTTTGTCGG TCCGGTGGGT GGCCCGTACA GTGCCTTCTT CTTCACCAGC
CATGCTACCA ACAGCACCAA CACCGTGGTC ACGTTGTGCG GCAGCCAGAT CGGGGTCACT
GCACTGGGCC AGCAGATCAA CGTCGATGCC TACACCTATG ACAACTACTT CACCGGCTTC
GAGTTGAGCA AGATCGAGGG GATGAGCACG GTGCTCGGCG CCCCACGCTT CGATATTGAC
ATCGGTAGTG GAGTGGTGCC GGCAAACGGT TCGCTCACGA CGACGGTGTA CCGCTTCAAC
GCCCCAAGTG ATGTGACCGA CTCGGGCATT CTGTTGCAGT ACTCCTTCGC GCCGGCAGGT
CGTGAGGCGA GTGCGATCAT TGTTCGTTAG
 
Protein sequence
MIATPKRQLI ALAVSLLLTM ALVTPSTAQP PRPIPNLDAL RLDEPVTIDQ APIKLSPSLV 
NVTGRHAVVI RLSENPTARI PQGAAQVAQL NRISQQQERV LARLRAIDPT LTELARLRVA
LNAVIVEVDG AALPALARDI EVVRINPVVD YERADQPPME TVPYIGATPE VQAAGYRGKD
VRVAVLDSGI DYTHAAFGGP GTLEAYQAAY GTNPSDSRNK TLDGLFPTDR VKGGYDFVGE
VWPNGPLMPD PDPIDCGVSG LSSGTCAGGH GTHVADIIGG QQGVAPEVDL FAVKVCSAVS
SSCSGVAILQ GLDWVADPNG DGITDDHMHI VNMSLGASYG QNYDDDSAIA VDNLQPLGIL
VVASAGNSSD RPYITGTPAG ARTALSVAQT AVPSDSSYPI TVLSTTVYGV AQPWAPIPST
AVSGILVYGA SLGNALGCTA YPPGSLTGRI LLVDRGTCAI SIKGSNGAAA GAVAVIVANN
VAGAVPPTFS FGGGSPTVPV LSITQTAGNA LKARVNNSAT VDFANPVSNA GSVVGTSSRG
PTMGQMTYGN QIMYGQIIKP EIGAPGASIS AVAGSGTGVE PFGGTSGAAP MVAGAAALLY
NASNWSLSPW ELKARLINTA ETNIYNGPPV FVGPTLAPIT RIGGGEVRIN RAIAAQAGAW
ELSNGAATIS FGLVEVTRNP TTLRRTIVVR NYGDTSLTYT ITPTFRFAND AASGAITPGT
SVTTITVPPR SRRTFTLTLR IDPTKLPAWV LNSGLNGGNG AALTAVEYDG YLVLDAPGTT
NDLTMPWHVL PRAAGNVSAP ASVRATTSSP AMARLTNTGA NPVSVDPFTL IGSNNTVNAS
LGAGMQMPPM DLRYVGVRAF DGTGVCPANS PIIQFAVTTH QRITHSNYPV EIDLLFDTNR
DGTPDYVGYT AEVGSFASDG RNAFFVGPVG GPYSAFFFTS HATNSTNTVV TLCGSQIGVT
ALGQQINVDA YTYDNYFTGF ELSKIEGMST VLGAPRFDID IGSGVVPANG SLTTTVYRFN
APSDVTDSGI LLQYSFAPAG REASAIIVR