Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cagg_2562 |
Symbol | |
ID | 7267151 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chloroflexus aggregans DSM 9485 |
Kingdom | Bacteria |
Replicon accession | NC_011831 |
Strand | + |
Start bp | 3118089 |
End bp | 3121238 |
Gene Length | 3150 bp |
Protein Length | 1049 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 643567386 |
Product | peptidase S8/S53 subtilisin kexin sedolisin |
Protein accession | YP_002463867 |
Protein GI | 219849434 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG1404] Subtilisin-like serine proteases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.236427 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 41 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATTGCGA CGCCAAAACG ACAATTGATT GCACTGGCGG TAAGTCTGTT GTTAACCATG GCATTGGTCA CACCAAGTAC TGCTCAACCG CCCCGACCGA TTCCTAATCT TGATGCTTTA CGTCTAGATG AACCGGTAAC CATCGATCAA GCACCGATCA AGCTATCACC GAGTCTTGTC AATGTGACCG GACGGCACGC AGTTGTCATT CGTCTCAGTG AGAATCCGAC TGCGCGTATC CCACAGGGTG CCGCACAAGT GGCCCAACTC AACCGCATCA GCCAGCAGCA GGAGCGAGTT TTGGCGCGAT TGCGCGCGAT TGATCCGACA CTGACCGAGC TGGCACGGCT GCGTGTTGCG CTGAACGCTG TCATCGTCGA AGTAGACGGT GCAGCACTGC CGGCCTTGGC CCGTGATATT GAGGTTGTGC GGATCAATCC GGTGGTCGAT TATGAGCGTG CCGACCAACC TCCGATGGAG ACGGTGCCGT ACATCGGAGC CACACCCGAA GTACAAGCCG CCGGCTATCG TGGTAAAGAT GTACGGGTAG CCGTGCTCGA TAGTGGTATC GATTATACCC ATGCCGCATT TGGCGGCCCC GGTACGCTCG AAGCGTATCA AGCGGCGTAT GGCACCAACC CTTCCGATTC GCGCAATAAG ACGCTGGATG GTCTCTTCCC AACCGATCGG GTCAAAGGGG GTTATGACTT TGTCGGCGAA GTGTGGCCAA ATGGCCCGCT GATGCCCGAC CCCGATCCGA TCGACTGTGG TGTATCTGGT CTGAGCAGCG GCACCTGCGC TGGCGGGCAT GGCACTCATG TCGCCGACAT TATCGGCGGC CAACAGGGTG TGGCGCCGGA AGTCGATCTG TTTGCGGTGA AAGTCTGTTC GGCGGTGTCG TCGTCGTGCA GCGGCGTCGC TATTCTCCAA GGTCTCGACT GGGTAGCCGA CCCCAACGGC GACGGTATCA CCGACGATCA CATGCACATT GTGAATATGT CGCTGGGTGC GTCGTATGGG CAGAACTACG ACGACGACTC GGCGATTGCG GTTGACAATT TGCAGCCGCT GGGCATTTTG GTGGTGGCTT CGGCTGGCAA TAGCTCCGAC CGCCCCTACA TCACCGGCAC CCCTGCCGGC GCGCGCACAG CCCTCTCGGT AGCACAGACG GCGGTACCAA GTGATTCATC GTATCCGATT ACCGTACTCA GCACAACGGT GTACGGTGTT GCCCAGCCGT GGGCGCCGAT CCCGAGCACG GCAGTCAGTG GCATCTTGGT CTATGGTGCA TCGTTGGGCA ACGCACTCGG TTGTACCGCA TACCCGCCCG GTTCACTGAC CGGCCGAATC CTACTGGTTG ACCGCGGTAC CTGCGCTATC AGCATCAAGG GTTCCAATGG CGCAGCCGCC GGTGCTGTAG CCGTGATCGT GGCGAACAAC GTGGCCGGTG CAGTCCCGCC GACGTTCAGC TTCGGTGGTG GCTCACCGAC CGTACCGGTA CTGTCGATCA CACAGACCGC CGGCAATGCT TTGAAAGCGC GGGTAAATAA CTCGGCGACC GTAGACTTTG CGAATCCGGT CAGTAACGCC GGTAGTGTGG TTGGCACCTC GTCGCGCGGG CCGACCATGG GTCAGATGAC CTATGGCAAT CAGATAATGT ATGGTCAGAT CATCAAGCCG GAAATCGGTG CACCGGGTGC CTCGATCTCG GCGGTAGCCG GTAGTGGTAC CGGTGTTGAA CCATTCGGTG GGACCTCAGG GGCTGCTCCG ATGGTGGCCG GTGCAGCGGC ACTGCTCTAC AACGCCTCAA ACTGGAGCCT CTCGCCGTGG GAGTTGAAGG CGCGTCTGAT CAACACGGCG GAAACGAATA TCTACAACGG TCCGCCGGTC TTCGTTGGTC CGACCTTGGC CCCAATTACC CGTATCGGTG GTGGTGAGGT GCGCATCAAC CGCGCAATCG CGGCTCAGGC CGGTGCGTGG GAATTGAGCA ACGGAGCGGC AACCATCTCA TTCGGTTTGG TTGAGGTAAC GCGCAACCCG ACCACGTTAC GTCGCACGAT TGTGGTGCGC AACTACGGCG ATACGTCGCT CACCTACACG ATCACGCCGA CCTTCCGCTT TGCTAACGAT GCCGCGAGTG GGGCGATTAC CCCAGGCACA TCGGTGACCA CGATCACAGT GCCACCGCGT AGCCGGCGCA CGTTCACGCT GACGTTGCGT ATTGATCCGA CGAAACTGCC GGCGTGGGTG CTCAATTCAG GGCTAAATGG TGGTAACGGT GCAGCACTGA CCGCCGTCGA GTACGATGGC TATCTGGTGT TGGATGCGCC TGGCACCACC AACGACTTGA CGATGCCGTG GCATGTGTTG CCGCGAGCTG CGGGTAACGT GAGCGCACCT GCTTCAGTAC GGGCGACGAC CAGCAGCCCG GCGATGGCGC GTCTGACTAA CACCGGTGCC AATCCGGTGT CGGTCGATCC GTTTACGCTG ATCGGCAGCA ATAACACGGT GAACGCTTCG CTGGGGGCCG GGATGCAGAT GCCGCCGATG GATCTGCGCT ATGTCGGTGT CCGTGCTTTC GATGGGACCG GTGTTTGTCC GGCTAATAGC CCAATAATCC AGTTTGCAGT TACGACGCAC CAGCGGATTA CTCATTCCAA CTATCCGGTG GAGATCGATC TGCTGTTCGA TACCAACCGT GATGGTACTC CCGATTATGT TGGCTACACG GCTGAGGTGG GCAGCTTTGC CAGCGATGGA CGCAATGCCT TCTTTGTCGG TCCGGTGGGT GGCCCGTACA GTGCCTTCTT CTTCACCAGC CATGCTACCA ACAGCACCAA CACCGTGGTC ACGTTGTGCG GCAGCCAGAT CGGGGTCACT GCACTGGGCC AGCAGATCAA CGTCGATGCC TACACCTATG ACAACTACTT CACCGGCTTC GAGTTGAGCA AGATCGAGGG GATGAGCACG GTGCTCGGCG CCCCACGCTT CGATATTGAC ATCGGTAGTG GAGTGGTGCC GGCAAACGGT TCGCTCACGA CGACGGTGTA CCGCTTCAAC GCCCCAAGTG ATGTGACCGA CTCGGGCATT CTGTTGCAGT ACTCCTTCGC GCCGGCAGGT CGTGAGGCGA GTGCGATCAT TGTTCGTTAG
|
Protein sequence | MIATPKRQLI ALAVSLLLTM ALVTPSTAQP PRPIPNLDAL RLDEPVTIDQ APIKLSPSLV NVTGRHAVVI RLSENPTARI PQGAAQVAQL NRISQQQERV LARLRAIDPT LTELARLRVA LNAVIVEVDG AALPALARDI EVVRINPVVD YERADQPPME TVPYIGATPE VQAAGYRGKD VRVAVLDSGI DYTHAAFGGP GTLEAYQAAY GTNPSDSRNK TLDGLFPTDR VKGGYDFVGE VWPNGPLMPD PDPIDCGVSG LSSGTCAGGH GTHVADIIGG QQGVAPEVDL FAVKVCSAVS SSCSGVAILQ GLDWVADPNG DGITDDHMHI VNMSLGASYG QNYDDDSAIA VDNLQPLGIL VVASAGNSSD RPYITGTPAG ARTALSVAQT AVPSDSSYPI TVLSTTVYGV AQPWAPIPST AVSGILVYGA SLGNALGCTA YPPGSLTGRI LLVDRGTCAI SIKGSNGAAA GAVAVIVANN VAGAVPPTFS FGGGSPTVPV LSITQTAGNA LKARVNNSAT VDFANPVSNA GSVVGTSSRG PTMGQMTYGN QIMYGQIIKP EIGAPGASIS AVAGSGTGVE PFGGTSGAAP MVAGAAALLY NASNWSLSPW ELKARLINTA ETNIYNGPPV FVGPTLAPIT RIGGGEVRIN RAIAAQAGAW ELSNGAATIS FGLVEVTRNP TTLRRTIVVR NYGDTSLTYT ITPTFRFAND AASGAITPGT SVTTITVPPR SRRTFTLTLR IDPTKLPAWV LNSGLNGGNG AALTAVEYDG YLVLDAPGTT NDLTMPWHVL PRAAGNVSAP ASVRATTSSP AMARLTNTGA NPVSVDPFTL IGSNNTVNAS LGAGMQMPPM DLRYVGVRAF DGTGVCPANS PIIQFAVTTH QRITHSNYPV EIDLLFDTNR DGTPDYVGYT AEVGSFASDG RNAFFVGPVG GPYSAFFFTS HATNSTNTVV TLCGSQIGVT ALGQQINVDA YTYDNYFTGF ELSKIEGMST VLGAPRFDID IGSGVVPANG SLTTTVYRFN APSDVTDSGI LLQYSFAPAG REASAIIVR
|
| |