Gene Cagg_1767 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_1767 
Symbol 
ID7267679 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp2158868 
End bp2161189 
Gene Length2322 bp 
Protein Length773 aa 
Translation table11 
GC content59% 
IMG OID643566608 
Productpeptidase S8/S53 subtilisin kexin sedolisin 
Protein accessionYP_002463103 
Protein GI219848670 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1404] Subtilisin-like serine proteases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.126212 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCATAC CATTCGACAG TACGCCTCCC CGCAATAGCA TCTCATGGGC CAAAGGCTGC 
GCACTGACGT CGATTCTCGG CATTGGATTA GGCATCAATG CCATCTTGCG ATTGGTGGAA
ATCGGCGTAA CCCTCTTCGC CAGCGGCGAT ACAGGGCGCA TGATCGTCAT CTGGATTAGC
TTGGTTATCG GTACCGCGAT ATTTGGGCTG TGGGTCTGGA TCGGCAGTGG AGCCGTGCGG
ACAGCAGCAC TTCGCTGGTT GGCTGCCTTA CCATTACCGG CAATGCTGGC CCTGACCATT
ATGATTCCCT CTGCCGAGAG CCAATTGCTG ATGGTTACTC AACTCGGACT AAGCCTGATC
TACACACTGG TTTTGGCATG GCTGATACGT CGTCCGCTAC AATGGCAGAC AACTCCAATC
CTCTTCAGTG CAGGGGCGTT CCTCGCGTTG CCGTGGCTAG CTATCGGCAC GCCCGGTTCG
TGGGTCGACG TAGGCCTCGC CTTAGCACTC GGTGCGACAA TAGGCTGGGC TGGCGGTCAA
TTACTCGCAT GGCGACCACC GCTCAGTTGG GTACACGGTG AGGTACCGAT GAGGGTCTGG
CTCGGCGAAG GGGCTGCGCT CAGTCTCCTT CTGCTTGTGC TGAGTCGTGC GCTCGGCCCG
AACGGTGCAG CCCTGTTATT CTTCTGTTGC GCGCCGGTCG GGGGCTGGTT ATGGTTGGCA
ATTGGGCGGC ACGTCGATGT GAAGGCCATC GTTCCCCTCT CGGCCCTATT TGGTCTCTTC
TTTTTTGGGG TGCCACTCGT CCTGACGGAT TCCGACGCAT TAGTGCCATT GTTATTATTT
GGCAGCTTCC CAGAGGGTTT TCATCTAGCC TTGCTGGCCG CTCTTGGGCA GGCACTGCTA
GCCTTACTGG TCATACCATT AGTCCTTTTG CTCCCCAATC GTCTGCTCCA TACTGCCGGT
GCAGCACTTG TGGCAGTGGC CGGGCTGACG CTGCTCATCG GAGGTGGACG AGCAACACCG
GCGGGGGATC GCTGGTTCGT GGTGTTGCGC GATCAAGCAG ATGTGAGCGA CCTCACGACC
ATCACCGATT ACCAAACCCG TCGCAGCGCC GTGTATCGCC GTCTAACCGA TCATGCCATC
GCAACGCAAG CCGATCTTCG AACAGCGCTT GACGGCTTAG GTGTGCGCTA CACACCCTAC
TATCTCGTCA ACGCCTTAGA AGTTGAAGGT GATCTCCCAC TGCGTGTTTG GTTGGCAAAC
CGGCCCGAAG TGGCCGAGGT GATGCCATCA CCATTTTTAC GCCCGTTACC GCTACCAATC
CAAACAGCGC GTGGCGACGA ACCACCACCT GATGAAACAC CCACCAATCT CACCGTTATC
GGTGTACCGG AGGTCTGGGC ATTAGGAGTG CGTGGCGCCG GCATTCTCGT CGGCCAAGCC
GATTCGGGCG TCGATGCCGA ACATCCTGAA TTGAGCGATG CGTATGCCGG TCAGACGGAA
AACGGGGTCG TCCACGCTTA TCACTGGCTC GATCCGTGGA CGGGTGCAGC AGCACCGTAT
GATCACAGTG GGCACGGCAC CCACACCCTC GCGACAATCT TGGGAAATCG GGTCGGAGTC
GCACCGGATG CGCAATGGAT CGGGTGCGTC AATCTGGCCC GTAACCTCGG CAACGCACCG
CGCTATCTCG ACTGTATGCA ATTCCTGTTT GCGCCGTACC CACCGGGTGG CGACCCTTTA
CGCGACGGCG ACCCAACACG AGGTGCGCAC ATCCTCAACA ACTCATGGGG CTGCCCGCAA
GACCTCGAAG GCTGCACGCC CACATCACTC CAACCGGTGG TCCGTGCGCT CCGTGCCGCC
AGCGTCTTTG TGGTTGTGAG TGCCGGCAAT GACGGACCGA CATGCAGTTC ACTCAACACT
CCGCTCGCCA TCTATGACGA AGTGATGACG GTCGGCGCCG TGAATAACGA GGGTCGCCTG
GCACCTTTCA GCAGTGTCGG CCCGGTTGTC AGTGATGGTA GCCTCCGCCC TAAACCCGAC
CTGCTCGCAC CGGGGGTAGA GGTGTTATCG GCGTTTCCCA ATACTACCTA TTATCGAGCG
AGCGGCACGT CGATGGCCGG CCCACACGTG GCCGGAGCAG TAGCCCTCCT CTGGTCAGCA
AACCCCGCTC TGATCGGAAA CATTGACGCG ACCGAGCAGA TACTCCGCGA TACTGCCCGT
CCCTATCCCT TCGACGATGG CGACCGGTGT GGTGCAGGGA ATGGCACCGG AGCCGGTATT
CTCGATGTTG CATCCGCAGT ACGGCGAGCA CTGACACGAT AA
 
Protein sequence
MTIPFDSTPP RNSISWAKGC ALTSILGIGL GINAILRLVE IGVTLFASGD TGRMIVIWIS 
LVIGTAIFGL WVWIGSGAVR TAALRWLAAL PLPAMLALTI MIPSAESQLL MVTQLGLSLI
YTLVLAWLIR RPLQWQTTPI LFSAGAFLAL PWLAIGTPGS WVDVGLALAL GATIGWAGGQ
LLAWRPPLSW VHGEVPMRVW LGEGAALSLL LLVLSRALGP NGAALLFFCC APVGGWLWLA
IGRHVDVKAI VPLSALFGLF FFGVPLVLTD SDALVPLLLF GSFPEGFHLA LLAALGQALL
ALLVIPLVLL LPNRLLHTAG AALVAVAGLT LLIGGGRATP AGDRWFVVLR DQADVSDLTT
ITDYQTRRSA VYRRLTDHAI ATQADLRTAL DGLGVRYTPY YLVNALEVEG DLPLRVWLAN
RPEVAEVMPS PFLRPLPLPI QTARGDEPPP DETPTNLTVI GVPEVWALGV RGAGILVGQA
DSGVDAEHPE LSDAYAGQTE NGVVHAYHWL DPWTGAAAPY DHSGHGTHTL ATILGNRVGV
APDAQWIGCV NLARNLGNAP RYLDCMQFLF APYPPGGDPL RDGDPTRGAH ILNNSWGCPQ
DLEGCTPTSL QPVVRALRAA SVFVVVSAGN DGPTCSSLNT PLAIYDEVMT VGAVNNEGRL
APFSSVGPVV SDGSLRPKPD LLAPGVEVLS AFPNTTYYRA SGTSMAGPHV AGAVALLWSA
NPALIGNIDA TEQILRDTAR PYPFDDGDRC GAGNGTGAGI LDVASAVRRA LTR