Gene Cagg_0783 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_0783 
Symbol 
ID7268102 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp969256 
End bp971049 
Gene Length1794 bp 
Protein Length597 aa 
Translation table11 
GC content56% 
IMG OID643565634 
Productpeptidase S8/S53 subtilisin kexin sedolisin 
Protein accessionYP_002462143 
Protein GI219847710 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1404] Subtilisin-like serine proteases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGTATCC ACCTTATCTG CTTGTGCCTG ATTGCCTTAC TCACCGGTGT GATCCCGCCC 
ACCCCTGTAC AAGCTGCTGA TACAACCGAT CTGATCGTCA AAGTTACCGC TGCACGCTAC
GCCTCGACGG TCGGGCGTAT GCTTGGTGCC TACCAGATCA GGAGGATCGG CGAACAGACC
TATCTGATGT ACTTGCCGGA CGGCATGCCG GCAACGTTCG CCGAGAGACG GATACGGAGC
GCGGAGGGCG TTATCTACGC CGTCCCTGAC TATGAGCGCA CAGTCAGCCG GACACCGAAC
GACGTGTTGC TGCCCAAACA ATGGTCACTC GGTACGATGG GTGTATACGA TGCGTGGGAG
ATCACCACCG GTAGTCCGTT GCCCATCGCA GTGATCGATA CCGGGGTTGA TGCCGGTCAC
CCTGAGCTAT CCGGGCGGGT ACTCGGTGGT TTCAATGCGA TCACCGGGAG CACCGATGCC
AGCGATGACA ATGGGCATGG CACTGCTGTT GCAGGGCTAA TCGCTGCCGC CGGTGACAAC
GGTGAAGGCA TTGCCGGGCT GTGTTGGAGC TGCCTGATTA TTCCGGTCAA GGCTTGTTTG
AGCAACGGAC GTTGCCGTGA TTCGAGTGTC ATTAGTGCCA TCCGTTGGGC TACCGACAAC
GGCGCACGCA TCATTAATCT GAGTCTGGGA GGAAGTTTTG ACTCGCCTGC ACTACGCGAT
GCCGTGCGTT ATGCGAGTGC GCGAGGTGTC TTGGTGGTAG CAGCGAGTGG TAACGAGCGT
GCGGATGGCA ATGCACCCAA TTATCCGGCT GCCTACCCTG AAACGATTGC CGTTGGGGCG
ACCGGCTATA ACGATGAAGT TACCAGTTTC TCGAACACCG GCAATTTCAT CGATCTCGTT
GCACCGGGGA TCGATATTGC GACGACCTTA CCCAACAACA GGTATGCCAT CGTAACCGGA
ACTTCCTTTG CCGGGCCATT TGTTGTGGGA GCGGCAGCAC TGGTGATGGC GATTCGTCCC
GACCTATCGC ACAACGATGT CCGTTGTATT CTCGCTATTT CTACCGATGA CCGTGGGACA
CCGGGCCGTG ATGACGAGTA TGGCTACGGG CGTTTGAACG TTTGGCATGC CGTTATGACG
GCAAGTACCT ACGGCGGCTG CCCGCTCGGC GCACCAACGA CAACCGGTAA CCGCGATTTC
GATCCCTTTG CGCGGGTAGA CCCACCGAGT GATGGTCGTT ACTACTTCCC GGAAACCGGT
CATACACTCG GCGGTGGCTT CCGCACGTAT TGGGAGCAGA ATGGTGGGCT ACCGATTTTC
GGTTTTCCAA TCAGTGAAGA GTTTACCGAA ATCGGCAGCG ATGGAAAACC GGTTACAGTT
CAATATTTCG AGCGTCATCG TTTTGAGTGG CGGCCAGAGA ACACGCCACC GTACCACGTC
CTGCTGTCAC GGATGGGTGA TGATCTCTTG CGTCGGCAAG GTCGGGATTG GTATACCTTT
GAACGGAGCG GTCCGATTCA GGGTTGCCTC TACTTTGCCG AAACAAATCA GGCACTATGT
GAGCCGTTCT TGAGCTATTG GCGGAACCAT GGGCTTGAAT TCGATGGCAA ACCCGGCAAG
AGCTATGCCG AGAGCCTCGC ATTGTTCGGT TTACCACTCT CTATGCCGCG CATTGAAGAG
ACGCAACCCG GCAAGGTTTT GATCGTGCAG TGGTTTGAGC GCGCCCGCTT CGAGTTACAT
CCCGATGGAA GTGTGTTGCT GGGATTGTTG GGGAATGAGC TGGTCGGACG ATAA
 
Protein sequence
MRIHLICLCL IALLTGVIPP TPVQAADTTD LIVKVTAARY ASTVGRMLGA YQIRRIGEQT 
YLMYLPDGMP ATFAERRIRS AEGVIYAVPD YERTVSRTPN DVLLPKQWSL GTMGVYDAWE
ITTGSPLPIA VIDTGVDAGH PELSGRVLGG FNAITGSTDA SDDNGHGTAV AGLIAAAGDN
GEGIAGLCWS CLIIPVKACL SNGRCRDSSV ISAIRWATDN GARIINLSLG GSFDSPALRD
AVRYASARGV LVVAASGNER ADGNAPNYPA AYPETIAVGA TGYNDEVTSF SNTGNFIDLV
APGIDIATTL PNNRYAIVTG TSFAGPFVVG AAALVMAIRP DLSHNDVRCI LAISTDDRGT
PGRDDEYGYG RLNVWHAVMT ASTYGGCPLG APTTTGNRDF DPFARVDPPS DGRYYFPETG
HTLGGGFRTY WEQNGGLPIF GFPISEEFTE IGSDGKPVTV QYFERHRFEW RPENTPPYHV
LLSRMGDDLL RRQGRDWYTF ERSGPIQGCL YFAETNQALC EPFLSYWRNH GLEFDGKPGK
SYAESLALFG LPLSMPRIEE TQPGKVLIVQ WFERARFELH PDGSVLLGLL GNELVGR