Gene Cagg_0023 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_0023 
Symbol 
ID7269020 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp34843 
End bp37869 
Gene Length3027 bp 
Protein Length1008 aa 
Translation table11 
GC content56% 
IMG OID643564896 
Productpeptidase S8/S53 subtilisin kexin sedolisin 
Protein accessionYP_002461412 
Protein GI219846979 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1404] Subtilisin-like serine proteases 
TIGRFAM ID[TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.421595 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATGAGAC GATCTACCTC ACTTATCCTC TTCATCATTG CTTACTTGGT CAGCGTACTG 
GTATTCACGC CGAGTACTCC AGTGCAGGCC GATCCGGCTA CACTCTCCGC TACCCCAACC
AGCCTTGCAG CAACTGTCGA GCTTGGCGAT ACGGCCACCC TCTCCCTTAC CATCACCAAT
ACCAGCAGCA ATTCATTAAC TCTCCTCCTC TATGCTGGCT ATCCACCAAC CGCCAGCCCG
GCGCGTATGG CGCTACCATC ACTGCCGGTG CCACTACCAC AACAAGCCGA GCGGATCGAT
CCCGCTTTAC AAACCGAACT GGCGCATGGC CCAACTCGCT TCCTCGTCTT CTTCGCCGAC
CGACCCGACC TCGGCGCAGC GTTGTTGATT CGTGATTGGG CAGCACGCGG TGAGTACGTT
TACCACACTT TGACCGAACA TGCCGAACGC AGCCAACGTG CTGTGCGTGC GATGCTCGAT
GCCGCCGGTA TTCGCTATAC TCCGCTCTGG ATCGTCAACG CCTTGCTGGT TGAAGGAGAT
GCAACCCTGG CCCAAGCCCT CGCTGACCAC GCCGACGTTG CTATGCTAAG CGCCGACCAC
GAGCTGCAAG TAGCTCCGTC GGCATTGACA ACAGCTGTTA GTTGCAGTCC ATCTGCAAAT
AACGTTTGCT GGAATATTGA TCGGATCAGA GCCGACCGCG TCTGGCGCGA GTTCGGTGTC
ACCGGTGAGG GGATCACCGT CGCAAATATC GATAGCGGCG TTGCGTATAC CCATCCGGCG
CTTGTTGGTC AATACCGTGG CAACCTTGGC GGTGGTGTGT TTGACCACAA TTATAACTGG
TTCGACCCGG TCGGTAACAC AACCGCACCA ACCGCATCGG GTAGTCACGG TACCCACGTG
ATGGGTACCA TGGTGTCTAA CCCACCCGAC CAACCGGCCA TGGGTGTAGC TCCGGGTGCC
CGTTGGATCG CAGCCCGAGC CTGTGATACG CTCAACTGTA CCGATAGCAA CATCATCGCT
GCGGCACAAT GGGTGTTGGC GCCAACCGAT CTTAACGGTA ACAATCCGCA ACCGAGTCGT
CGTCCGCATA TCCTCAATAA TTCGTGGGCA TTTAGCAGTG GTGGTAACCC GATCTATACC
GGTTATACCG CAGCCTGGAA AGCTGCCGGC ATTTTTACAA CTTTTGCCGC CGGCAATACC
GGTAATACAA CGTGTAGTAC GATCGCCTCA CCCGGTGACT ACGCCGATGT CGTTGCAGTT
GGCGCCATCA AGCAAGATGA TCGACTGGCC CCTTTCAGTG CTATTGGCCC GACCGGTGAC
GGTCGCATCA AACCCGATCT GGTTGCGCCA GGAGTAGGCA TCTACTCAAC CGATGCTTCA
ACGGGCTACA TAGCGCTCAG CGGTACCTCA ATGGCTGCCC CGCACGTAGC CGGGACAGTT
GCCCTGCTTT GGTCGGCTAA TCCGCAATTG ATCGGCGATT ATGATGGGAC GTATGCCCTT
CTCACGAATA CAGCATTCCC AATTACCGGG GACACCACGT TTATGGGATC AACCCATAGC
GCCTGCCGTC CTATTGGTGT CCCGAACAAC ATTTACGGTT ATGGGCGGCT CGATGCCTTT
GCTGCGGTAG CGGCGGCTAA GGTTGATATT CCATGGCTGA CTCTTCCGCC AACACCGACG
GCAACCCTAA CAAGTAGCGG CAATACAACA CTGTCTATCA CACTCGATGC CCGCAAAGTT
CCCGGACCGG GTGTCTACTC GGCACGTCTG TTGATCTACG CCAACAATCT GACCGACCCA
CCGCTGGCCG TCCCGATAAC AATGACCGTG CCACCACGAC CTACACACGC GACGATTACC
GGTACGGTGA CCGATAGTGA GACCGGTCGG CCATTACCGG CGACGATTAC AACGACTGAC
GGTGTGCGAC TGATGACCAG CCCCACCGGA ACATATAGCT TGACGGTACC CGGCAGTTCT
ACCCAACACG TGACCGCTGC TGCCGTTGGC TTTGTTACGC AAACCCAAAC GATCACGCCA
AGCAATGGGA GTACGTCAAC CCTCAACTTT GCGCTCGATC CGATTCGCCC CCGCTTGACC
ACATTGCAAG ATGTGATACC GGCTACCGTT GATTTTCAGC AAACCGTCAC GCTCAATTTG
TCGTTGCGTA ACGATGGCAA TGCCCCCCTC GCCTACACCG TCCAGATTGA CAACGAGCCT
TATGGTGTCT GGCGAAGTGA CGAACCGGAC GGGCCGAGCG GTGGTTGGAT CGATCCACCG
ACCGGTAGAC AGGTGCTCAA CCTGGATGAT GATGGGAATA GTGATGCTCT CGATCTCGGC
TTTGATTTTC CGTTTGGCAG CACGTTTTAC CGCCAAGTCT ACATTGGAGC AAATGGGATT
ATTGCCTTCG CACCCTTCAC GACCAGCTAC TTCATTCCAT CATGCTTTCC ATTATCTGAA
ACTACCTCGG CGGCGATTAG CCCACTTCAC GTCGATTTTA ATAGTCTTGA TGGCGGTGAG
ATTAGCTTCG CTCAAGTGAG TAGTGGCGCA CTGATCACGT GGGATGATGT TCCGCTGTAC
GGTACGACCC GCCGGCTTAG TGTGCAGGCA CTCTTGCAAC CCAATGGTGT TATTCGCTTC
CATTACCGGA ATGTAGCCGA TTTGCAGCCC ACCGATCAGG CTACAATTGG CCTCCAGTTT
GACGATCAAA GCCAGCATGT AGCTTGTGAT GCTGGGGATG AACTGCCACT CGATCTGAGC
GATGGGTTGG TCATCGAGCT GCGACCTCAG ATCAATCCAC GGGCATGGCT CAATATAGTG
TCCGGTGACA GTGGCACACT AGCCGCTTCC AGTCAGACGG ATATCCCACT CACTGCGCGA
TGGGTCGGCC CTATGTATAC GACCTCACAG GCACGAGTGC AGATTCGCAG TAACGATCCG
CAGAAACCGG TTGCCACTGT ACGTGTCCAA CTGAACGAAG GTACCCCTGC GCCCTATCAA
GTGTTCATTC CGTTCGTATT CCGGTAA
 
Protein sequence
MMRRSTSLIL FIIAYLVSVL VFTPSTPVQA DPATLSATPT SLAATVELGD TATLSLTITN 
TSSNSLTLLL YAGYPPTASP ARMALPSLPV PLPQQAERID PALQTELAHG PTRFLVFFAD
RPDLGAALLI RDWAARGEYV YHTLTEHAER SQRAVRAMLD AAGIRYTPLW IVNALLVEGD
ATLAQALADH ADVAMLSADH ELQVAPSALT TAVSCSPSAN NVCWNIDRIR ADRVWREFGV
TGEGITVANI DSGVAYTHPA LVGQYRGNLG GGVFDHNYNW FDPVGNTTAP TASGSHGTHV
MGTMVSNPPD QPAMGVAPGA RWIAARACDT LNCTDSNIIA AAQWVLAPTD LNGNNPQPSR
RPHILNNSWA FSSGGNPIYT GYTAAWKAAG IFTTFAAGNT GNTTCSTIAS PGDYADVVAV
GAIKQDDRLA PFSAIGPTGD GRIKPDLVAP GVGIYSTDAS TGYIALSGTS MAAPHVAGTV
ALLWSANPQL IGDYDGTYAL LTNTAFPITG DTTFMGSTHS ACRPIGVPNN IYGYGRLDAF
AAVAAAKVDI PWLTLPPTPT ATLTSSGNTT LSITLDARKV PGPGVYSARL LIYANNLTDP
PLAVPITMTV PPRPTHATIT GTVTDSETGR PLPATITTTD GVRLMTSPTG TYSLTVPGSS
TQHVTAAAVG FVTQTQTITP SNGSTSTLNF ALDPIRPRLT TLQDVIPATV DFQQTVTLNL
SLRNDGNAPL AYTVQIDNEP YGVWRSDEPD GPSGGWIDPP TGRQVLNLDD DGNSDALDLG
FDFPFGSTFY RQVYIGANGI IAFAPFTTSY FIPSCFPLSE TTSAAISPLH VDFNSLDGGE
ISFAQVSSGA LITWDDVPLY GTTRRLSVQA LLQPNGVIRF HYRNVADLQP TDQATIGLQF
DDQSQHVACD AGDELPLDLS DGLVIELRPQ INPRAWLNIV SGDSGTLAAS SQTDIPLTAR
WVGPMYTTSQ ARVQIRSNDP QKPVATVRVQ LNEGTPAPYQ VFIPFVFR