Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cagg_0023 |
Symbol | |
ID | 7269020 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chloroflexus aggregans DSM 9485 |
Kingdom | Bacteria |
Replicon accession | NC_011831 |
Strand | - |
Start bp | 34843 |
End bp | 37869 |
Gene Length | 3027 bp |
Protein Length | 1008 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 643564896 |
Product | peptidase S8/S53 subtilisin kexin sedolisin |
Protein accession | YP_002461412 |
Protein GI | 219846979 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG1404] Subtilisin-like serine proteases |
TIGRFAM ID | [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.421595 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATGAGAC GATCTACCTC ACTTATCCTC TTCATCATTG CTTACTTGGT CAGCGTACTG GTATTCACGC CGAGTACTCC AGTGCAGGCC GATCCGGCTA CACTCTCCGC TACCCCAACC AGCCTTGCAG CAACTGTCGA GCTTGGCGAT ACGGCCACCC TCTCCCTTAC CATCACCAAT ACCAGCAGCA ATTCATTAAC TCTCCTCCTC TATGCTGGCT ATCCACCAAC CGCCAGCCCG GCGCGTATGG CGCTACCATC ACTGCCGGTG CCACTACCAC AACAAGCCGA GCGGATCGAT CCCGCTTTAC AAACCGAACT GGCGCATGGC CCAACTCGCT TCCTCGTCTT CTTCGCCGAC CGACCCGACC TCGGCGCAGC GTTGTTGATT CGTGATTGGG CAGCACGCGG TGAGTACGTT TACCACACTT TGACCGAACA TGCCGAACGC AGCCAACGTG CTGTGCGTGC GATGCTCGAT GCCGCCGGTA TTCGCTATAC TCCGCTCTGG ATCGTCAACG CCTTGCTGGT TGAAGGAGAT GCAACCCTGG CCCAAGCCCT CGCTGACCAC GCCGACGTTG CTATGCTAAG CGCCGACCAC GAGCTGCAAG TAGCTCCGTC GGCATTGACA ACAGCTGTTA GTTGCAGTCC ATCTGCAAAT AACGTTTGCT GGAATATTGA TCGGATCAGA GCCGACCGCG TCTGGCGCGA GTTCGGTGTC ACCGGTGAGG GGATCACCGT CGCAAATATC GATAGCGGCG TTGCGTATAC CCATCCGGCG CTTGTTGGTC AATACCGTGG CAACCTTGGC GGTGGTGTGT TTGACCACAA TTATAACTGG TTCGACCCGG TCGGTAACAC AACCGCACCA ACCGCATCGG GTAGTCACGG TACCCACGTG ATGGGTACCA TGGTGTCTAA CCCACCCGAC CAACCGGCCA TGGGTGTAGC TCCGGGTGCC CGTTGGATCG CAGCCCGAGC CTGTGATACG CTCAACTGTA CCGATAGCAA CATCATCGCT GCGGCACAAT GGGTGTTGGC GCCAACCGAT CTTAACGGTA ACAATCCGCA ACCGAGTCGT CGTCCGCATA TCCTCAATAA TTCGTGGGCA TTTAGCAGTG GTGGTAACCC GATCTATACC GGTTATACCG CAGCCTGGAA AGCTGCCGGC ATTTTTACAA CTTTTGCCGC CGGCAATACC GGTAATACAA CGTGTAGTAC GATCGCCTCA CCCGGTGACT ACGCCGATGT CGTTGCAGTT GGCGCCATCA AGCAAGATGA TCGACTGGCC CCTTTCAGTG CTATTGGCCC GACCGGTGAC GGTCGCATCA AACCCGATCT GGTTGCGCCA GGAGTAGGCA TCTACTCAAC CGATGCTTCA ACGGGCTACA TAGCGCTCAG CGGTACCTCA ATGGCTGCCC CGCACGTAGC CGGGACAGTT GCCCTGCTTT GGTCGGCTAA TCCGCAATTG ATCGGCGATT ATGATGGGAC GTATGCCCTT CTCACGAATA CAGCATTCCC AATTACCGGG GACACCACGT TTATGGGATC AACCCATAGC GCCTGCCGTC CTATTGGTGT CCCGAACAAC ATTTACGGTT ATGGGCGGCT CGATGCCTTT GCTGCGGTAG CGGCGGCTAA GGTTGATATT CCATGGCTGA CTCTTCCGCC AACACCGACG GCAACCCTAA CAAGTAGCGG CAATACAACA CTGTCTATCA CACTCGATGC CCGCAAAGTT CCCGGACCGG GTGTCTACTC GGCACGTCTG TTGATCTACG CCAACAATCT GACCGACCCA CCGCTGGCCG TCCCGATAAC AATGACCGTG CCACCACGAC CTACACACGC GACGATTACC GGTACGGTGA CCGATAGTGA GACCGGTCGG CCATTACCGG CGACGATTAC AACGACTGAC GGTGTGCGAC TGATGACCAG CCCCACCGGA ACATATAGCT TGACGGTACC CGGCAGTTCT ACCCAACACG TGACCGCTGC TGCCGTTGGC TTTGTTACGC AAACCCAAAC GATCACGCCA AGCAATGGGA GTACGTCAAC CCTCAACTTT GCGCTCGATC CGATTCGCCC CCGCTTGACC ACATTGCAAG ATGTGATACC GGCTACCGTT GATTTTCAGC AAACCGTCAC GCTCAATTTG TCGTTGCGTA ACGATGGCAA TGCCCCCCTC GCCTACACCG TCCAGATTGA CAACGAGCCT TATGGTGTCT GGCGAAGTGA CGAACCGGAC GGGCCGAGCG GTGGTTGGAT CGATCCACCG ACCGGTAGAC AGGTGCTCAA CCTGGATGAT GATGGGAATA GTGATGCTCT CGATCTCGGC TTTGATTTTC CGTTTGGCAG CACGTTTTAC CGCCAAGTCT ACATTGGAGC AAATGGGATT ATTGCCTTCG CACCCTTCAC GACCAGCTAC TTCATTCCAT CATGCTTTCC ATTATCTGAA ACTACCTCGG CGGCGATTAG CCCACTTCAC GTCGATTTTA ATAGTCTTGA TGGCGGTGAG ATTAGCTTCG CTCAAGTGAG TAGTGGCGCA CTGATCACGT GGGATGATGT TCCGCTGTAC GGTACGACCC GCCGGCTTAG TGTGCAGGCA CTCTTGCAAC CCAATGGTGT TATTCGCTTC CATTACCGGA ATGTAGCCGA TTTGCAGCCC ACCGATCAGG CTACAATTGG CCTCCAGTTT GACGATCAAA GCCAGCATGT AGCTTGTGAT GCTGGGGATG AACTGCCACT CGATCTGAGC GATGGGTTGG TCATCGAGCT GCGACCTCAG ATCAATCCAC GGGCATGGCT CAATATAGTG TCCGGTGACA GTGGCACACT AGCCGCTTCC AGTCAGACGG ATATCCCACT CACTGCGCGA TGGGTCGGCC CTATGTATAC GACCTCACAG GCACGAGTGC AGATTCGCAG TAACGATCCG CAGAAACCGG TTGCCACTGT ACGTGTCCAA CTGAACGAAG GTACCCCTGC GCCCTATCAA GTGTTCATTC CGTTCGTATT CCGGTAA
|
Protein sequence | MMRRSTSLIL FIIAYLVSVL VFTPSTPVQA DPATLSATPT SLAATVELGD TATLSLTITN TSSNSLTLLL YAGYPPTASP ARMALPSLPV PLPQQAERID PALQTELAHG PTRFLVFFAD RPDLGAALLI RDWAARGEYV YHTLTEHAER SQRAVRAMLD AAGIRYTPLW IVNALLVEGD ATLAQALADH ADVAMLSADH ELQVAPSALT TAVSCSPSAN NVCWNIDRIR ADRVWREFGV TGEGITVANI DSGVAYTHPA LVGQYRGNLG GGVFDHNYNW FDPVGNTTAP TASGSHGTHV MGTMVSNPPD QPAMGVAPGA RWIAARACDT LNCTDSNIIA AAQWVLAPTD LNGNNPQPSR RPHILNNSWA FSSGGNPIYT GYTAAWKAAG IFTTFAAGNT GNTTCSTIAS PGDYADVVAV GAIKQDDRLA PFSAIGPTGD GRIKPDLVAP GVGIYSTDAS TGYIALSGTS MAAPHVAGTV ALLWSANPQL IGDYDGTYAL LTNTAFPITG DTTFMGSTHS ACRPIGVPNN IYGYGRLDAF AAVAAAKVDI PWLTLPPTPT ATLTSSGNTT LSITLDARKV PGPGVYSARL LIYANNLTDP PLAVPITMTV PPRPTHATIT GTVTDSETGR PLPATITTTD GVRLMTSPTG TYSLTVPGSS TQHVTAAAVG FVTQTQTITP SNGSTSTLNF ALDPIRPRLT TLQDVIPATV DFQQTVTLNL SLRNDGNAPL AYTVQIDNEP YGVWRSDEPD GPSGGWIDPP TGRQVLNLDD DGNSDALDLG FDFPFGSTFY RQVYIGANGI IAFAPFTTSY FIPSCFPLSE TTSAAISPLH VDFNSLDGGE ISFAQVSSGA LITWDDVPLY GTTRRLSVQA LLQPNGVIRF HYRNVADLQP TDQATIGLQF DDQSQHVACD AGDELPLDLS DGLVIELRPQ INPRAWLNIV SGDSGTLAAS SQTDIPLTAR WVGPMYTTSQ ARVQIRSNDP QKPVATVRVQ LNEGTPAPYQ VFIPFVFR
|
| |