Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acry_1317 |
Symbol | |
ID | 5159825 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Acidiphilium cryptum JF-5 |
Kingdom | Bacteria |
Replicon accession | NC_009484 |
Strand | + |
Start bp | 1471857 |
End bp | 1473383 |
Gene Length | 1527 bp |
Protein Length | 508 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 640553233 |
Product | protease Do |
Protein accession | YP_001234447 |
Protein GI | 148260320 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | [TIGR02037] periplasmic serine protease, Do/DeqQ family |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.278198 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAACTGT TCCTCGCCCG CCGCCGGCCC GGGCAGCTCG CTGCCCTCAC CGCCGTCATG CTGGCCGGCG GCACGCTTGC CGCGATTTCG CTCGACAGCG CCTTCGCCAA CGACAAGGCG TTCGACGCCA CGTCGAAAGT CCAGAAGGAC TTCAAGCCGA TCCCGAGCTT CGCCCCCCTG GTCAAGGACG TCAGCCCCGC GGTCGTCTCG GTGACCGTGC ATCTCAAGGT CCAGCAGGCC GACAACACGC AGGCGCAGAA CGGCATGCCG CCCGGCATGC CCTTCGCCTT CCCCTTCCCC TTTCCGCAGC CGCAGCAGCC CCAGGCCGTG GAAGCCAAGG GCTCGGGCTT CTTCATCTCG TCCGACGGCT ACATCGTGAC CAACAATCAC GTGGTGAAGA ACGCGAAGTC GGTCTTCGTC ACGCTGTCCG ACGGATCGAA GCTGCCGGCC AAGATCGTCG GCACCGACCC GAGCACCGAT CTCGCGGTGC TCAAGGTCAA GCGCGACAAG CCCTTCCCCT ACCTGCAGCT CGGCGACTCG GCGAAGGTCG TGCCCGGCCA GTGGGTGATC GCGATCGGCA ACCCGTTCGG CCTCGCCGAA ACGGTGACGA CCGGCGTCGT CTCCGCCCTC GGCCGCGACA TCGGCGACGG CCAGTACGAC AGCTTCATCC AGATCGACGC GCCGATCAAC GAGGGCAATT CCGGCGGCCC GCTGCTCAAC CAGCGCGGCG AGGTCATCGG CGTGAACACC GCGATCCTCA CGCCGTCCGG CGGCTCGGTC GGGATCGGCT TCTCGATCCC CTCCGACATG GTCAGGCGGA TCGCCGACGA GCTGATCAAG TCCGGCCACG TCACCCGCGG CTTCATCGGC GTGCAGGTGC AGACGATCAC GCCGGAAATG GCCCAGGCCA TGGGCGTTCC CGTGCATGAC GGCCGCGCCG ACGGCGCGCT GATCGCCGAG ACCATGCCGA ACGGGCCGGC GGCCAAGGCC GGCCTGAAGC CCGGCGACAT CATCACCAAG GTCGATGGCA AGATGGTGCG CGACCCGCGC GAACTCGCCC TCGCCATCTC CGGCATCAAG CCGGACGGCA AGGCCAGCAT CACCTATCTG CGCGGCGGCG CGTCGCACGA GCTGAACCTG CGCGTCGAGA AGATGCCGGC CAATGCCGAG GCGGCGTTCG CGCCGGGCGG CAGCCAGAGC GGCCCGGCCA TGCACAAGCC GGAACTCGGC CTCTCCCTCG CCCCGCTGAG CGATGCCGCC CGTCAGCAGC TCAACCTGCC GGACAATGTC TCGGGCGCGC TGATCGCCCA TGTCGCGCCG AACTCCCCGG CCGACGAGGC CGGGCTGCGC TCGGGTGACG TCATCGTCGG CGTCGGCAGC ATGACGGTGA ACAACCCCGA CCAGGCGGTC GCCGCGATCC GCAAGGCCGA AGCCGCGAAG GCGAAGGCGA TCGCGCTGCG GGTGATGCGC GGCAACCAGG CCCTGTTCGT CGCGGTGCCG CTGCCCAAGG AAAAGGCCGG CAAGTAA
|
Protein sequence | MKLFLARRRP GQLAALTAVM LAGGTLAAIS LDSAFANDKA FDATSKVQKD FKPIPSFAPL VKDVSPAVVS VTVHLKVQQA DNTQAQNGMP PGMPFAFPFP FPQPQQPQAV EAKGSGFFIS SDGYIVTNNH VVKNAKSVFV TLSDGSKLPA KIVGTDPSTD LAVLKVKRDK PFPYLQLGDS AKVVPGQWVI AIGNPFGLAE TVTTGVVSAL GRDIGDGQYD SFIQIDAPIN EGNSGGPLLN QRGEVIGVNT AILTPSGGSV GIGFSIPSDM VRRIADELIK SGHVTRGFIG VQVQTITPEM AQAMGVPVHD GRADGALIAE TMPNGPAAKA GLKPGDIITK VDGKMVRDPR ELALAISGIK PDGKASITYL RGGASHELNL RVEKMPANAE AAFAPGGSQS GPAMHKPELG LSLAPLSDAA RQQLNLPDNV SGALIAHVAP NSPADEAGLR SGDVIVGVGS MTVNNPDQAV AAIRKAEAAK AKAIALRVMR GNQALFVAVP LPKEKAGK
|
| |