Gene Information |
Locus tag | Dshi_3421 |
Symbol | apr |
ID | 5712479 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dinoroseobacter shibae DFL 12 |
Kingdom | Bacteria |
Replicon accession | NC_009952 |
Strand | - |
Start bp | 3601513 |
End bp | 3603003 |
Gene Length | 1491 bp |
Protein Length | 496 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 641269350 |
Product | subtilisin DY |
Protein accession | YP_001534755 |
Protein GI | 159045961 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG1404] Subtilisin-like serine proteases |
TIGRFAM ID | |
|
|
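The coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a short sketch. The function names are illustrative and not part of the source database. The sketch assumes 1-based inclusive coordinates and a CDS whose final codon is a stop codon.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(3601513, 3603003)
print(length)                  # 1491
print(protein_length(length))  # 496
```

Both values match the Gene Length and Protein Length fields of this record.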
Plasmid Coverage information |
Num covering plasmid clones | 39 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 51 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGCCAT TCAAGGCGCT TCTGGCGCTG GTTCTTGTCG GGCTCTTATG GTCGTCGCCG TCACAGGTGG CGGCCCAAAC TGCGCCTGCA TTGCCTGAAT GCGTCAATCC CGGCGACCTG ATCACGATAC CGCGGGTGCG CAACCCCCCG CCGATCCGGG CGACCTTGTT CCTCGATTTC GGCGATTTCC GGGTGCGCGT GCTGATCCGG AACGTGACCG CGTCCAGTTT CAGTTTCCGC ATGCCCCGCC TGCAGCGCAC TCCCCGAAAC GGAGAGTTCG AACTGGTCGT TCGGCTGCTG CTGGGGGTGG AACGGACGCT GCGCACGGGC CGCATTTGTC AGGGCAGGCT CTTGGAAGAG GCCTTGGACC GTATTCCGGA TGTTCCAATT CGCCCCGCCA CCGGCAACGA GGTGGCCGCT CCCAGTGGGG GTCCGGAATA TGTCCTCGCG GGTACGGACC AGGAGATCGC CCGGGCGCGT GTCGTGCTGC GTGGGGCGCG GGCGCAGATC TTGCGCTCCC AGCGGCTGGG CTCTCTTGGT CAGAGCCTTC TCTTTGTGGA TCTCGCCGGG GCACTCACCG AAGCGCAGGC TCGCGCGCTG CTTGCTCGCG AAGGCATACG TTCGGCCATC GGGACGCATA CGGTCTACAG TTTGTCTCAG TCCAGTGGCG GGCGCGCCGG ACTACGCCTG TTCGCGACAG CCCTGGTGCG GCCCGATCCG GGGCGCAACT GTACCCTGAC CCGCCCCGTG CGGGTCGGTC TGATTGACGG ACCCCTCGAT CTGCGCACGC CGTCCCTGAC CAATGTACGG GTGACCAGCC TGTCGGTTCT CAGACCGCGG GAGCGGCCCG GTTCTACCGC CCATGGCACG GGGATTGCCG CCCTGATCGC GGGCCAGGCC ACCACGCAAG GTCCGGCGGG TCTCGCACCT GGGGCGGAGT TACTCTCGGT GGTGGCGTTT GCACGGGCGG GGGGGCGCGA CCTCGCCCGG CTCGAAAATA TCGCACTCGG TCTCGATTGG CTGGTCGAGC GGGGTGCAGA CGTGGTCAAC ATGTCGCTTG CTGGCCCCCC GAACGAGGCG TTGGCTGCCC TGGTGGAGAT CGCCGATCAG CAGGGACTGA TCATGGTGGC CGCCGCCGGC AACCGGGGCG AGCCGTCCCT GGGATATCCT GCCGCCGATC CGCGCGTTCT TGCGATTACG GCGATCGATG CGGACAAACG GATCTACCGC CGGGCCAGTT TCGGGGCGGG TATGGATTTC TCGGCGCCGG GCGTCGATAT CGCGGTGCCG GATCGGCGTG GCTGGTCCTA TCGCTCGGGC ACGTCCTACG CCGCGGCAGT CGCCACCGGG CTTGTGGCGC AGAAGCTGGC GCAGCAGCGG CTGACTACGG ATCAGTTGCG TGCAAGCTTT CGGCGCAGTG CCGAGGACCT CGGGCCTTCG GGATATGACC CCCGATTTGG TTGGGGTCTC ATGCGCGGCG ACCCCTGCTA A
|
Protein sequence | MKPFKALLAL VLVGLLWSSP SQVAAQTAPA LPECVNPGDL ITIPRVRNPP PIRATLFLDF GDFRVRVLIR NVTASSFSFR MPRLQRTPRN GEFELVVRLL LGVERTLRTG RICQGRLLEE ALDRIPDVPI RPATGNEVAA PSGGPEYVLA GTDQEIARAR VVLRGARAQI LRSQRLGSLG QSLLFVDLAG ALTEAQARAL LAREGIRSAI GTHTVYSLSQ SSGGRAGLRL FATALVRPDP GRNCTLTRPV RVGLIDGPLD LRTPSLTNVR VTSLSVLRPR ERPGSTAHGT GIAALIAGQA TTQGPAGLAP GAELLSVVAF ARAGGRDLAR LENIALGLDW LVERGADVVN MSLAGPPNEA LAALVEIADQ QGLIMVAAAG NRGEPSLGYP AADPRVLAIT AIDADKRIYR RASFGAGMDF SAPGVDIAVP DRRGWSYRSG TSYAAAVATG LVAQKLAQQR LTTDQLRASF RRSAEDLGPS GYDPRFGWGL MRGDPC
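The sequences above are displayed in space-separated blocks of ten characters. A minimal sketch of how such a field could be cleaned, translated, and summarized follows. The helper names are hypothetical, and the codon table is the standard elongation code, which translation table 11 shares; the alternative start codons of table 11 are not handled, which is harmless here because the gene starts with ATG.

```python
from itertools import product

# Standard codon table in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the display spacing and normalize to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence in this record.
prefix = "ATGAAGCCAT TCAAGGCGCT TCTGGCGCTG"
print(translate(prefix))          # MKPFKALLAL
print(gc_fraction("ATGAAGCCAT"))  # 0.4
```

Applied to the full Gene sequence field, the same helpers give values that can be compared against the Protein sequence and GC content fields of this record.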