Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Achl_2882 |
Symbol | |
ID | 7294362 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter chlorophenolicus A6 |
Kingdom | Bacteria |
Replicon accession | NC_011886 |
Strand | - |
Start bp | 3217789 |
End bp | 3220608 |
Gene Length | 2820 bp |
Protein Length | 939 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 643591295 |
Product | peptidase S8/S53 subtilisin kexin sedolisin |
Protein accession | YP_002488935 |
Protein GI | 220913626 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG1404] Subtilisin-like serine proteases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 59 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTCAACC CCCGGCCCTT CGTTGCCGCT GCCCTCACGC TGGTCACCGT GCTCGCCGGT GCTGCCCTGA CGGCCCTTCC CGCCCAGGCC ACCGAGCCAC CCCAGCCGGC CAAGCCAACC GCGGCTGCCG AGCTGCCCCT GCGGGATCCG AGTCCACGGT CCGCTTCGGC TGCCGTGCCA ACGGACCAGT TCATTGTGGG CCTCAAGAGC CGCGGCAATA TCGCTTCCGA AGCAGCCATA ACTTCGCAGG CAGCCCGGTC AGCCGCAGGC CGCGTGGGCA CCGCCGCCCA GTACGTGCGT GCCACAGCGA CCGGCGCGCA GGTGGTGAAG ACCGACAAGT CACTTCACGG TGCCGACGCT GACACCTTCC TCGCGGCACT CCGCTCCAGC CCCGACGTCG CCTACGCCGA GCCGGACACT ATAGTTCAGG CAACCGCGGC CGACCCCAAC GATCCTGGCT ACGGGTCCCA GTGGAGCCTT TGGTACGATC CGTCCGGCAT CCGGGTATCC GGCGCTTGGG ACCACAACCG CGGCGAAGGT GCCGTAGTTG CCGTGGTGGA CAGTGGCATC ACCTACCATT CGGACCTGGC CCCCAACGTC CTCGCCGGTT ACGACATGAT GTCCAACCCG GAGTGGGCGC GCGACGGCGA CGGACGCGAT GCCAACCCGC AAGACCAGGG CGATTGGGCC AGTGACAACC AGTGCGAGCC GGGCTTCCCG GCGTCACGCT CCTCTTGGCA TGGCACGCAC GTGGCCGGAA TTATTGCTGC CGTCGGAAAC AACAACAACG GCATCACTGG CGCCGCGCCC GCTGCCAAGA TCCTGCCCAT CCGCGCGATC GGCCCCTGCG GCGGCTACAT CTCGGACGTG GCCGACTCGA TCATCTGGGC TGCGGGTGGC ACAGTGGCCG GCGTCCCTGC CAACCCCACC CGAGCCAACG TGGTGAACCT CAGCCTGGGT GGCACCGCCG CCTGCTCCAC TACGGAGCAG AACGCCATCA ACTTTGCCCA CAACGCCGGC ACAGCAGTGG TGGTTGCGGC CGGCAACTCC TCCCGCCCGG CAGCTGAAAT GAGCCCGGCC AACTGCGAGA ACGTCATCAC CGTGGCCGCC GCCGGACCCG ATGGAAGCCG CGCTCCATAT TCCAACTATG GTGCCGCCGT CGACATCACC GCACCGGGTG GCGACATGAC TGCTGATGCC TGGGACGGAA TCGTGTCCAC CTCGAACTTC GGATCCACTT TGCCCGAAGG CGAAGCCTAC GAACTGCTGC AGGGGACGTC CATGGCAGCC CCTCACGTCT CCGCCGTCGC TGCCATGCTT ATGGTTGAGG TGGGCGGCGG TTACACCCCC GACATGGTCA AAGCGCGACT CAACGCCACG GCCCGGCTCC TCCAGGGCAG CGGCTGCCCC GTAGGCTGCG GCGCCGGCTT GGTTGACGCC GCCAAGGCCT TGGCCCTGAC GGCCTCAGAC CTGCCAGCCA ATACGGTCAT CCCGGCAGCC GTGACGTTCT CCGACAAGGA CGGCACGGAT GCCGACACAG TGATAGTTCC GTCGTCACGG GGCGTGGAGT ACGTGCGGGA CGGCGCCGTG CTGGCGGCCG GGTCGCACCC CGGCTCCGGC ACTGTGACTG TCTCAGCGCG CGCAATTCCC GGCTTCTACC TGTGGGCAAG TGCCACAGCC CAGTGGAGCT ATAAGTTCGC TGCCACATAC GCGCCGGTGG CTGGATCGCT GATCAGCGTC ACGCCGTTCC GCGCCCTGGA CACCCGGAGC AGTTCCATCG TGGCGAAGGA CTCCACGGTC TCCTTCCAGG TGGCCGGGCG TAACGGAATT CCCGCCAAGG TGTCTGCCGT GGTCTTCAAC CTCACTGTGG CCGAAGCCCG GTCCTTCGGG TTTGTCACTG CCTTCGCTTC CGGGACCGCG CGTCCGGACG CATCCAACCT GAACTTCGAC AAGGGCCAGA TCGTGGCCAA CTCCGTCACT GTGCCCGTGG GCGCCGACGG AAAGGTGACA CTCTTCAACA GGTCAGCAGG CGCCACGCAC CTCATTGCCG ACATCTCCGG CTACTACCGT GAGGGCGAGG TCAAGGCGGC GGGCGCCTTC AAGTCGATCG AACCCAAGCG GTTCCTGGAT ACCCGGAGCA CGACGGCCGT CGGGCCCGAC ACTGCCCGGG CCTTCCAGGT GGCAGGAGCC AACGGATTGC CGGCCACCGT TTCCGCCGTT GTACTGAACC TGACGGTGGC GGAAGCGAAG TCCAACGGCT TCATCACCGC CTATCCCACG GGGGTCAACC GGCCGGATGC CTCCAACATC AACTTTGCCG CCGGGCAGAT CATCCCCAAC TCCGTGACCG TCCCAGTGGG CCCGGACGGG AAAGTGATGC TTTACAACCG CTCCAACGGG GCCACCCACC TGATCGCGGA CGTCTCCGGC TACTACCTGT CAGGAACCCC GACGGCGGGC GGCACCTTCC AGCCGCTGGC CGCCCCGACG CGCTTCCTGG ACACCCGAGC GGGTTACCCG CTACCGCCGG ACCTGGCAAC GTCATTTCAG GCTGCCCGCG AACACGGCAT TCCCGAAGGT GCTACCGCGA TGGTAATGAA CCTCACCGTG GCCCAGGCGA CGTCGAACGG TTTCGTCACC GCTTACCCGA TGGCGGCAAC CCGTCCGGAC ATCTCGAGCG TCAACTTCGA CAGGGGCCAG ATCGTTGCAA ACTCGGTAAC CACCCCGCTC GGCCAGCTCG GACGAGTCGC CCTGTTCAAC CGCTCAGCGG GGTCCACGCA CCTCATCGCG GACGTCTCCG GATACTTCCT TCCGGGCTGA
|
Protein sequence | MFNPRPFVAA ALTLVTVLAG AALTALPAQA TEPPQPAKPT AAAELPLRDP SPRSASAAVP TDQFIVGLKS RGNIASEAAI TSQAARSAAG RVGTAAQYVR ATATGAQVVK TDKSLHGADA DTFLAALRSS PDVAYAEPDT IVQATAADPN DPGYGSQWSL WYDPSGIRVS GAWDHNRGEG AVVAVVDSGI TYHSDLAPNV LAGYDMMSNP EWARDGDGRD ANPQDQGDWA SDNQCEPGFP ASRSSWHGTH VAGIIAAVGN NNNGITGAAP AAKILPIRAI GPCGGYISDV ADSIIWAAGG TVAGVPANPT RANVVNLSLG GTAACSTTEQ NAINFAHNAG TAVVVAAGNS SRPAAEMSPA NCENVITVAA AGPDGSRAPY SNYGAAVDIT APGGDMTADA WDGIVSTSNF GSTLPEGEAY ELLQGTSMAA PHVSAVAAML MVEVGGGYTP DMVKARLNAT ARLLQGSGCP VGCGAGLVDA AKALALTASD LPANTVIPAA VTFSDKDGTD ADTVIVPSSR GVEYVRDGAV LAAGSHPGSG TVTVSARAIP GFYLWASATA QWSYKFAATY APVAGSLISV TPFRALDTRS SSIVAKDSTV SFQVAGRNGI PAKVSAVVFN LTVAEARSFG FVTAFASGTA RPDASNLNFD KGQIVANSVT VPVGADGKVT LFNRSAGATH LIADISGYYR EGEVKAAGAF KSIEPKRFLD TRSTTAVGPD TARAFQVAGA NGLPATVSAV VLNLTVAEAK SNGFITAYPT GVNRPDASNI NFAAGQIIPN SVTVPVGPDG KVMLYNRSNG ATHLIADVSG YYLSGTPTAG GTFQPLAAPT RFLDTRAGYP LPPDLATSFQ AAREHGIPEG ATAMVMNLTV AQATSNGFVT AYPMAATRPD ISSVNFDRGQ IVANSVTTPL GQLGRVALFN RSAGSTHLIA DVSGYFLPG
|
| |