Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Arth_1052 |
Symbol | |
ID | 4446452 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter sp. FB24 |
Kingdom | Bacteria |
Replicon accession | NC_008541 |
Strand | + |
Start bp | 1130060 |
End bp | 1131331 |
Gene Length | 1272 bp |
Protein Length | 423 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 639688855 |
Product | peptidase S8/S53 subtilisin kexin sedolisin |
Protein accession | YP_830546 |
Protein GI | 116669613 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG1404] Subtilisin-like serine proteases |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.223191 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAACAGA CCCTGGTAGC TAGTTTCGCA GCAGTAGTCA TGGGCTTCAG CGGAATTGCC GCAGCGGTCC CAAGCAATGC GGCCCCAAGC AACGCAGCTG CGCAGTCATC ACATATCGCG GGACAAATCA TGGTTAAATT CCGCGACGAT GGCGCGGCGG CCGGTGTGCT TCGCCAGCAT GGGCTCAAGG TGGGTTCCGG TATCGGCAGC ACCGGCGCTC AACTCATCAA GGTGCCGGCA GGCAAAGAAT TGACCCTCAT CGAAGCCCTA AACCGGAACC CGGCGGTTGA ATACGCCGAA CCGGACGAGA TCGCGACTGC CGATACCGAT GACCCGTTTT TCCCCCGCCA ATACGCGCTG CAGAACGACG GCCAATCATT TACCAACACC CTCAGCACGA TAACTGTTGC TAAGGGCACG GTGGACGCTG ATGTGGACGC CGTCGAAGCG TGGAGCATCA CTAAAGGCAG GGACACCCGA GTCGCCATTA TCGACTCAGG CGTTGCAAAT GACCACGAGG ATATTTCGGA GAAGGTCGTT GCGCGGATCA ACTTCAGTGA TGCGGCAACC GGCGACGACA AATACGGCCA CGGCACCCAT GTGGCCGGGA TCGTTGCCGC GATCGCCGGC AACGGCAAGG GTGTCGCCGG CGTGTGCCCG GAGTGCACCA TCCTGGACGC CAAAGTGCTC AACGACAACG GGTCCGGTTC CACCTCGGCC ATTGCCAAGG GCATCGACTG GGCCGTGAAC AATGGTGCCA GGGTGATCAA CATGAGCCTT GGAATGCGCG TCTCGTCACG CACGCTCGAG GCGGCCGTCA ACAACGCTTG GAACCGGGGT GTGGTGCTGG TGGCCGCGGC GGGCAACGCC GGTACTCCGG CCCAGATCTA CCCGGGCGCC TACTCTAACG TCATTGCCGT GGCGGCAACA GATAACAATG ACGACAAGGC ATCGTTCTCC AGCTACGGTT CCAAGTGGGT GGATATCGCG GCGCCGGGTG TCAACGTCTA CTCGACCTTC CCGGTCCGCC CCTTCGTCCT GGGTACGCAA AACGGCCGGT CCATGGGCTA TGACATCGCC AGCGGCACCT CAATGGCCTC GCCGATCGTG GCCGCCACTG CCGCTCTCCT CTGGAGCACG CAGACCTGCC CTACGAACGC TGACGTCCGG GCAAAGGTCC TGTCCACCAC GGAGCGAAAG CCCGGCACTG AAACCTTCTG GGCGAACGGC CGAGTGAACG CCTTCAAGGC CGTCGACGGG TCCTGCTCCT AA
|
Protein sequence | MKQTLVASFA AVVMGFSGIA AAVPSNAAPS NAAAQSSHIA GQIMVKFRDD GAAAGVLRQH GLKVGSGIGS TGAQLIKVPA GKELTLIEAL NRNPAVEYAE PDEIATADTD DPFFPRQYAL QNDGQSFTNT LSTITVAKGT VDADVDAVEA WSITKGRDTR VAIIDSGVAN DHEDISEKVV ARINFSDAAT GDDKYGHGTH VAGIVAAIAG NGKGVAGVCP ECTILDAKVL NDNGSGSTSA IAKGIDWAVN NGARVINMSL GMRVSSRTLE AAVNNAWNRG VVLVAAAGNA GTPAQIYPGA YSNVIAVAAT DNNDDKASFS SYGSKWVDIA APGVNVYSTF PVRPFVLGTQ NGRSMGYDIA SGTSMASPIV AATAALLWST QTCPTNADVR AKVLSTTERK PGTETFWANG RVNAFKAVDG SCS
|
| |