Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Arth_2875 |
Symbol | |
ID | 4444708 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter sp. FB24 |
Kingdom | Bacteria |
Replicon accession | NC_008541 |
Strand | + |
Start bp | 3234241 |
End bp | 3236196 |
Gene Length | 1956 bp |
Protein Length | 651 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 639690698 |
Product | peptidase S8/S53 subtilisin kexin sedolisin |
Protein accession | YP_832354 |
Protein GI | 116671421 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG1404] Subtilisin-like serine proteases |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.311143 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCCGATC CGAGAAAGAA GCTCCTTTTG GCAAAAGCAG CAAGGGCACC AATCAGGCGC CATCGCATAT TGGCGCTGTC GCTCGCGGCA GCAGTCGGCA CCCTGGCCTT CGCGAGTTCA CCGGTCTTCG CCACCACCGG AACCGGAGAC GGTTCCTCCC CAGGAGCCGC CGCGGCCCAA AGCCCCCAGG GGCAGGCTGT GGCCGCAGGC GAGGCCTATA CAAAGTTCAT CGTGGACTAC AAGGAAAGTG CGGCCAACTC CACGCCTAAC GGCCGCGCCA ACGCCTGGGG CAAGGCAGCC AACGCGCAGG GCGTGACAGT CAAGGAAGTC CGCACCCTGG CCACCGGCGG AACGCTGATC GAGGCGGACA AGGCCCTGGC CGGGCAGGCG GCCAAGGACT TCATGGCCGG GATTGCGGCC TCGGGGTCGG TCGAATCCGT AGAGCCCGAC GCGCGGATGA CAGTGGCGCT CACACCCAAC GACCCGCGCT ATGGCGAGCA GTGGGACTTC ACCGCCACGA ACGGCATGCG GATTCCCGGT GCGTGGGACG TCGCCACCGG CACCGGCGTG ACAGTGGCCG TCATCGACAC CGGCATCACT GCCCACCCCG ACCTGGACGC CAACGTGCTC CCCGGTTACG ACTTTGTCTC GGACGCCACC GCTGCACGGG ACGGCAACGG CCGCGATGCC AACGCGCAGG ACCAGGGCGA CTGGTACGCC GCCGGCGAGT GCGGCCAGAC AACGGCCGGC AACAGCTCCT GGCACGGCAC CCACGTGGCC GGCACCGTGG CAGCCGTGAC CGGCAACGCG ACCGGAGTGG CCGGCGTGGC ACCCAATGCC AAAGTGGTCC CGGTCCGCGT CCTCGCCAAG TGCGGCGGTT CCCTGTCCGA CATCGCCGAC GCGATCATCT GGTCAGCCGG CGGAACCGTC TCGGGCATCC CGGCCAACGC CAACCCGGCA AAGGTCATCA ACATGAGCCT GGGCGGATCC GGCTCCTGCG GCACCACCTA CCAAGCGGCC ATCGATTCCG CAGTTTCGCG GGGAGCCACC GTGGTGGTTG CCGCCGGCAA CAGCAACCAG GATGCTTCCG GTTTCCGCCC GGCGAACTGC AACAACGTTG TGAGTGTGGC CGCGAGCAAC CCGGGCGGCA GCCTCTCCTA CTACTCCAAC TACGGCGCCA CGGTTGACCT GACCGCACCG GGCGGAGACG TCAGGGTGAC CGGCGGCGGG ATCCTCTCCA CCATCAATAC CGGCACCACC ACGCCGTCTT CGGCGGGCTA CGCCAACTAC CAAGGGACAT CCATGGCGGC CCCGCACGTG GCCGGGCTAG CTGCACTTAT GAAATCCAAA ACCTCCTCCC TCACCCCGGC CCAGGTCGAG TCCACGCTCA AGCAGGGAAC CCGCGCGATG CCGGGCGGCT GCACCACCGG CTGCGGCGCC GGCCTCTCTG ACGCCACCAA GACCATGGGA CTGCTGGGCG GGACCACTCC CCCGCCGTCA GGCAACCTCC TCCTGAACCC CGGCTTCGAA GAGGGCGCCG CCTCATGGAC CTCTGACCAC GCCGATACCT TCGAAACCGG TACAAATGCC CGGACCGGTT CCCGCTTCGC CGGACTCAAC GGCTGGGGCC AGGCGACGTC CTACAAGCTG GACCAGGCTT TCGCAGTCCC CTCCACCGTG TCCGCCGCGT CGCTGTCCTT CTACTTGAAA GTGCAGTCCG ACGAAACGAC TGCCAGCTCG GCCTATGACA CCCTCAAAGT CCAGATGATC AGCGGCGGCA CCACCACCAC GTTGGCCACG TACTCCAACC TCAACGAGTC CACCGGCTAC GTGCAGAAGC AGCTGGATCT TTCCGCCTAC AAGGGCAAGA GCGTGACCTT GCGCTTCCTG GGTGTCGAGG ATTCATCGCT GTCGACGTAC TTCTACCTTG ATGACACGTC CGTGACCACG TCCTAG
|
Protein sequence | MPDPRKKLLL AKAARAPIRR HRILALSLAA AVGTLAFASS PVFATTGTGD GSSPGAAAAQ SPQGQAVAAG EAYTKFIVDY KESAANSTPN GRANAWGKAA NAQGVTVKEV RTLATGGTLI EADKALAGQA AKDFMAGIAA SGSVESVEPD ARMTVALTPN DPRYGEQWDF TATNGMRIPG AWDVATGTGV TVAVIDTGIT AHPDLDANVL PGYDFVSDAT AARDGNGRDA NAQDQGDWYA AGECGQTTAG NSSWHGTHVA GTVAAVTGNA TGVAGVAPNA KVVPVRVLAK CGGSLSDIAD AIIWSAGGTV SGIPANANPA KVINMSLGGS GSCGTTYQAA IDSAVSRGAT VVVAAGNSNQ DASGFRPANC NNVVSVAASN PGGSLSYYSN YGATVDLTAP GGDVRVTGGG ILSTINTGTT TPSSAGYANY QGTSMAAPHV AGLAALMKSK TSSLTPAQVE STLKQGTRAM PGGCTTGCGA GLSDATKTMG LLGGTTPPPS GNLLLNPGFE EGAASWTSDH ADTFETGTNA RTGSRFAGLN GWGQATSYKL DQAFAVPSTV SAASLSFYLK VQSDETTASS AYDTLKVQMI SGGTTTTLAT YSNLNESTGY VQKQLDLSAY KGKSVTLRFL GVEDSSLSTY FYLDDTSVTT S
|
| |