Gene Information |
Locus tag | Arth_1050 |
Symbol | |
ID | 4446450 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter sp. FB24 |
Kingdom | Bacteria |
Replicon accession | NC_008541 |
Strand | + |
Start bp | 1127480 |
End bp | 1128721 |
Gene Length | 1242 bp |
Protein Length | 413 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 639688853 |
Product | peptidase S8/S53 subtilisin kexin sedolisin |
Protein accession | YP_830544 |
Protein GI | 116669611 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG1404] Subtilisin-like serine proteases |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.141106 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAAGA TCTTTGTAGC CGGTTTAGCA GCGGTAGTCA TGGGCCTCGG CGGAATTGCC GCGGCAGTTC CGAGTAGCGC GGCCGCCGAG CCGTCACCCG CCGCGGGCCA GATCATGGTG AAATTCCGTG ACCCCGGCGC CGCGGGAGGC GCTCTGCGCC AGCATGGCCT CAGCGAGGGC CCTGGCATCG GCAGTACCGG CGCCCAGCTC ATCAAAGTAC CGGCCGGCAA AGAACTTCAA CTCATTGACG CCCTGAGCCG GAACCCGGCA GTTGAGTATG CCGAGCCGGA CGAGCTTGTC ACTGCCGCTA CGGCGGACGA GTATTTCCCC CGCCAGTACG CCCTGCAGAA CAACGGCCAG TCGTACACCA ACACCGCCGG CACCCTGTCA GTCGCCGGAG GCACAGCAGA CGCCGATGTG GACGCCGTCG AAGCGTGGTC CGTCACGACC GGCAGCGGCA TAAGAGTTGC CGTTCTCGAT TCAGGTGTCG CTAGCGACAA CCCCGACATC AACCCGAAGG TTGTTGCACG CGGCAACTTC AGCGGCGCGG CCACCAACGA AGACAACTAC GGCCACGGCA CCCACGTCGC CGGCATCGTT GCCGCCACTG CCAACAACAC GATGGGTGTG GCGGGCGTGT GCCCGGGGTG CACGATCCTG GCCGGAAAAG TTCTCAACGA CAGCGGGATT GGATCCAGTT CGGGCCTTGC GAACGGCATC AACTGGGCGG TCAGCAACGG CGCGAAGGTG ATCAACATGA GCATCGGAGT GCGGGCGTCA CGCACTCTCG AAACGGCCGT CAATAACGCC TGGGGCAAGG GCGTGGTGCT CGTTGCCGCA GCAGGCAACG GCGGCAACCA GACCAAGATC TACCCGGGCG CCTACCCCAA CGTCATTGCC GTCGCCGCAA CCGATAACAA CGACGCCAAG GCATCCTTCT CCACCTACGG AGCCAGTTGG GTGGACGTCG CGGCGCCCGG AGTCAACGTC TATTCAACGT TCCCGAACCA CACCTTCGTC CTTGGGACGC AGAACAACCG CTCATTCGGT TATGACGTCG GCAACGGGAC CTCAATGTCC TCACCCATCG TCGCTGCCAC GGCCGCGCTT GCCTGGAGCT CGCATCCTGG CGCCACGCAA ACCTCAGTCC GCGCAAATAT CGAATCAACC GCTGACAAGA TTTCCGGTAC GGGCACGTAC TGGGCTTATG GCCGCGTGAA CGCGGACCGG GCCGTCCGTT AG
|
Protein sequence | MKKIFVAGLA AVVMGLGGIA AAVPSSAAAE PSPAAGQIMV KFRDPGAAGG ALRQHGLSEG PGIGSTGAQL IKVPAGKELQ LIDALSRNPA VEYAEPDELV TAATADEYFP RQYALQNNGQ SYTNTAGTLS VAGGTADADV DAVEAWSVTT GSGIRVAVLD SGVASDNPDI NPKVVARGNF SGAATNEDNY GHGTHVAGIV AATANNTMGV AGVCPGCTIL AGKVLNDSGI GSSSGLANGI NWAVSNGAKV INMSIGVRAS RTLETAVNNA WGKGVVLVAA AGNGGNQTKI YPGAYPNVIA VAATDNNDAK ASFSTYGASW VDVAAPGVNV YSTFPNHTFV LGTQNNRSFG YDVGNGTSMS SPIVAATAAL AWSSHPGATQ TSVRANIEST ADKISGTGTY WAYGRVNADR AVR
|
| |
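The length fields in this record can be cross-checked against the coordinates and the sequence. The following is a minimal sketch, not part of the original record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the unspliced CDS ends in a single stop codon, which is not translated. The function names `gene_length`, `protein_length`, and `gc_fraction` are hypothetical helpers introduced here for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


def gc_fraction(sequence: str) -> float:
    """Fraction of G and C bases; whitespace between blocks is ignored."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


if __name__ == "__main__":
    # Values copied from the Gene Information table for Arth_1050.
    length = gene_length(1127480, 1128721)
    print(length)                  # 1242, matching "Gene Length"
    print(protein_length(length))  # 413, matching "Protein Length"
    # gc_fraction() can be applied to the full "Gene sequence" field as
    # pasted, since the spaces between 10-base blocks are stripped.
    print(gc_fraction("ATGAAAAAGA TCTTTGTAGC"))
```

Passing the complete Gene sequence string to `gc_fraction` gives a value that can be compared with the reported GC content of 64%.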