Gene Arth_2875 details


Gene Information

Locus tag: Arth_2875
Symbol:
ID: 4444708
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Arthrobacter sp. FB24
Kingdom: Bacteria
Replicon accession: NC_008541
Strand:
Start bp: 3234241
End bp: 3236196
Gene Length: 1956 bp
Protein Length: 651 aa
Translation table: 11
GC content: 67%
IMG OID: 639690698
Product: peptidase S8/S53 subtilisin kexin sedolisin
Protein accession: YP_832354
Protein GI: 116671421
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG1404] Subtilisin-like serine proteases
TIGRFAM ID:
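
The Start bp, End bp, Gene Length and Protein Length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the reported gene length includes the stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Length in aa of the product, assuming an unspliced CDS whose
    last codon is a stop codon (which encodes no residue)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


length_bp = gene_length(3234241, 3236196)
print(length_bp)                  # 1956
print(protein_length(length_bp))  # 651
```

Both values agree with the record: 1956 bp for the gene and 651 aa for the protein.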


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.311143
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCCGATC CGAGAAAGAA GCTCCTTTTG GCAAAAGCAG CAAGGGCACC AATCAGGCGC 
CATCGCATAT TGGCGCTGTC GCTCGCGGCA GCAGTCGGCA CCCTGGCCTT CGCGAGTTCA
CCGGTCTTCG CCACCACCGG AACCGGAGAC GGTTCCTCCC CAGGAGCCGC CGCGGCCCAA
AGCCCCCAGG GGCAGGCTGT GGCCGCAGGC GAGGCCTATA CAAAGTTCAT CGTGGACTAC
AAGGAAAGTG CGGCCAACTC CACGCCTAAC GGCCGCGCCA ACGCCTGGGG CAAGGCAGCC
AACGCGCAGG GCGTGACAGT CAAGGAAGTC CGCACCCTGG CCACCGGCGG AACGCTGATC
GAGGCGGACA AGGCCCTGGC CGGGCAGGCG GCCAAGGACT TCATGGCCGG GATTGCGGCC
TCGGGGTCGG TCGAATCCGT AGAGCCCGAC GCGCGGATGA CAGTGGCGCT CACACCCAAC
GACCCGCGCT ATGGCGAGCA GTGGGACTTC ACCGCCACGA ACGGCATGCG GATTCCCGGT
GCGTGGGACG TCGCCACCGG CACCGGCGTG ACAGTGGCCG TCATCGACAC CGGCATCACT
GCCCACCCCG ACCTGGACGC CAACGTGCTC CCCGGTTACG ACTTTGTCTC GGACGCCACC
GCTGCACGGG ACGGCAACGG CCGCGATGCC AACGCGCAGG ACCAGGGCGA CTGGTACGCC
GCCGGCGAGT GCGGCCAGAC AACGGCCGGC AACAGCTCCT GGCACGGCAC CCACGTGGCC
GGCACCGTGG CAGCCGTGAC CGGCAACGCG ACCGGAGTGG CCGGCGTGGC ACCCAATGCC
AAAGTGGTCC CGGTCCGCGT CCTCGCCAAG TGCGGCGGTT CCCTGTCCGA CATCGCCGAC
GCGATCATCT GGTCAGCCGG CGGAACCGTC TCGGGCATCC CGGCCAACGC CAACCCGGCA
AAGGTCATCA ACATGAGCCT GGGCGGATCC GGCTCCTGCG GCACCACCTA CCAAGCGGCC
ATCGATTCCG CAGTTTCGCG GGGAGCCACC GTGGTGGTTG CCGCCGGCAA CAGCAACCAG
GATGCTTCCG GTTTCCGCCC GGCGAACTGC AACAACGTTG TGAGTGTGGC CGCGAGCAAC
CCGGGCGGCA GCCTCTCCTA CTACTCCAAC TACGGCGCCA CGGTTGACCT GACCGCACCG
GGCGGAGACG TCAGGGTGAC CGGCGGCGGG ATCCTCTCCA CCATCAATAC CGGCACCACC
ACGCCGTCTT CGGCGGGCTA CGCCAACTAC CAAGGGACAT CCATGGCGGC CCCGCACGTG
GCCGGGCTAG CTGCACTTAT GAAATCCAAA ACCTCCTCCC TCACCCCGGC CCAGGTCGAG
TCCACGCTCA AGCAGGGAAC CCGCGCGATG CCGGGCGGCT GCACCACCGG CTGCGGCGCC
GGCCTCTCTG ACGCCACCAA GACCATGGGA CTGCTGGGCG GGACCACTCC CCCGCCGTCA
GGCAACCTCC TCCTGAACCC CGGCTTCGAA GAGGGCGCCG CCTCATGGAC CTCTGACCAC
GCCGATACCT TCGAAACCGG TACAAATGCC CGGACCGGTT CCCGCTTCGC CGGACTCAAC
GGCTGGGGCC AGGCGACGTC CTACAAGCTG GACCAGGCTT TCGCAGTCCC CTCCACCGTG
TCCGCCGCGT CGCTGTCCTT CTACTTGAAA GTGCAGTCCG ACGAAACGAC TGCCAGCTCG
GCCTATGACA CCCTCAAAGT CCAGATGATC AGCGGCGGCA CCACCACCAC GTTGGCCACG
TACTCCAACC TCAACGAGTC CACCGGCTAC GTGCAGAAGC AGCTGGATCT TTCCGCCTAC
AAGGGCAAGA GCGTGACCTT GCGCTTCCTG GGTGTCGAGG ATTCATCGCT GTCGACGTAC
TTCTACCTTG ATGACACGTC CGTGACCACG TCCTAG
 
Protein sequence
MPDPRKKLLL AKAARAPIRR HRILALSLAA AVGTLAFASS PVFATTGTGD GSSPGAAAAQ 
SPQGQAVAAG EAYTKFIVDY KESAANSTPN GRANAWGKAA NAQGVTVKEV RTLATGGTLI
EADKALAGQA AKDFMAGIAA SGSVESVEPD ARMTVALTPN DPRYGEQWDF TATNGMRIPG
AWDVATGTGV TVAVIDTGIT AHPDLDANVL PGYDFVSDAT AARDGNGRDA NAQDQGDWYA
AGECGQTTAG NSSWHGTHVA GTVAAVTGNA TGVAGVAPNA KVVPVRVLAK CGGSLSDIAD
AIIWSAGGTV SGIPANANPA KVINMSLGGS GSCGTTYQAA IDSAVSRGAT VVVAAGNSNQ
DASGFRPANC NNVVSVAASN PGGSLSYYSN YGATVDLTAP GGDVRVTGGG ILSTINTGTT
TPSSAGYANY QGTSMAAPHV AGLAALMKSK TSSLTPAQVE STLKQGTRAM PGGCTTGCGA
GLSDATKTMG LLGGTTPPPS GNLLLNPGFE EGAASWTSDH ADTFETGTNA RTGSRFAGLN
GWGQATSYKL DQAFAVPSTV SAASLSFYLK VQSDETTASS AYDTLKVQMI SGGTTTTLAT
YSNLNESTGY VQKQLDLSAY KGKSVTLRFL GVEDSSLSTY FYLDDTSVTT S
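
The sequences above are printed in blocks of ten characters, so the spaces and line breaks have to be removed before any analysis. The sketch below shows one way to do that, then compute a GC fraction and translate the coding sequence. It is an illustration under stated assumptions rather than the method used to produce this record: the helper names are hypothetical, the codon table is built by hand, and it relies on translation table 11 assigning the same amino acids to codons as the standard code (the tables differ only in which codons may act as starts, which is not relevant here because the gene begins with ATG).

```python
# Codon table in the conventional T, C, A, G ordering.
# "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(blocked: str) -> str:
    """Remove the spaces and line breaks used for 10-character blocking."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean_sequence(seq)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean_sequence(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# Last line of the gene sequence shown above.
tail = "TTCTACCTTG ATGACACGTC CGTGACCACG TCCTAG"
print(translate(tail))  # FYLDDTSVTTS
```

The translated tail matches the final residues of the protein sequence (FYLDDTSVTT S), and applying `translate` to the first blocks of the gene sequence gives MPDPRKKLLL, the start of the protein. Passing the full gene sequence to `gc_fraction` should give a value close to the 67% reported in the record.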