Gene Arth_1052 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_1052 
Symbol 
ID4446452 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp1130060 
End bp1131331 
Gene Length1272 bp 
Protein Length423 aa 
Translation table11 
GC content62% 
IMG OID639688855 
Productpeptidase S8/S53 subtilisin kexin sedolisin 
Protein accessionYP_830546 
Protein GI116669613 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1404] Subtilisin-like serine proteases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.223191 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAACAGA CCCTGGTAGC TAGTTTCGCA GCAGTAGTCA TGGGCTTCAG CGGAATTGCC 
GCAGCGGTCC CAAGCAATGC GGCCCCAAGC AACGCAGCTG CGCAGTCATC ACATATCGCG
GGACAAATCA TGGTTAAATT CCGCGACGAT GGCGCGGCGG CCGGTGTGCT TCGCCAGCAT
GGGCTCAAGG TGGGTTCCGG TATCGGCAGC ACCGGCGCTC AACTCATCAA GGTGCCGGCA
GGCAAAGAAT TGACCCTCAT CGAAGCCCTA AACCGGAACC CGGCGGTTGA ATACGCCGAA
CCGGACGAGA TCGCGACTGC CGATACCGAT GACCCGTTTT TCCCCCGCCA ATACGCGCTG
CAGAACGACG GCCAATCATT TACCAACACC CTCAGCACGA TAACTGTTGC TAAGGGCACG
GTGGACGCTG ATGTGGACGC CGTCGAAGCG TGGAGCATCA CTAAAGGCAG GGACACCCGA
GTCGCCATTA TCGACTCAGG CGTTGCAAAT GACCACGAGG ATATTTCGGA GAAGGTCGTT
GCGCGGATCA ACTTCAGTGA TGCGGCAACC GGCGACGACA AATACGGCCA CGGCACCCAT
GTGGCCGGGA TCGTTGCCGC GATCGCCGGC AACGGCAAGG GTGTCGCCGG CGTGTGCCCG
GAGTGCACCA TCCTGGACGC CAAAGTGCTC AACGACAACG GGTCCGGTTC CACCTCGGCC
ATTGCCAAGG GCATCGACTG GGCCGTGAAC AATGGTGCCA GGGTGATCAA CATGAGCCTT
GGAATGCGCG TCTCGTCACG CACGCTCGAG GCGGCCGTCA ACAACGCTTG GAACCGGGGT
GTGGTGCTGG TGGCCGCGGC GGGCAACGCC GGTACTCCGG CCCAGATCTA CCCGGGCGCC
TACTCTAACG TCATTGCCGT GGCGGCAACA GATAACAATG ACGACAAGGC ATCGTTCTCC
AGCTACGGTT CCAAGTGGGT GGATATCGCG GCGCCGGGTG TCAACGTCTA CTCGACCTTC
CCGGTCCGCC CCTTCGTCCT GGGTACGCAA AACGGCCGGT CCATGGGCTA TGACATCGCC
AGCGGCACCT CAATGGCCTC GCCGATCGTG GCCGCCACTG CCGCTCTCCT CTGGAGCACG
CAGACCTGCC CTACGAACGC TGACGTCCGG GCAAAGGTCC TGTCCACCAC GGAGCGAAAG
CCCGGCACTG AAACCTTCTG GGCGAACGGC CGAGTGAACG CCTTCAAGGC CGTCGACGGG
TCCTGCTCCT AA
 
Protein sequence
MKQTLVASFA AVVMGFSGIA AAVPSNAAPS NAAAQSSHIA GQIMVKFRDD GAAAGVLRQH 
GLKVGSGIGS TGAQLIKVPA GKELTLIEAL NRNPAVEYAE PDEIATADTD DPFFPRQYAL
QNDGQSFTNT LSTITVAKGT VDADVDAVEA WSITKGRDTR VAIIDSGVAN DHEDISEKVV
ARINFSDAAT GDDKYGHGTH VAGIVAAIAG NGKGVAGVCP ECTILDAKVL NDNGSGSTSA
IAKGIDWAVN NGARVINMSL GMRVSSRTLE AAVNNAWNRG VVLVAAAGNA GTPAQIYPGA
YSNVIAVAAT DNNDDKASFS SYGSKWVDIA APGVNVYSTF PVRPFVLGTQ NGRSMGYDIA
SGTSMASPIV AATAALLWST QTCPTNADVR AKVLSTTERK PGTETFWANG RVNAFKAVDG
SCS