Gene Arth_2309 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_2309 
Symbol 
ID4445352 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp2597828 
End bp2598808 
Gene Length981 bp 
Protein Length326 aa 
Translation table11 
GC content66% 
IMG OID639690118 
Productpeptidase S1 and S6, chymotrypsin/Hap 
Protein accessionYP_831789 
Protein GI116670856 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00460586 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCGGCAG CCAGCAGCGG GATCAACGGA GCCGACACTA TTGAAGACCC GCTGGATTCC 
TATTCCGCAA CAGTCATCCG CGTCGCGGAA ACCGTCACCC CGCACGTAGC CGCCGTCGAA
ATGGCCGGCA CGGGCCGCAA CGGCGGATAC CGGGTTGGCG CGGGGTCGGC CGTCCTGTTC
ACCTCCGACG GCTACCTGGT GACCAACGCG CATGTGGTGG GCTCCGCGGG AAAGGGGCAC
GCGGTGTTTG CCGACGGCAC CCGGACGGCT GTTGAAGTGG TGGGTGCCGA TCCCCTGTCC
GACCTCGCTG TTGTCCACGG CAAAGCACCC ATGGTCCGGC CGGCCGAATT CGGCGACGCC
GAACTCCTCA AGGTCGGGCA GCTGGTGATC GCTGTCGGTA ATCCGCTGGG ACTCTCGGGC
TCCGTGACCG CAGGGGTGGT CAGCGGCCTC GGCCGTTCCA TCCCGGTGTG GTCGGGACGC
AACCGGCGCG TGATCGAGGA CGTCATCCAG ACCGACGCCG CGCTTAATCC CGGCAACTCC
GGAGGGGCCC TGGCCGACGC ACGGGGCAGG ATCGTGGGCA TTAACACGGC GGTCGCAGGA
GCGGGACTGG GTTTGGCGAT TCCTATCAAC GCGACGTCAC GCCGGATTAT CGCCTCCCTC
CTCTCCGACG GGCGGGTCCG GCGCGCTTAC CTGGGACTCG TGAACACTCC CGTTCAACTT
CCGGTCAGCT CGGTGGTCCG CACCGGCCAC CGGGATGGGC TGCTGGTTGT CGAAGTGCTT
CCCGGATCAC CTGCCGAACG GGCGGGCCTC CGCGCCGGGG ACGTGCTGTT GAGCGTGGGG
CGGAAATCCG TTTCGAACGC GGAAAGCCTC CAGAAGCTGC TGTTCTCGGA GGCCATCGGG
GCACCATTGG ACATTTCGGC GCTCCGCGAT GGAAAAGAAT TCCACGTTGT GGCCGTACCG
GAGGAAATGA GCGCCCCGTA A
 
Protein sequence
MAAASSGING ADTIEDPLDS YSATVIRVAE TVTPHVAAVE MAGTGRNGGY RVGAGSAVLF 
TSDGYLVTNA HVVGSAGKGH AVFADGTRTA VEVVGADPLS DLAVVHGKAP MVRPAEFGDA
ELLKVGQLVI AVGNPLGLSG SVTAGVVSGL GRSIPVWSGR NRRVIEDVIQ TDAALNPGNS
GGALADARGR IVGINTAVAG AGLGLAIPIN ATSRRIIASL LSDGRVRRAY LGLVNTPVQL
PVSSVVRTGH RDGLLVVEVL PGSPAERAGL RAGDVLLSVG RKSVSNAESL QKLLFSEAIG
APLDISALRD GKEFHVVAVP EEMSAP