Gene Arth_1716 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_1716 
Symbol 
ID4445755 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp1917550 
End bp1919076 
Gene Length1527 bp 
Protein Length508 aa 
Translation table11 
GC content59% 
IMG OID639689538 
Productsulfatase 
Protein accessionYP_831210 
Protein GI116670277 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3119] Arylsulfatase A and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCAGTAG AAGGCGCACC AAGGACAAAC ATTCTGTTTC TCATGACAGA CCAACAACGC 
ATCGATACAA TGGGCTGCTA CGGAAATAGG TCCCGTCACA CCCCCTACCT TGACGGGCTG
GCAGCCCGGG GCACTGTGTA CGACCGCGCT TACACTCCCA CGGCCATCTG CACGCCCGCC
CGCGCATCCC TCCTGACAGG GCTTCATCCC TTCGAGCACG GGCTGCTGTC AAATTTCGAG
TGGAACTCCG GTCACCGGGA CGAACTGCCC GACGGTACTC CCACTTTTGC CGACGAACTC
AGGAAGCAGG GATACCGGTT GGGGCACGTC GGCAAATGGC ACGTCGGGCG GGAGCGCGGT
CCGGATTTCT ACGGCTTTGA AGGGGAGCAC CTGCCCGGGG CCCTGAACAC CTTCGATAAC
CCGGCATACA CGTCCTGGCT TGCGGAGAAA GGGTTCCCCT CATTCCGCAT AGTGGACCCG
GTGTACACCG TTCAAAAAGA CGGATCGCAG GGGCACCTCA TCGCAGGGAT CACTGACCAG
CCCACAGAAG CGACGTTCGA AGCCTGGCTG GCGGACCAGA CCATCGCCAA GCTCCGCGAG
TTTGCCCAGA CCCACCCGGC TGGAGGCGCC CCAGGCACCG AAACAGCCGT CGCACCCTTC
TACCTGTCCT GCCACATCTT CGGACCCCAT TTGCCGTATC TCATTCCGAG GCAATGGTAT
GACTTGGTGG ATCCAGCAAC GGTGCAGCTG CCCAAGTCCT TCGCTGAAAC TTTTAACGGC
AAACCTCTGG TCCAACAGAC CTACGCCGAA TACTGGTCCA CCGATTCATT CACGGTAGAG
GAATGGAAGA AACTGACCGC GGTCTACTGG GGCTACGTTT CCATGATCGA CCACGAGATC
GGACGCATCC TCCAGACCGT CGAGGAACTG GGGCTCAACG ATTCGACCGT GATCATGTTC
ACCGCGGATC ACGGCGAGTT CACCGGCGCA CACAGGCTCA ACGACAAGGG GCCTGCAATG
TACGAGGATA TTTACCGTAT CCCCGCTATT GTCGCTGCGC CCGGCCAGGA ACCCAGACGG
GAATCAAAAT TCGTCTCCCT CCAGGACTTC ACCGCCACGT TCATCGACAT CGCCGACGGC
TATGCCGGAA ATATTCGCGG GAGTTCATTG ATGCCCTCCA CGACCGCTCC ACTGCCCGCT
GACTGGCGAA CAGAGATGGT GTGCGAATTC CACGGACACC ATTTTCCTTA CGCGCAACGG
ATGATCCGTA ATGAACGATA CAAGTACATC GCCAACCCGG AAGGGATTGA CGAGTTCTAC
GATCTGGTCA GCGACCCCGA CGAACTCCAT AACGTGGTAA CTGTGCCCGC CTACGCGACG
CAGCTCAAGA CGATGCGGCT GAGTCTCTAC AAGGAACTCG TCTCCAGAGG TGACAAGTTC
TATCAGTGGC TGGCATTCGC AGGGGACATC GAACCCGAAG ATCGACTCAG GCCCGACACC
GCCCTCGAAC GCTTCGTAAC CCAATGA
 
Protein sequence
MAVEGAPRTN ILFLMTDQQR IDTMGCYGNR SRHTPYLDGL AARGTVYDRA YTPTAICTPA 
RASLLTGLHP FEHGLLSNFE WNSGHRDELP DGTPTFADEL RKQGYRLGHV GKWHVGRERG
PDFYGFEGEH LPGALNTFDN PAYTSWLAEK GFPSFRIVDP VYTVQKDGSQ GHLIAGITDQ
PTEATFEAWL ADQTIAKLRE FAQTHPAGGA PGTETAVAPF YLSCHIFGPH LPYLIPRQWY
DLVDPATVQL PKSFAETFNG KPLVQQTYAE YWSTDSFTVE EWKKLTAVYW GYVSMIDHEI
GRILQTVEEL GLNDSTVIMF TADHGEFTGA HRLNDKGPAM YEDIYRIPAI VAAPGQEPRR
ESKFVSLQDF TATFIDIADG YAGNIRGSSL MPSTTAPLPA DWRTEMVCEF HGHHFPYAQR
MIRNERYKYI ANPEGIDEFY DLVSDPDELH NVVTVPAYAT QLKTMRLSLY KELVSRGDKF
YQWLAFAGDI EPEDRLRPDT ALERFVTQ