Gene Arth_1700 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_1700 
Symbol 
ID4445773 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp1898523 
End bp1899962 
Gene Length1440 bp 
Protein Length479 aa 
Translation table11 
GC content62% 
IMG OID639689522 
Productsulfatase 
Protein accessionYP_831194 
Protein GI116670261 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3119] Arylsulfatase A and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.0968821 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCCACC ACCCCGGACC AAATATCCTG CTCATCCTCA GCGACGACCA GGGCGCATGG 
GCCTTGGGAT GCTCCGGCAA TACCGAAATC CAGACGCCCC ATCTGGACAA CCTGGCCTCA
GGCGGAACGC GATTGGATAA CTTCTTCTGC GTCTCACCCG TCTGTTCCCC CGCGCGGGCC
AGCCTGATGA CCGGCACCAT CCCCTCCAAA CATGGAGTGC ACGACTATCT TCATGGCGTC
GAGACCGGCC CCGAGGCTCC CGACTATCTG CAGGGCCAGC GCCTCTTCAC GGATGACTTG
GCAGCTGCGG GCTATTACAT GGGACTATCC GGAAAATGGC ACTTGGGAGC CAACGACCGG
GCACGGGAGG GATTCAGTCA CTGGTTCTCG CTGGCCGGCG GCGGCAGCCC CTACGATGCG
GCGACCATGT ACAGGAACGG CGTGAAGGAA ACAGTGTACG GCTATCTCAC TGATGCCATT
ACAGCTGATT CGACCGGCTT CATGGAACGT GCCGCCGGGC AAGATTCCCC GTTCTTCCTG
GCGCTGAATT ACACCGCACC GCATAAGCCC TGGAAGGACC AGCATCCTGC TGAATTTACG
GCTCTCTATG ATGATTGTGC GTTTGAGAGC TGCCCCCAGG AGCCCACGCA TCCCTGGACG
CCTACAGTCG ACGGGGTACC GATTGGAGGT GAAGCTGACG TCCGGGCGGC TTTGGTGGGC
TACTTCGCCG CCGTCAGCGC GATGGACGCG GGGATAGGTC AGGTGCTCCA GAAGCTCGAT
GAGCTCGGAT TGAGAGAAGA CACACTCGTG ATCTTCAGTA GCGACAACGG GTTCAACTGC
GGGCAACACG GCGTGTGGGG CAAGGGAAAC GGCACGTTCC CCTTGAATGT CTTCGATTCC
TCGATCAAAG TGCCCGCCAT ATTTTCTTTC CCCGGCAGAA TTGCCCGCGG GAAAGTGCGG
GAAGAACTGC TGTCGGCATA CGACCTTCCG GCCACCATCC TGGAGCTGGC CGGGCTTGAC
CCGCTCGAAT TCGAACAAGG GCCCGGGAAA TCCTTCGCCG ATGTTCTCCG CGGAAAGCCC
CTCGCACCCG CCAGGCCCCG GCCGGTAGTT GTCTTCGATG AATACGGTCC GGTACGCATG
ATCCGCAGCG ACTCATGGAA ATACGTCCAC CGCTACCCCC AGGGTCCCCA TGAACTGTAC
GACCTGGCCA CCGACCCGGG GGAGCGGCAT AACCTGGTTC GGGAGGTCCG GCATGAAGAG
CGCGTCGCCG GAATGCGCCG GGATATGCAG CTCTGGTTTG AGCAGTACCA GGAAGAGGAG
GCAGACGGCC GCAAATTCCC CGTCGTCGGG GCAGGACAGA CGCTCCCGGT CCGGGCCGAT
CCCCTAGGCG CATTCACGCC GCCAAGCTGG GACGGGATCT CTACTGCCGG AGGCCGATGA
 
Protein sequence
MSHHPGPNIL LILSDDQGAW ALGCSGNTEI QTPHLDNLAS GGTRLDNFFC VSPVCSPARA 
SLMTGTIPSK HGVHDYLHGV ETGPEAPDYL QGQRLFTDDL AAAGYYMGLS GKWHLGANDR
AREGFSHWFS LAGGGSPYDA ATMYRNGVKE TVYGYLTDAI TADSTGFMER AAGQDSPFFL
ALNYTAPHKP WKDQHPAEFT ALYDDCAFES CPQEPTHPWT PTVDGVPIGG EADVRAALVG
YFAAVSAMDA GIGQVLQKLD ELGLREDTLV IFSSDNGFNC GQHGVWGKGN GTFPLNVFDS
SIKVPAIFSF PGRIARGKVR EELLSAYDLP ATILELAGLD PLEFEQGPGK SFADVLRGKP
LAPARPRPVV VFDEYGPVRM IRSDSWKYVH RYPQGPHELY DLATDPGERH NLVREVRHEE
RVAGMRRDMQ LWFEQYQEEE ADGRKFPVVG AGQTLPVRAD PLGAFTPPSW DGISTAGGR