Gene Arth_1708 details


Gene Information

Locus tag: Arth_1708
Symbol:
ID: 4445747
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Arthrobacter sp. FB24
Kingdom: Bacteria
Replicon accession: NC_008541
Strand:
Start bp: 1908516
End bp: 1909865
Gene Length: 1350 bp
Protein Length: 449 aa
Translation table: 11
GC content: 64%
IMG OID: 639689530
Product: sulfatase
Protein accession: YP_831202
Protein GI: 116670269
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG3119] Arylsulfatase A and related enzymes
TIGRFAM ID:
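
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal Python sketch, not part of the original record. It assumes that Start bp and End bp are 1-based, inclusive coordinates and that the last codon of the gene is a stop codon; the variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 1908516
end_bp = 1909865

# Assumption: coordinates are 1-based and inclusive, so both end points count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not part of the protein.
protein_length = gene_length // 3 - 1

print(gene_length)     # 1350
print(protein_length)  # 449
```

Under these assumptions the results agree with the listed Gene Length (1350 bp) and Protein Length (449 aa).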


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCGCATC CCGCCCCGGC TCAGCCGAAC ATCCTGATCA TCATGGCCGA CCAGTGGGCG 
GCGCACGCCA TGGGGTGCGC CGGATCGACT GTGGTGAACA CACCCAACCT GGACAATCTC
GCCGCGGCCG GCACCCGCTT CGACCGCGCC TACACGACTT TCCCGCTGTG CGTACCTGCA
CGCAGTTCGC TGGTCTCCGG GCGCTATCCC CACGAACTAG GAATCGACGG TAATGCCGTA
CCAGCAGGCA GCGGGCCGGG GCGGACGCCG GGGAGTCTGG GTCACTGGTT CAAGGCGGCC
GGCTACGACT GCGCCTACGC CGGCAAATGG CACGCCCCGG AGGCAAGCGC CCAGCCCGAG
GACGGCTTCG ACGTCATCCA TCCCTTCGGC GATGAGGGGC TAACGGCCTC GGCGATCGAC
TGGCTCGGCG CCCGCCATGA CACCGGCACG CCTTTCCTGC TGTTGGTCTC CTTCGATAAC
CCCCATACCA TCTGCGAATA TGCCCGAGGC CAGCATCTGC CGTACGGGGA CGTCCAGCGG
CCAGCAGACA TCCGAGACGC GCCTCCGCTG CCCTCGAATT TCGCCACAAC GCCCTATAGT
CCCCAGGCGT TGACTCACGA ACGGGCCCAG GCCGAACAGG CTTACGGGAC GGCGGACTTC
AGCCACGATG ACTGGCGGCT TTACAGGCAC GCATACGCGC AGCTCATCGA AAGGACTGAC
GAACAGATCG GAGTCATCCT GGGTGAACTT GACCGTCAAG GCCTGAGGGA GACTACCGTA
GTGCTCTTCA CCAGCGATCA TGGCGACGGA GACGCCGCCC ATGGCTGGAA CCAGAAGACC
TCGTTACAGG AAGAAGCCAT ACGGGTTCCG CTGCTGATGA GGGGCCCCGG TGTCGGCTAC
AGCCAGGTAG GCAGCCAGTT AATCTCCCTC GGCCTGGACC TCATTCCGAC GCTCTGCAGC
CTGGCAGGCA TTGATGCCCC TGCCACCGCC ACCGGGGTGG ACTGGATCAC CGAACCGCGC
GCGCCCGGGG AAGGGATTAC CGTCGAAACG GCTTTCAGCG CAGGACAGCG GGCCACCACT
CTGGGGCGCG CCTTAATCAC TGGACGGTAC AAATACACCG TCTACAGCTG GGGTAAACAC
CGGGAACAGC TGGTGGACCT CACGGCCGAT CCCGGCGAGC TCCGTAATCT CGCGGAAGAG
TCCGCTTTCG ATGAGGTCCT GGAGGAATTC CGGCGACGGC TTCTGGATTG GTGTTGGGAA
ACCGGCGATC AGGCGTTTCT GAAGAAACTC GTCCTGCCCC ATTCCGGGAG CAGCCTGGCC
CGCAAGGAAA TCTACGCCGT GCCTTACTAG
 
Protein sequence
MSHPAPAQPN ILIIMADQWA AHAMGCAGST VVNTPNLDNL AAAGTRFDRA YTTFPLCVPA 
RSSLVSGRYP HELGIDGNAV PAGSGPGRTP GSLGHWFKAA GYDCAYAGKW HAPEASAQPE
DGFDVIHPFG DEGLTASAID WLGARHDTGT PFLLLVSFDN PHTICEYARG QHLPYGDVQR
PADIRDAPPL PSNFATTPYS PQALTHERAQ AEQAYGTADF SHDDWRLYRH AYAQLIERTD
EQIGVILGEL DRQGLRETTV VLFTSDHGDG DAAHGWNQKT SLQEEAIRVP LLMRGPGVGY
SQVGSQLISL GLDLIPTLCS LAGIDAPATA TGVDWITEPR APGEGITVET AFSAGQRATT
LGRALITGRY KYTVYSWGKH REQLVDLTAD PGELRNLAEE SAFDEVLEEF RRRLLDWCWE
TGDQAFLKKL VLPHSGSSLA RKEIYAVPY
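
The sequences above are printed in blocks of ten characters. The sketch below shows one possible way to strip that layout, compute a GC fraction, and translate DNA to protein in Python. It is an illustration rather than the method used to produce this record. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical. The codon table is the standard genetic code, used here on the assumption that sense codons in translation table 11 carry the same amino acid assignments; alternative start codons are not handled. Only the first line of the gene sequence is used as input.

```python
from itertools import product

# Standard genetic code in the usual NCBI layout: codons are enumerated with
# bases in the order T, C, A, G, the third base varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(blocked):
    """Remove the spaces and line breaks used for display; upper-case the result."""
    return "".join(blocked.split()).upper()


def gc_fraction(dna):
    """Return the fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna):
    """Translate complete codons in reading frame 1; '*' marks a stop codon."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First line of the gene sequence above.
first_line = "ATGTCGCATC CCGCCCCGGC TCAGCCGAAC ATCCTGATCA TCATGGCCGA CCAGTGGGCG"
dna = clean(first_line)

print(translate(dna))  # MSHPAPAQPNILIIMADQWA
print(gc_fraction(dna))
```

The translated fragment matches the first 20 residues of the protein sequence listed above. Passing the full gene sequence through `clean` and `gc_fraction` would give a value to compare against the GC content field.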