Gene Arth_1701 details

Gene Information

Locus tag: Arth_1701
Symbol:
ID: 4445774
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Arthrobacter sp. FB24
Kingdom: Bacteria
Replicon accession: NC_008541
Strand:
Start bp: 1899962
End bp: 1901389
Gene Length: 1428 bp
Protein Length: 475 aa
Translation table: 11
GC content: 61%
IMG OID: 639689523
Product: sulfatase
Protein accession: YP_831195
Protein GI: 116670262
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG3119] Arylsulfatase A and related enzymes
TIGRFAM ID:
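
The gene and protein lengths listed above follow from the start and end coordinates. The sketch below reproduces that arithmetic, assuming the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(1899962, 1901389)
print(length)                     # 1428
print(protein_length_aa(length))  # 475
```

Both values agree with the Gene Length and Protein Length fields of this record.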


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.987903
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACTGCTG AACGCCCTAA TATCCTGTTC ATCATCGCCG ATCAGTTCCG CAACAGTGCT 
TTGGGATTCA GGGGCCAGGA TCCCACCTAC ACCCCATCCT TGAACTCCTT CGCAGCCGAA
TCAAAGGACA TCCTCCACGC CGTCAGTAAC TATCCGGTCT GCAGTCCGCA CCGGGCAATG
CTCATGACAG GGCAACACCC CCACCGTAAC GGCGTGCCTC TGAACATCAA TTCCAACACC
GGTGCCGGTC TCGAACCCGG CATAGGCACA TGGTCACAGG TACTCCGGGA TGCGGGCTAC
GGGACCGGAT ACATCGGCAA ATGGCATCTG GAAGCAGTCA CCGAAGAGGA CGCAATCTGG
GGCGAAGGAT TTCGCGAAGG AGCTGTCTGG GATGCCTACT CGCCTGTGGA CAGGCGGCAC
GGTTTCTCCT TCTGGTACTC GTACGGAGCC GCCCACGACC ACATGCACCC CCACTATTGG
GTAGGGGACG CACCCCGCGA AGAGAAAATC GTCGTGGATC AATGGTCGGC AGAACACGAA
ACCGACATTG CCATCGGCTT TTTGCGCGAA ACAACAGACG CTGCCGAGTC CTTCGCGCTG
GTGGTGTCCT ACAACCCGCC GCACCAACCG TTCGAGCTGG CACCTGCAAC CTACCGCCCC
AGGTACGCTC AACTCTCCGC ACGGGAACTA TTGACCAGAC CCAACGTCGA CGTCACGGGC
CCGGCCGGAG CGGAAGCAGC GCAGGCAGCC CCGTTGTATT TCGCCGCCAT TTCCGCCATC
GATCACCAAA TCGGCCGGCT GCTCGTCGCC TTGGAAGCCA GCGGTCACCA CAAAAACACG
ATCGTGATCT TCACCTCGGA CCATGGCATG CAGCTGGGAA GCCACGGTCT TATGTTCAAA
AATGTTCCTT GGGAAGAATC GATGTCGTTA CCATTTCTCA TCCGTTGGCC GGGCCGGATA
GCGTCCGGTC CCGACGACAA GGTTTTGATC AGCTCCGTTG ATGTCGGTCC CACCCTCCTG
GGCCTGGCGG GGCTATCTAC CAGCCGGCCG CCGGCCATGC AGGGCGCTGA TCTCTCGTCC
CGGCTGATCG GCGCGACAAC GACGCCGGTC CCTGGTCCCG CCATCTACTA CGGGCCCCCG
GCGCGGGACG GCGGGCCGGG GATGCGAGGC CTGAGGACCC TGAGTCACAA GTTGCTCTTC
AGCTGTATCC CCGATCCGGC CCAACGATCC GGGTTCCTTC TCTCCGCGCA GCTCTACGAC
CTGAAGTCCG ACCCCTACGA AATGAGTGAC CAGGCTGCCT CCCGCCCTGC TGAGGTTCAG
CTGATGGGGC GTGAACTGGT ACGGCAGCTG GAGATCGTGG AGGATCCCTG GGCGGCAAGG
GATCAGCTCC GGGAACACTT GGACGGGGAA ACGTGCAGCT GGAAATAG
 
Protein sequence
MTAERPNILF IIADQFRNSA LGFRGQDPTY TPSLNSFAAE SKDILHAVSN YPVCSPHRAM 
LMTGQHPHRN GVPLNINSNT GAGLEPGIGT WSQVLRDAGY GTGYIGKWHL EAVTEEDAIW
GEGFREGAVW DAYSPVDRRH GFSFWYSYGA AHDHMHPHYW VGDAPREEKI VVDQWSAEHE
TDIAIGFLRE TTDAAESFAL VVSYNPPHQP FELAPATYRP RYAQLSAREL LTRPNVDVTG
PAGAEAAQAA PLYFAAISAI DHQIGRLLVA LEASGHHKNT IVIFTSDHGM QLGSHGLMFK
NVPWEESMSL PFLIRWPGRI ASGPDDKVLI SSVDVGPTLL GLAGLSTSRP PAMQGADLSS
RLIGATTTPV PGPAIYYGPP ARDGGPGMRG LRTLSHKLLF SCIPDPAQRS GFLLSAQLYD
LKSDPYEMSD QAASRPAEVQ LMGRELVRQL EIVEDPWAAR DQLREHLDGE TCSWK
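
The sequences above are printed in blocks of 10 characters, so they need cleaning before use. The following is a minimal sketch, not the database's own tooling: it strips the block spacing, computes a GC fraction, and translates the first 30 bases of the gene sequence. It assumes the standard codon assignments (which translation table 11 shares for internal codons) and does not handle the alternative start codons that table 11 permits. This gene starts with ATG, so that limitation does not matter here. All helper names are hypothetical.

```python
BASES = "TCAG"
# Standard amino acid assignments for codons enumerated in TCAG order.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO))


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence from this record.
dna = clean_sequence("ATGACTGCTG AACGCCCTAA TATCCTGTTC")
print(len(dna))        # 30
print(translate(dna))  # MTAERPNILF
```

The translated prefix matches the first block of the protein sequence. Running `gc_fraction` on the full cleaned gene sequence would allow a comparison with the GC content field of this record.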