Gene Acry_1005 details


Gene Information

Locus tag: Acry_1005
Symbol:
ID: 5161070
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Acidiphilium cryptum JF-5
Kingdom: Bacteria
Replicon accession: NC_009484
Strand:
Start bp: 1114913
End bp: 1116469
Gene Length: 1557 bp
Protein Length: 518 aa
Translation table: 11
GC content: 70%
IMG OID: 640552922
Product: sulfatase
Protein accession: YP_001234141
Protein GI: 148260014
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG3119] Arylsulfatase A and related enzymes
TIGRFAM ID: [TIGR03417] choline-sulfatase
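
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the protein length excludes the terminal stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 1114913
end_bp = 1116469

# Assumption: coordinates are 1-based and inclusive, so add 1.
gene_length = end_bp - start_bp + 1

# A CDS is read in triplets; the last codon is assumed to be the stop codon,
# which does not contribute an amino acid.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1557
print(protein_length)  # 518
```

Under these assumptions the result agrees with the listed Gene Length (1557 bp) and Protein Length (518 aa).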


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.549789
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATGCCTG GGGTTGATGG CAACGGAGCA GACAAGGCGG CGGACGGGCG GCCCAATATC 
CTGATCGTGA TGGCCGACCA GCTCGGCGCG CGGGCGCTGC CGGCCTATGG CAACCAGGTC
GCGCTGACGC CCAATATCGA CGCGCTCGCC GCCGGCGGGG TGGTGTTCGA CAACGCCTAT
TGCAACAGCC CGCTCTGCGG CCCCTCGCGC TACGTGTTCA TGAGCGGGCA GCTGCCCTCG
GCGATCGGCG CCTTCGACAA CGCGGCCGAG TTTCCGGCGA TGCTGCCGAG CTTCGCGCAC
CACATGCGCG CCGCCGGATA CCGCACGATC CTGTCCGGCA AGATGCATTT CTGCGGGCCG
GACCAGATGC ACGGATTCGA GGAGCGGCTG ACCACCGACA TCTACCCGGC CGATTTCGGC
TGGACGCCGG ACTGGACCGA TTTCGCGACG CGGCCGAGCT GGTACCACGA CATGAGCTCG
GTGCGCGAGG CGGGGCTGTG CGTGCGGACC AACCAGATGG ACTATGACGA CGAGGTGGTG
TTCGCGGCGC GGCAGAAGCT GTTCGACCTC GCGCGCGATG ACGACGGGCG GCCGTTCTGC
ATGGTGGTGT CGCTGACGCA TCCGCACGAT CCCTTCGCGA TGACGGAGGA ATACTGGAAC
CTCTACGACC ACGACGCGAT CGACATGCCG CGGGTGCGGA CGGCGCCGGC TTCGATGGAC
CCGCATTCGC TGCGGCTGCG CCACGTGTCG AACATGGACA ACGAGCCGGT GACCGAGGCG
CAGGTGCGCA ACGCGCGGCA CGCCTATTAC GCGGCGATCT CCTTCGTCGA CCGCCAGCTC
GGCCGGCTGC GCGAGACGGT CGAGGCGTGC GGGCTCGCGG CGCGGACGGT GACGGTGATG
ACCGCCGATC ACGGCGAGCT GCTCGGCGAG CACGGGCTCT GGTACAAGAT GAGCTTCTTC
GAGGACGCGT GCCGGATTCC GCTGATCGTG CATGCGCCGG GGCGGTTCGC GCCGGCGCGG
GTGGGGGCGG CGGTGTCCTC GGTCGACATG CTGCCGACGC TGGTCGGGCT CGGCGGCGGG
AGGATCCCGG CGGGGCTCGC CTGCGACGGG ACCTCGCTGC TCGGCCATCT CGAAGGGCGC
GGCGGGCATG ACGGCGCGTT CGGCGAATAT CTCGCCGAGG GGGCGATCGC GCCGATCGTG
ATGATCCGGC GCGGGCGGCA CAAGTTCATC CATTGCCCGG CCGATCCGGA CCAGCTCTTC
GACCTCGAGG CCGATCCGGA CGAGCGGGCG AACCTCGCGG CGGCGCCGGA GCACGCGGCG
CTGGTGGCCG CGTTCCGCGC CGAGGTGGCG GCGCGCTGGG ACCTCGACGC GGTGCATCGC
GCGGTGCTGG CGAGCCAGGC GCGGCGGCGC TTCATCGACG CGGCGCTGCG GCAGGGGCGG
CGGAAGTCGT GGGATTTCCA GCCCTTCGTC GATGCCTCGG AACAGTACAT GCGCAACCAC
ATGCGGCTCG GCGACCTCGA GAAGCGGGCG CGGTTTCCCC GGCCGGCGGG GGACTGA
 
Protein sequence
MMPGVDGNGA DKAADGRPNI LIVMADQLGA RALPAYGNQV ALTPNIDALA AGGVVFDNAY 
CNSPLCGPSR YVFMSGQLPS AIGAFDNAAE FPAMLPSFAH HMRAAGYRTI LSGKMHFCGP
DQMHGFEERL TTDIYPADFG WTPDWTDFAT RPSWYHDMSS VREAGLCVRT NQMDYDDEVV
FAARQKLFDL ARDDDGRPFC MVVSLTHPHD PFAMTEEYWN LYDHDAIDMP RVRTAPASMD
PHSLRLRHVS NMDNEPVTEA QVRNARHAYY AAISFVDRQL GRLRETVEAC GLAARTVTVM
TADHGELLGE HGLWYKMSFF EDACRIPLIV HAPGRFAPAR VGAAVSSVDM LPTLVGLGGG
RIPAGLACDG TSLLGHLEGR GGHDGAFGEY LAEGAIAPIV MIRRGRHKFI HCPADPDQLF
DLEADPDERA NLAAAPEHAA LVAAFRAEVA ARWDLDAVHR AVLASQARRR FIDAALRQGR
RKSWDFQPFV DASEQYMRNH MRLGDLEKRA RFPRPAGD
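
The sequences above are printed in space-separated blocks of ten characters. As a minimal sketch of how they might be processed, the Python below strips the whitespace, computes the GC percentage, and translates DNA using the amino-acid assignments of translation table 11, which are the same as those of the standard genetic code. The helper names (`clean_sequence`, `gc_content`, `translate`) are hypothetical, and only the first three blocks of the gene sequence are used as input.

```python
from itertools import product

BASES = "TCAG"
# Amino-acid assignments in NCBI codon order (TTT, TTC, TTA, TTG, TCT, ...).
# Table 11 shares these assignments with the standard code.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text):
    """Remove spaces and line breaks and return the sequence in upper case."""
    return "".join(text.split()).upper()


def gc_content(dna):
    """Return the percentage of G and C bases in a DNA string."""
    dna = clean_sequence(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def translate(dna):
    """Translate DNA codon by codon, stopping at the first stop codon."""
    dna = clean_sequence(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
first_blocks = "ATGATGCCTG GGGTTGATGG CAACGGAGCA"
print(translate(first_blocks))  # MMPGVDGNGA
```

The translated fragment matches the first block of the protein sequence. Passing the full gene sequence to `gc_content` and `translate` would allow a comparison with the listed GC content and protein sequence.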