Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acry_1005 |
Symbol | |
ID | 5161070 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Acidiphilium cryptum JF-5 |
Kingdom | Bacteria |
Replicon accession | NC_009484 |
Strand | - |
Start bp | 1114913 |
End bp | 1116469 |
Gene Length | 1557 bp |
Protein Length | 518 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 640552922 |
Product | sulfatase |
Protein accession | YP_001234141 |
Protein GI | 148260014 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | [TIGR03417] choline-sulfatase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.549789 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATGCCTG GGGTTGATGG CAACGGAGCA GACAAGGCGG CGGACGGGCG GCCCAATATC CTGATCGTGA TGGCCGACCA GCTCGGCGCG CGGGCGCTGC CGGCCTATGG CAACCAGGTC GCGCTGACGC CCAATATCGA CGCGCTCGCC GCCGGCGGGG TGGTGTTCGA CAACGCCTAT TGCAACAGCC CGCTCTGCGG CCCCTCGCGC TACGTGTTCA TGAGCGGGCA GCTGCCCTCG GCGATCGGCG CCTTCGACAA CGCGGCCGAG TTTCCGGCGA TGCTGCCGAG CTTCGCGCAC CACATGCGCG CCGCCGGATA CCGCACGATC CTGTCCGGCA AGATGCATTT CTGCGGGCCG GACCAGATGC ACGGATTCGA GGAGCGGCTG ACCACCGACA TCTACCCGGC CGATTTCGGC TGGACGCCGG ACTGGACCGA TTTCGCGACG CGGCCGAGCT GGTACCACGA CATGAGCTCG GTGCGCGAGG CGGGGCTGTG CGTGCGGACC AACCAGATGG ACTATGACGA CGAGGTGGTG TTCGCGGCGC GGCAGAAGCT GTTCGACCTC GCGCGCGATG ACGACGGGCG GCCGTTCTGC ATGGTGGTGT CGCTGACGCA TCCGCACGAT CCCTTCGCGA TGACGGAGGA ATACTGGAAC CTCTACGACC ACGACGCGAT CGACATGCCG CGGGTGCGGA CGGCGCCGGC TTCGATGGAC CCGCATTCGC TGCGGCTGCG CCACGTGTCG AACATGGACA ACGAGCCGGT GACCGAGGCG CAGGTGCGCA ACGCGCGGCA CGCCTATTAC GCGGCGATCT CCTTCGTCGA CCGCCAGCTC GGCCGGCTGC GCGAGACGGT CGAGGCGTGC GGGCTCGCGG CGCGGACGGT GACGGTGATG ACCGCCGATC ACGGCGAGCT GCTCGGCGAG CACGGGCTCT GGTACAAGAT GAGCTTCTTC GAGGACGCGT GCCGGATTCC GCTGATCGTG CATGCGCCGG GGCGGTTCGC GCCGGCGCGG GTGGGGGCGG CGGTGTCCTC GGTCGACATG CTGCCGACGC TGGTCGGGCT CGGCGGCGGG AGGATCCCGG CGGGGCTCGC CTGCGACGGG ACCTCGCTGC TCGGCCATCT CGAAGGGCGC GGCGGGCATG ACGGCGCGTT CGGCGAATAT CTCGCCGAGG GGGCGATCGC GCCGATCGTG ATGATCCGGC GCGGGCGGCA CAAGTTCATC CATTGCCCGG CCGATCCGGA CCAGCTCTTC GACCTCGAGG CCGATCCGGA CGAGCGGGCG AACCTCGCGG CGGCGCCGGA GCACGCGGCG CTGGTGGCCG CGTTCCGCGC CGAGGTGGCG GCGCGCTGGG ACCTCGACGC GGTGCATCGC GCGGTGCTGG CGAGCCAGGC GCGGCGGCGC TTCATCGACG CGGCGCTGCG GCAGGGGCGG CGGAAGTCGT GGGATTTCCA GCCCTTCGTC GATGCCTCGG AACAGTACAT GCGCAACCAC ATGCGGCTCG GCGACCTCGA GAAGCGGGCG CGGTTTCCCC GGCCGGCGGG GGACTGA
|
Protein sequence | MMPGVDGNGA DKAADGRPNI LIVMADQLGA RALPAYGNQV ALTPNIDALA AGGVVFDNAY CNSPLCGPSR YVFMSGQLPS AIGAFDNAAE FPAMLPSFAH HMRAAGYRTI LSGKMHFCGP DQMHGFEERL TTDIYPADFG WTPDWTDFAT RPSWYHDMSS VREAGLCVRT NQMDYDDEVV FAARQKLFDL ARDDDGRPFC MVVSLTHPHD PFAMTEEYWN LYDHDAIDMP RVRTAPASMD PHSLRLRHVS NMDNEPVTEA QVRNARHAYY AAISFVDRQL GRLRETVEAC GLAARTVTVM TADHGELLGE HGLWYKMSFF EDACRIPLIV HAPGRFAPAR VGAAVSSVDM LPTLVGLGGG RIPAGLACDG TSLLGHLEGR GGHDGAFGEY LAEGAIAPIV MIRRGRHKFI HCPADPDQLF DLEADPDERA NLAAAPEHAA LVAAFRAEVA ARWDLDAVHR AVLASQARRR FIDAALRQGR RKSWDFQPFV DASEQYMRNH MRLGDLEKRA RFPRPAGD
|
| |