Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Jann_1494 |
Symbol | |
ID | 3933941 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Jannaschia sp. CCS1 |
Kingdom | Bacteria |
Replicon accession | NC_007802 |
Strand | - |
Start bp | 1465402 |
End bp | 1466910 |
Gene Length | 1509 bp |
Protein Length | 502 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 637903844 |
Product | sulfatase |
Protein accession | YP_509436 |
Protein GI | 89053985 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | [TIGR03417] choline-sulfatase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.000530638 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCCAAC CTAACATCCT GATAGTAATG GTGGATCAGT TGAACGGCAC GTTGTTCCCG GACGGCCCGG CGGACTGGCT GCACACGCCC AATCTGGACC GCTTGGCAGC CCGCTCCACA CGATTTGCCA ACACGTATAC CGCCTCTCCA CTCTGCGCGC CGGCCCGCGC GTCGTTCATG TCGGGCCAAT TACCGTCGCG CACCGGCGTC TATGACAACG CCGCGGAATT CACGTCGTCA ATCCCGACCT ACGCCCATCA CCTGCGCCGT GCGGGCTACT ATACTGCGTT GTCGGGCAAG ATGCATTTTG TGGGGCCGGA CCAGCTGCAC GGGTTTGAAC AGCGGCTGAC CACGGATATC TACCCGGCTG ATTTCGGTTG GACGCCGGAC TACCGCAAGC CCGGTGAGCG GATCGATTGG TGGTATCACA ACATGGGCTC CGTCACCGGA TCCGGCGTTG CTGAGATCAC CAACCAGATG GAATATGACG ACGCCGTGGC GTTTGAGGCG GAGCAAAAAC TCTACGACCT GTCGCGTGGG GCCGATCGGC GCCCGTGGTG TCTGACGGTC AGTTTCACCC ACCCCCATGA CCCCTATGTC GCGCGCCGCG AGTTCTGGGA TTTGTATAAC GATTGCAATC GCTTGATACC TGAGATCGGG GCGATCCCCT ATGCAGAGCA GGACAACCAT TCCAAGCGCA TTCTGGACGC GAATGATCTG GGTAATTTCG ATATTACGGA TCAGGACATC GCCCGGTCGC GCCAGGCCTA TTTCGCCAAC ATCTCCTACC TCGATCAGAA GATCGGCGGC GTGCTGGACG TGCTGGAGCG CACGCGGCAG GAGGCGATCG TGATCTTCAT CTCGGACCAC GGTGACATGT TAGGCGAACG TGGCTTGTGG TTCAAAATGT CCTTCTATGA GGGCTCGGCT CGCGTGCCGC TGATGATCTC CGCGCCGGGG ATGGAGCCTG GGATGGTCTC GGACCCGGTC TCCACCATCG ACCTCTGCCC CACCCTTTGC GACCTCGCGG GCGTGTCGAT GGACGAGGTG ATGCCGTGGA CCGATGGCGA AACGTTAACG CCGTTAGGAA AGGGCGCCAA ACGGGTCAAT CCGGTGGCGA TGGAATACGC GGCGGAAGGG TCGTACTCTC CCGTCGTGGG CCTGCGCCAA GGCCAATGGA AATACACCAA TTGCGCGATT GATCCGGAGC AATTGTTCGA TCTGGCCGCC GATCCCCATG AGCTGAACGA CCTGTCCGAA AACCCCGCCC ATGCGGCGAC CTTGGAGGCG TTTCGGGTCA AGGCAGCCGC CCGCTGGGAT CTGGAGGCGT TCGACGCAAA CGTTCGGCAA TCCCAGGCCC GCCGCTGGGT CGTCTACGAG GCGCTGCGCA ACGGCGCCTA CTACCCGTGG GATTTTCAAC CCCTGCGCGA CGCGTCCGAG CGCTACATGC GCAACCACAT GGACCTCAAC GTTCTTGAGG AAAATCAACG CTTTCCCCGG GGCGAATGA
|
Protein sequence | MTQPNILIVM VDQLNGTLFP DGPADWLHTP NLDRLAARST RFANTYTASP LCAPARASFM SGQLPSRTGV YDNAAEFTSS IPTYAHHLRR AGYYTALSGK MHFVGPDQLH GFEQRLTTDI YPADFGWTPD YRKPGERIDW WYHNMGSVTG SGVAEITNQM EYDDAVAFEA EQKLYDLSRG ADRRPWCLTV SFTHPHDPYV ARREFWDLYN DCNRLIPEIG AIPYAEQDNH SKRILDANDL GNFDITDQDI ARSRQAYFAN ISYLDQKIGG VLDVLERTRQ EAIVIFISDH GDMLGERGLW FKMSFYEGSA RVPLMISAPG MEPGMVSDPV STIDLCPTLC DLAGVSMDEV MPWTDGETLT PLGKGAKRVN PVAMEYAAEG SYSPVVGLRQ GQWKYTNCAI DPEQLFDLAA DPHELNDLSE NPAHAATLEA FRVKAAARWD LEAFDANVRQ SQARRWVVYE ALRNGAYYPW DFQPLRDASE RYMRNHMDLN VLEENQRFPR GE
|
| |