Gene Information |
Locus tag | Sala_0302 |
Symbol | |
ID | 4082865 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sphingopyxis alaskensis RB2256 |
Kingdom | Bacteria |
Replicon accession | NC_008048 |
Strand | + |
Start bp | 301155 |
End bp | 302786 |
Gene Length | 1632 bp |
Protein Length | 543 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 638008660 |
Product | sulfatase |
Protein accession | YP_615358 |
Protein GI | 103485797 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
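
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that Protein Length excludes the stop codon. Both assumptions are consistent with the values in this record but are not stated by it. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record for Sala_0302.
length = gene_length_bp(301155, 302786)
residues = protein_length_aa(length)
print(length, residues)
```

Under these assumptions the computed values match the Gene Length (1632 bp) and Protein Length (543 aa) fields.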
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.139649 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCAATA AATGGGTGGC GCTGGGTGCC ATGACGCTGG CGGCGGTCGG TGGCTATTGG GCCTATGACG CGAACAAATA TCGCATCCCC GGCATCGTGC AGGACTGGCG CGAGCCGGTG CAGCCGAACC GGGCGATCGC GTGGCAGCAA GGGCCGGCGG CGGCGCCGGA GGGCGCGCGG CCGCCGAACA TCATCCTGAT CGTCGCCGAC GACCTGGGCT ATAACGACAT CAGCCTGAAC GGCGGCGGCG TCGCGGGGAT CGTCAAGACG CCGAACATCG ATGCGCTGGC GCGTGAGGGG GTCCATTTCA CCACCGCCTA TGCCGCCAAC GCGACCTGCT CGCCCTCGCG CGCGGCGATG ATGACGGGGC GCTATCCGAC GCGCTTCGGC TTCGAGTTCA CCGCGGTGCC GATCGAGTTC GCCGAAAATC TGGCGCATGG CGAGGGTGTC GGGCCGCACC GCGCGATCTT TCACGACGAA CTGGTGACGC CCGACATCCC GCCCTATCCC CAGATGGGGG TGCCCGCGAG CGAGGTGACA ATTGCCGAAG CGGTGAAGGC GGCGGGCTAT CACACGGTTC ACATCGGCAA ATGGCATCTG GGCGAAGCCC CCGAATTGCA GCCGCACGCC CAGGGCTTCG ACGAAAGCCT GGCGGTGCTG GCGGGCGCGG CAATGCTGCT GCCCGAGGAT GACCCCGACG CAGTCAACGC CAAGCTGCCG TGGGATCCGA TCGACCGCTT CATCTGGGCC AATCTGCGCC ACGCAGTGAC CTTCAACGGC AGCAAGCGGT TTGCCGCGCA GGGGCATATG ACCGACTATT TCGCCGACGA GGCGATCAAG GCGATCGAAG CGAACCGGAA CCGGCCCTTT TTCCTCTATC TTGCCTTCAC CGCGCCGCAC ACGCCGCTGC AGGCGACGCG CGCCGATTAT GACCGGCTCG CGGCGATCAA GGATCACAGG ACGCGCGTTT ATGGCGCGAT GATCGCGCAG ATGGACCGGC GGATCGGCGA CGTGATGGCC AAGCTGAAGG AGGCCGGGAT CGACGACAAT ACGCTCGTCA TCTTCACCAG CGACAATGGC GGCGCCTGGT ACAACGGGAT GCCGGGGCTG AATGCGCCGT TCCGCGGGTG GAAAGCGACC TTTTTCGAAG GCGGCATCCG GGCGCCGCTG TTCATGCGCT GGCCAGCGCG CATCGCGCCG GGGACCGAGC GCGGCGACGT GACGGGCCAT CTCGACCTTT TCGCGACGAT TGCCGCCGCG GCGGGCGCGG CGCTGCCCGC GGACCGGACG ATCGACAGCG AGGATATATT GGCCGGTCCC GCCAAGCGTC CGGCGATGTT CTGGCGCTCG GGCGATTATC GCGCGGTGCG CGCGGGCGAC TGGAAATTGC AGGTGACGAA GCGACCCGAA AAGGCGCGCC TCTATAACCT TGCCGCCGAT CCGACCGAAC GGACCGACCT GTCGGCGCGC GAGCCGGCGC GCGTCGCCGA ACTCGGCGCG ATGATCGAGG CGCAGAACCG GGGCATGGCG ACGCCGATCT GGCCGGGGTT GGTCGAAGGG CCGGTGCGCA TCGACGTGCC GCTGAACACG CCGTGGCAGG ACGGGCAGGA TTATATCTAT TGGACCAACT GA
|
Protein sequence | MANKWVALGA MTLAAVGGYW AYDANKYRIP GIVQDWREPV QPNRAIAWQQ GPAAAPEGAR PPNIILIVAD DLGYNDISLN GGGVAGIVKT PNIDALAREG VHFTTAYAAN ATCSPSRAAM MTGRYPTRFG FEFTAVPIEF AENLAHGEGV GPHRAIFHDE LVTPDIPPYP QMGVPASEVT IAEAVKAAGY HTVHIGKWHL GEAPELQPHA QGFDESLAVL AGAAMLLPED DPDAVNAKLP WDPIDRFIWA NLRHAVTFNG SKRFAAQGHM TDYFADEAIK AIEANRNRPF FLYLAFTAPH TPLQATRADY DRLAAIKDHR TRVYGAMIAQ MDRRIGDVMA KLKEAGIDDN TLVIFTSDNG GAWYNGMPGL NAPFRGWKAT FFEGGIRAPL FMRWPARIAP GTERGDVTGH LDLFATIAAA AGAALPADRT IDSEDILAGP AKRPAMFWRS GDYRAVRAGD WKLQVTKRPE KARLYNLAAD PTERTDLSAR EPARVAELGA MIEAQNRGMA TPIWPGLVEG PVRIDVPLNT PWQDGQDYIY WTN
|
| |
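
The sequences above are displayed in space-separated blocks of 10 letters, so the spaces have to be removed before any analysis. The following is a minimal sketch of that clean-up together with a GC-content calculation. The helper names are hypothetical, and the example uses only the first three blocks of the gene sequence, so its GC value describes that excerpt and not the full gene (reported as 67% in the record).

```python
def clean_sequence(grouped: str) -> str:
    """Remove the whitespace that separates the 10-letter display blocks."""
    return "".join(grouped.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases in the sequence that are G or C."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


# First three display blocks of the gene sequence in this record
# (an excerpt, not the full 1632 bp gene).
excerpt = clean_sequence("ATGGCCAATA AATGGGTGGC GCTGGGTGCC")

print(len(excerpt))                   # number of bases after clean-up
print(excerpt.startswith("ATG"))      # does the excerpt begin with a start codon?
print(f"{gc_fraction(excerpt):.0%}")  # GC content of the excerpt only
```

Passing the full Gene sequence field through `clean_sequence` in the same way would allow the reported length and GC content to be checked against the sequence itself.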