Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sala_0299 |
Symbol | |
ID | 4082862 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sphingopyxis alaskensis RB2256 |
Kingdom | Bacteria |
Replicon accession | NC_008048 |
Strand | + |
Start bp | 297402 |
End bp | 299231 |
Gene Length | 1830 bp |
Protein Length | 609 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 638008657 |
Product | sulfatase |
Protein accession | YP_615355 |
Protein GI | 103485794 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.142586 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCAGCAG GCTGGAGGAC CACCAGGCGG GGCCGGACAG GTGCGTCCCG GCGTGCTGCC ATCATCGCGC TCGCATGCCT GTTGGGAAGC ACATCGGGCC TGGCGCAGTC ACGCGCGGCG CCGCCGCGCC AGCCCAATAT CGTCATCCTG CTCGCCGACG ACTGGGGGTT TTCGGACGTC GGTGCCTTTG GCTCCGAAAT CGCGACACCG CATATCGACG CGCTCGCACG CGCCGGAATG CGCTTTGCGA ACTTCCATGT CTCGGGTTCC TGCTCGCCGA CGCGCGCGAT GCTCCAGACG GGGGTGATGA ACCACCGCAA CGGCCTCGGC AACATGCCCG AGACGATCCC CGACGAACAT CGCGGCAAGC CCGGTTACGA CACGGTGATG AACCTGCGCG TCGTGACGAT CGCAGAGTTG ATGAAGGCCG CGGGATACCG CACCTACCTG ACCGGCAAAT GGCATCTGGG CAGCGACGCA AAGCGGCTGC CCGAAGCGCG CGGATACGAC CGCGCCTTCA GTCTCGCCGA TGCGGGCGCC GACAATTTCG AGCAGCGACC GATCGAAGGG CTGTACGACA AGGCGAACTG GACCGAGAAC GGCCGCCCCG CGACCCTGCC CCGCGATTAT TATTCATCCA CCTTCGTCGT CGAAAAGATG ATCGAATATA TCGAGGCGGA TCGCGATAGC GGCAAACCCT TCCTCGCCTC GATCAACTTC CTCGCCAATC ATATCCCGGT GCAGGCGCCC GACAGCGACA TCGCGCGCTA TGCGGCGATG TATCAGGACG GCTGGACGGC GCTGCGCGAG GCACGCGCGC GGCGTGCGGC GGCACTCGGC ATCGTGCCGG CGGGCACGCC GATGGTGACG ATGCCGACGA CGCGCGACTG GCAGAGGCTG GACGCCGACG AACGCGCGGC GGCGGTGCGC GTGATGCAGG CCTATGGCGG CATGGCGACC GCGATGGACC GCGAGATCGG GCGACTCGTC GCGCACCTCA AGACCACGGG CGATTACGAC AACACGATCT TCGTCTTCCT GTCCGACAAT GGCGCCGAGC CGACGAATCC CTTTGCCAGT CTGCGCAACC GGCTGTTCCT GCGGATGCAA TATGATCTTT CGACCGATAA CATCGGGCGG CGGGGCAGTT TTTCGGCGAT CGGGCCGGGC TGGGCAAGCG CCGCGGCGTC GCCCTTGTCA GGTTACAAGT TCAGCGCCGC CGAGGGCGGG CTGCGCGTCC CGCTGATCAT CGCCTGGCCG GGGCATGGCG CGATCCCGGC GGGCGCGATC AACGACGGGC TGGCGCATGT CACCGACCTC TTGCCGACGC TTGCCGAACT GGCGGACGTG CCGCTGCACG AAGGGACATG GCAGGGGCGG AGCGTCGAGC CGATCACGGG GCGCAGCCTC GTCCCGATGC TGAAGGGTGC TGCGGGCAGC GTCCATGGCG ACGCGCCGCT CGGTTACGAG CTGTCGGGCA ATGCCGCGCT GTTTCGCGGC GATTACAAGC TGGTGCGCAA CCTGCCGCCG ACCGGCGACG GCCGGTGGCG GCTCTATGAC ATCAAGACGG ACCCCGGCGA GACCCGCGAC CTGTCGGCGG CGATGCCCGA TCGGTTCGCC GCGATGCTGT CCGACTACCG CGCCTATGCC AGGGCGAACG GCGTGCTCGA CATGCCGGCG GGTTATACCG CCGACGAACA GATCAACCGT TATGCGTGGG AGCAGCAGGG GCGCAAACGC GCGATCAAGG CCGGGCTGTG GCTGGGCGGC GGGTTGATGG CGCTGGCGTT GCTGGTCTGG AGCTGGCGGC GGCGGCGCGC GCGGGGGTAA
|
Protein sequence | MAAGWRTTRR GRTGASRRAA IIALACLLGS TSGLAQSRAA PPRQPNIVIL LADDWGFSDV GAFGSEIATP HIDALARAGM RFANFHVSGS CSPTRAMLQT GVMNHRNGLG NMPETIPDEH RGKPGYDTVM NLRVVTIAEL MKAAGYRTYL TGKWHLGSDA KRLPEARGYD RAFSLADAGA DNFEQRPIEG LYDKANWTEN GRPATLPRDY YSSTFVVEKM IEYIEADRDS GKPFLASINF LANHIPVQAP DSDIARYAAM YQDGWTALRE ARARRAAALG IVPAGTPMVT MPTTRDWQRL DADERAAAVR VMQAYGGMAT AMDREIGRLV AHLKTTGDYD NTIFVFLSDN GAEPTNPFAS LRNRLFLRMQ YDLSTDNIGR RGSFSAIGPG WASAAASPLS GYKFSAAEGG LRVPLIIAWP GHGAIPAGAI NDGLAHVTDL LPTLAELADV PLHEGTWQGR SVEPITGRSL VPMLKGAAGS VHGDAPLGYE LSGNAALFRG DYKLVRNLPP TGDGRWRLYD IKTDPGETRD LSAAMPDRFA AMLSDYRAYA RANGVLDMPA GYTADEQINR YAWEQQGRKR AIKAGLWLGG GLMALALLVW SWRRRRARG
|
| |