Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dshi_3111 |
Symbol | aslA |
ID | 5710963 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dinoroseobacter shibae DFL 12 |
Kingdom | Bacteria |
Replicon accession | NC_009952 |
Strand | + |
Start bp | 3277491 |
End bp | 3279092 |
Gene Length | 1602 bp |
Protein Length | 533 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 641269038 |
Product | arylsulfatase precursor |
Protein accession | YP_001534445 |
Protein GI | 159045651 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.513855 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 53 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTCAACT TAGGAAGGCT GCGGCGCTGC GCGCTGGGGG CGGTCTTGCT GGGCATTTCG GCAAGTGTCG CCGCCGCCCA GGAGACGGAC AAGCCCAACA TCCTGGTTAT CTGGGGCGAC GATGTGGGCC AGTCGAACAT CTCGGCCTAC ACGATGGGTC TGATGGGATA CGAAACGCCC AATATCGACC GGATCGCCGA AGAAGGCATG ATCTTCACCG ACTATTATGG CGAGCAGTCC TGCACCGCGG GCCGGTCCTC CTATATCATG GGCCAGTCAG TGTTCCGCAC GGGCCTCTCC AAGGTCGGTC TGCCCGGTGC CGAAGAAGGC ATGCAGGTCG AAGACCCAAC CATCGCGGGC CTGCTGAAGG CCCAAGGCTA CGCAACCGGC CAGTTCGGCA AGAACCACCT GGGCGACCGG GATGAGCATC TGCCGACCAA CCACGGCTTC GACGAGTTCT TCGGCAACCT CTATCACCTG AACGCCGAGG AAGAGCCCGA GAACGAGGAC TACCCGGGCG ATCTCGTGCT CGAAGACGGC CGCACCTTCC GCGAGGCGTT CGGGCCCCGC GGCGTGATCA AGTCCTCCGC CGACGGTACG ATCGAAGACA CCGGCCCCCT GACCAAGGAG CGGATGGAAA CCGTGGACGA CGAGACCGTC GCCGCGGCGA TCGACTTCAT CAAGCGCCAG GAAGAGGCGG GCAACCCCTG GTTCGTCTGG TGGTCGGGCA CCCGGATGCA CTTCCGCACC CATGTCAGCG ACGAGCGTCG CCAGATGGCC AACGAAATCG TCGGCAAGTC GGTGGACGAA TACACCGCCG GCATGATCGA ACATGACATG CATATCGGTC AGTTCCTCGA CCTGCTGGAC GAGCTCGGCA TCGCCGACGA GACCATCGTG CATTACTCCA CCGACAATGG CCCGCACATG AACACATGGC CCGATGCCGC CATGACGCCG TTCTGGGGTG AGAAGAACAC CCAGTGGGAA GGCGCATGGC GCGTGCCCTC CATGGTCCGC TGGCCCGGCC TGATCGAACC CGGCTCCGTG TCGAACTCGA TCATGCACCA CATGGACTGG CTGCCCACCT ACCTGGCCGC AGCCGGGCGT CCGAACATCA AGGAAGAACT TCTCGACGGT ATAACCGTGG CCGAGGTCGG CGGCGGACGC GATTACCGCG TGCATCTGGA TGGCTATAAC TTCCTGCCCT ATTTCGCGGG CGAAGTTGAC ACCGGCCCCC GGCAGGAGAT CTTCTACTTC ACCGATGACG GGGATCTTGC GGCCCTGCGC TTCGGCGACT GGAAGATCAC CTTCCTGGAG CAGAAGGAAT GGGCGACTCT GCGCGCCTGG ATGGAGCCTC TGACGCCGCT GCGGGTGCCG CTCATCGCCA ACCTGCGCCG CGACCCCTAT GAGCGCGGGT ATCGCACGTC GAACACCTAT TACGACTGGA TGCTCGACCG GGCCTACATG CTGGTGCCCG CGCAAGCCTA CGTCGCGGAC TTCCTGGAAA CCTTCCAGGA GTATCCACCC CGGCAGGAAG CCGCCTCCTT CAGCCTCGAC AAGGTGATGG AGAAGCTGAC CGCACCCAGC GGCGCGCGCT AA
|
Protein sequence | MVNLGRLRRC ALGAVLLGIS ASVAAAQETD KPNILVIWGD DVGQSNISAY TMGLMGYETP NIDRIAEEGM IFTDYYGEQS CTAGRSSYIM GQSVFRTGLS KVGLPGAEEG MQVEDPTIAG LLKAQGYATG QFGKNHLGDR DEHLPTNHGF DEFFGNLYHL NAEEEPENED YPGDLVLEDG RTFREAFGPR GVIKSSADGT IEDTGPLTKE RMETVDDETV AAAIDFIKRQ EEAGNPWFVW WSGTRMHFRT HVSDERRQMA NEIVGKSVDE YTAGMIEHDM HIGQFLDLLD ELGIADETIV HYSTDNGPHM NTWPDAAMTP FWGEKNTQWE GAWRVPSMVR WPGLIEPGSV SNSIMHHMDW LPTYLAAAGR PNIKEELLDG ITVAEVGGGR DYRVHLDGYN FLPYFAGEVD TGPRQEIFYF TDDGDLAALR FGDWKITFLE QKEWATLRAW MEPLTPLRVP LIANLRRDPY ERGYRTSNTY YDWMLDRAYM LVPAQAYVAD FLETFQEYPP RQEAASFSLD KVMEKLTAPS GAR
|
| |