Gene Information |
Locus tag | Dshi_0936 |
Symbol | |
ID | 5710627 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dinoroseobacter shibae DFL 12 |
Kingdom | Bacteria |
Replicon accession | NC_009952 |
Strand | - |
Start bp | 960784 |
End bp | 963285 |
Gene Length | 2502 bp |
Protein Length | 833 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 641266847 |
Product | sulfatase |
Protein accession | YP_001532282 |
Protein GI | 159043488 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
|
|
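The coordinate and length fields above can be cross-checked against each other. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length. The record does not state either convention, but the listed numbers are consistent with both. The function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information table.
start_bp = 960784
end_bp = 963285
gene_length_bp = 2502
protein_length_aa = 833


def span_length(start, end):
    """Length of a coordinate range, assuming 1-based inclusive endpoints."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Residues encoded by an unspliced CDS, assuming the last codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks hold for this record under the stated assumptions.
print(span_length(start_bp, end_bp) == gene_length_bp)
print(expected_protein_length(gene_length_bp) == protein_length_aa)
```

Because the gene is not spliced and is not a pseudogene, no intron or frameshift correction is needed in this check.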
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 37 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTCCACA CAAAGACGAA CCTGGCGACC GCTTTCGCCG GTCTGCTCAC GCTGGTCTCC CCGGCCTGGG CCGACACGGC GGCTGTCTCG GGCGCACCCA ACGCGACCGC CGTGATCGAC GGACGGCAAC TGCCTGCCCC GCCCGAGGCA TTCGGCGGAC AGATCGCCCA GACGGCCAGG GAATCAGAGC CCTGGTGGCC GCCCCGCATC GTCGCCCCGA AGACCGCACC GAACATCCTG CTGATCATCA CGGACGACTC GGGTTTCGGG GTGCCGAGCA CCTTCGGTGG CGTGATCCCC CATGACACGA TGGACCAACT GGCGGACGAA GGGCTGCGGT TCACCGCCAT CCATTCCACG GCGCTGTGCT CGCCGACGCG GGCCGGGCTT CTCACCGGGC GCAACCACCA CACCGTGGGC TTCGGCGTGA TTTCCGAATC CTCGACCGGC TTTCCCGGCT ACAACTCCGT TATCACCAAG GACAAGGCGA CCATCGGGCG GATCCTGCGG GACAACGGCT ACGCGACCTC CTGGTTCGGC AAGAACCACA ACACGCCCGC CTTCCAGACC AGCCTCGCCG GCCCGTTCGA TCAATGGCCG ATCGGCATGG GGTTCGAGTA TTTCTACGGC TTCGTTGGCG GCGATGCGAA CCAGTGGCAG CCCAACCTGT ATCGCAACAC CACGCCGATC TACCCGTTCG AGGATGCCGA GCCGGGGTGG AACCTGATCA CCGCGACGGC AGATGACGCG ATCACCCATA TCACGAACCT GCACCAGATC GCCCCGGACA AGCCGTTCTT CGTCAAATAC GCCCCCGGCG CCACCCACGC GCCCCATCAC CCCACCCCGG AATGGGTTGA AAAGATCGAG GCGATGAACC TGTTCGACGA CGGCTACGAG GCGCTGCGCG CGACGATCTT CGAGAACCAG AAGAAGCTGG GGCTTGTCCC GCAGGACGCC ACGCTCACCC CCTGGCCCGA GGACAAGATC CGGAAATGGG AGCAACTCTC CGATGACGAG CGGAAACTGT TCGTCCGCCA GGTCGAAGTG TTCGCCGCCT ATGCCTCCTA TTCGGACCAC GAGATCGGCC GCGTAATCGA TGCCATCGAC GCGCTGGGGG AGCTTGAGAA CACGCTGGTG ATCTACATCA ACGGCGACAA CGGCACCTCC GCCGAAGGTG GCCCCGTGGG CACGCCGAAC GAGGTCGCCT GGTTCAACGG CATCGCCGAG ATGCCCATCG AGGTCCAGAT GCAGTGGTAT GACGTCTGGG GCACCGAAGA CACCTACAAC CACATGTCGG CGGGCTGGTC CTGGGCCTTC GACACGCCCT TCGACTATTT CAAGCAGAAC GCCAACCGGC TGGGCGGCGT GCGCCAGAAC ATGGTGGTCT CCTGGCCGAA CGGCATCGAC GACAAGGGCG GCCTGCGGGA CCAGTTCCTG CATGTGATCG ACGTGGTGCC GACCATTCTG GAGGTGACCG GGATCAACGC GCCGCTGGAG GTGGACGGCA TCCTGCAACA TCCGATCGAG GGCACGAGTT TCGCCCATCT GTTCGAGGCC GCGAATGCCG ACGCGCCCAG CCCCCGCAAG ACCCAGTATT TCGAGATGAT GGGCCAGTGG GCGCTCTATC ACGAGGGCTG GCTGCTGGCG ACCAAGGTCA ACCGCATGCC GTGGGAGACA CCGGGCGTCC CGAACCCCGA CCCGCTCAAC AACCAGGTTC TGGAACTCTA TGACCTGACG ACGGACATGA ACCAGCAAAT CGACCTGGCC GAGACCAACC CCGACAAGGT CGAGGAACTC 
AAGGCGCTGT TCATCGCGGA GGCGGAGAAA TACCAGGTCT TCCCGATGGA TGCCTCGGTC ACCTCGCGCC TTGCGCAACC CCGGCCCAAC ATCACCGCCG GGCGGACCGA GTTCGTCTAT ACCCGGCCGA TGACCGGCCT GCCGCAGGGT GATTCCCCGA GCATCCTGAA CGCCTCCTAC ACGATCACCG CCGAGATCGA GGTGCCCGAA GGCGGCGCGG AAGGGATGAT CGTGACCTCC GGCGGGCGCT TTGCGGGCTA CGGCTTCTAC CTCAAGGACG GCAAGCCGGT CTTCACCTGG AACCTGCTGA ACCTCGACAG GGTCCGCTGG GAAGCCCCCG ATGCGTTGCC GCCGGGCCAG CACACCGTCG CGTTCGAGTT CCAATATGAC GGTCTCGGCG CGGGGACGCT GGCCTATGGC AGCACCAGTG GCATGGGACA GGGCGGCACC GGAACGCTCA AGGTGAACGG GATCGCGGTG GATACACGGC AGATGAAGCA GACCATCCCG GTCATCCTGC AATGGGACGA AGCCTTCGAC ATCGGCTCCG ACACGTTGAC CGGCGTGCAT GACGCCGACT ACGTGCCGCC CTTCGCCCTG ACCGCGAGAC TGGACAAGCT GACAATCAAG GTGGACCAGC CGCAACTGAC CGAGGCCGAC ATCGCCACGC TCGAGGACGC GCGCAAGCGT GCGAGCGACT GA
|
Protein sequence | MFHTKTNLAT AFAGLLTLVS PAWADTAAVS GAPNATAVID GRQLPAPPEA FGGQIAQTAR ESEPWWPPRI VAPKTAPNIL LIITDDSGFG VPSTFGGVIP HDTMDQLADE GLRFTAIHST ALCSPTRAGL LTGRNHHTVG FGVISESSTG FPGYNSVITK DKATIGRILR DNGYATSWFG KNHNTPAFQT SLAGPFDQWP IGMGFEYFYG FVGGDANQWQ PNLYRNTTPI YPFEDAEPGW NLITATADDA ITHITNLHQI APDKPFFVKY APGATHAPHH PTPEWVEKIE AMNLFDDGYE ALRATIFENQ KKLGLVPQDA TLTPWPEDKI RKWEQLSDDE RKLFVRQVEV FAAYASYSDH EIGRVIDAID ALGELENTLV IYINGDNGTS AEGGPVGTPN EVAWFNGIAE MPIEVQMQWY DVWGTEDTYN HMSAGWSWAF DTPFDYFKQN ANRLGGVRQN MVVSWPNGID DKGGLRDQFL HVIDVVPTIL EVTGINAPLE VDGILQHPIE GTSFAHLFEA ANADAPSPRK TQYFEMMGQW ALYHEGWLLA TKVNRMPWET PGVPNPDPLN NQVLELYDLT TDMNQQIDLA ETNPDKVEEL KALFIAEAEK YQVFPMDASV TSRLAQPRPN ITAGRTEFVY TRPMTGLPQG DSPSILNASY TITAEIEVPE GGAEGMIVTS GGRFAGYGFY LKDGKPVFTW NLLNLDRVRW EAPDALPPGQ HTVAFEFQYD GLGAGTLAYG STSGMGQGGT GTLKVNGIAV DTRQMKQTIP VILQWDEAFD IGSDTLTGVH DADYVPPFAL TARLDKLTIK VDQPQLTEAD IATLEDARKR ASD
|
| |
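The relationship between the gene sequence and the protein sequence can be illustrated with a small translation sketch. It assumes the gene sequence is listed in coding orientation (it begins with ATG even though the gene lies on the minus strand) and uses the standard codon assignments, which translation table 11 shares with the standard code. Only the first 30 and last 12 bases of the record are copied here; the helper names are illustrative.

```python
from itertools import product

# Standard codon assignments in TCAG order; translation table 11 uses the
# same codon-to-amino-acid mapping (it differs only in permitted start codons).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDDDEEEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the whitespace that separates 10-base groups in the record."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate an in-frame DNA sequence; '*' marks a stop codon."""
    seq = clean(seq)
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


def gc_fraction(seq):
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First 30 and last 12 bases of the gene sequence, copied from the record.
first_30 = "ATGTTCCACA CAAAGACGAA CCTGGCGACC"
last_12 = "GCGAGCGACT GA"

print(translate(first_30))  # MFHTKTNLAT
print(translate(last_12))   # ASD*
```

The two fragments translate to the first ten residues (MFHTKTNLAT) and the last three residues (ASD) of the listed protein, followed by the TGA stop codon. Applying `gc_fraction` to the full gene sequence would be the way to compare against the GC content field; the fragments alone are too short for that comparison.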