Gene Information |
Locus tag | Dfer_1775 |
Symbol | |
ID | 8225346 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dyadobacter fermentans DSM 18053 |
Kingdom | Bacteria |
Replicon accession | NC_013037 |
Strand | + |
Start bp | 2181298 |
End bp | 2183061 |
Gene Length | 1764 bp |
Protein Length | 587 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 644929629 |
Product | sulfatase |
Protein accession | YP_003086181 |
Protein GI | 255035560 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
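The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that consistency check, assuming the start and end coordinates are 1-based and inclusive and that the final codon is an untranslated stop codon. The variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 2181298
end_bp = 2183061
reported_gene_length = 1764      # bp
reported_protein_length = 587    # aa

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not translated.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, gene_length % 3, protein_length)
print(gene_length == reported_gene_length,
      protein_length == reported_protein_length)
```

Under these assumptions the computed values agree with the reported gene and protein lengths, and the gene length is a whole number of codons.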
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 0.819016 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 0.42216 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGAACC TGAGTAAAAA CAATGCGGCG GGCTTCTGGA CGGCGGGCAT ACTGGCTGCA TTTGCGTTGG GCGCGGCCGT GCCTGCCATT ATCCCGCAGG TGTTCCAAAA CAATGTGCCT GCTGTGCAGC AGGAGTATAT CAAAGCCTTC GGAAAGCAGC CGGCAGTAAA GCCGAATGTG GCCAACCAGC CCAATATCCT GTACATCACC AGCGACCAGC ACCACTGGCT GCGCATGGGC TACAACGACC CGACGATCAA AACACCCAAC CTCGACCGGC TGGCCCGGTC GGGGAGCATT TTCGACCGGG CTTACACGGT AAACCCTGTG TGCACGCCTA CGCGTGCTTC CATGATTACG GGCATGTACC CGTCGCAGCA CGGCGCGTAT GCATTGGGCA CGAAGCTGCC GGAGACGATC CCGACGATTG CGAATTACCT GCAAGAAGTG GGATATTTTT CGGCGATTGT GGGCAAGGCA CATTTTATGC CGCTGGCCGG CAATGCTACT TATCCTTCGC TGGAAGCCTA CCCGGTCTTG CAGGATCTCG ACTTTTGGGA AAAATACCAT GGCCCGTTCT ACGGCTTCGA ACATGCCGAA CTTGCCCGGC CGCACGGCGA CGAATCGCAC GTGGGCCAGC ATTATGCAAT ATGGATGGAG CGCAAGCTCA AAAGCGAAGG CAAAGACCCG AAATCCTGGA AAAAGTGGTT CAGAAAGCCG CCAAAGGAGA AATTCAATGA ATCCAACGAG CGCATGCGAG AAATCATGGA GGAGAATGCG GCCATTGGCG AGGCGCAGTA CGGTGCCTGG AATATCCCCG AACCTTACCA TTTGAATGCC TGGATCGCCG AACAAACCAA TGCGCAGATC GACGCGGCGA TGAAGCGCAA CAAACCGTTC TTCGTTTGGG CGAGCTTCTT CGACCCGCAC CCGCCCTACC TGGTGCCCGA GCCGTGGGCC TCGATGTACA AGCCGGAGGA TATGAAGCTG CCCGAAGTGC CGGCCGACGA CCTGGACGAC ATGCCTTACC ATTACCGCAT GACGCAATCG GGCCCGAGAG GATGGGGCAA GCAGTTCGAA GAAGATGGTT TCGCCGTACA CGGGTTCACC CGCCACGAGC CCGACAAGGA GAAGCAGAAA AAGGACATGG CGCTGTACTA CGGCATGATT TCGATGATGG ATAAATATAT TGGTAACATC CTCGACCATC TCGAACAATC GGGCCAGCTT GACAACACGA TCATTGTATT CACCACCGAT CATGGTCACC ACATCGGCAC GCACCATTTG TCGGCCAAGG GCGGGTTTGC GATGGAGGAA GACCTGCGCA TTCCTTTTAT TGTATCCTGG AAAAACAAAG TCCCTGCCGG CAAACGGAAC AATGCGTTGA TTTCCGTCGT CGATTTTGCG CCCAGCTTTC TCACTTTGGC CGGCCGCGAA AAGCCTCTGA CGATGACCGG CGTGGATGTT TCGCCCGTTT GGCTGGGCAG GTCGGAGAAA GTAAGGGATT GGGTGATTGC GGAAAACCAT TTCCAGCGCA CGAAATTTTA CCAGAAATCC TACATCGAAA ACCGCTACAA GGTAACGTGG TACATGCATA GCGACGAGGG CGAGCTGTTT GACCTGCAAA ATGATCCGCA TGAATTTAAA AACCTTTGGC AAAGCAAGCA GCATCAGGAG CTGAAACTCA AACTGCTGCA CCGCGCCATG CAGGCCGATA TGGCCAAGGA AAGCTGCTGG ATGCCGCGCG TCGGCCCTGC ATAA
|
Protein sequence | MKNLSKNNAA GFWTAGILAA FALGAAVPAI IPQVFQNNVP AVQQEYIKAF GKQPAVKPNV ANQPNILYIT SDQHHWLRMG YNDPTIKTPN LDRLARSGSI FDRAYTVNPV CTPTRASMIT GMYPSQHGAY ALGTKLPETI PTIANYLQEV GYFSAIVGKA HFMPLAGNAT YPSLEAYPVL QDLDFWEKYH GPFYGFEHAE LARPHGDESH VGQHYAIWME RKLKSEGKDP KSWKKWFRKP PKEKFNESNE RMREIMEENA AIGEAQYGAW NIPEPYHLNA WIAEQTNAQI DAAMKRNKPF FVWASFFDPH PPYLVPEPWA SMYKPEDMKL PEVPADDLDD MPYHYRMTQS GPRGWGKQFE EDGFAVHGFT RHEPDKEKQK KDMALYYGMI SMMDKYIGNI LDHLEQSGQL DNTIIVFTTD HGHHIGTHHL SAKGGFAMEE DLRIPFIVSW KNKVPAGKRN NALISVVDFA PSFLTLAGRE KPLTMTGVDV SPVWLGRSEK VRDWVIAENH FQRTKFYQKS YIENRYKVTW YMHSDEGELF DLQNDPHEFK NLWQSKQHQE LKLKLLHRAM QADMAKESCW MPRVGPA
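The sequences above are printed in space-separated blocks of ten characters. The sketch below shows one way to join the blocks, compute a GC fraction, and translate codons, using only the first three blocks of the gene sequence and the first block of the protein sequence as sample input. The helper names (`join_blocks`, `gc_fraction`, `translate`) are hypothetical. The codon table uses the amino-acid assignments shared by NCBI translation tables 1 and 11, and the sketch ignores the table 11 rule for alternative start codons, which does not matter here because the gene begins with ATG.

```python
# Amino-acid assignments shared by NCBI translation tables 1 and 11,
# listed with bases in T, C, A, G order at each codon position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def join_blocks(text):
    """Remove the whitespace between the 10-character display blocks."""
    return "".join(text.split()).upper()


def gc_fraction(seq):
    """Fraction of bases in seq that are G or C."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate complete codons in frame 0; '*' marks a stop codon."""
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# Sample input: the opening blocks of the sequences in this record.
gene_start = join_blocks("ATGAAGAACC TGAGTAAAAA CAATGCGGCG")
protein_start = join_blocks("MKNLSKNNAA")

print(translate(gene_start) == protein_start)
print(round(gc_fraction(gene_start), 3))
```

Passing the complete gene sequence to the same helpers would allow the reported GC content and protein sequence to be checked against the nucleotide sequence.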
|
| |