Gene Information |
Locus tag | Dfer_4501 |
Symbol | |
ID | 8228104 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dyadobacter fermentans DSM 18053 |
Kingdom | Bacteria |
Replicon accession | NC_013037 |
Strand | + |
Start bp | 5432570 |
End bp | 5434015 |
Gene Length | 1446 bp |
Protein Length | 481 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 644932347 |
Product | sulfatase |
Protein accession | YP_003088867 |
Protein GI | 255038246 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
|
|
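The coordinates and lengths listed above can be cross-checked against each other. The following is a minimal sketch, assuming 1-based inclusive coordinates and a complete CDS ending in a single stop codon (neither convention is stated in the record itself); the function names are hypothetical and only the four numeric values are taken from the entry.

```python
# Values copied from the Gene Information entries above.
START_BP = 5432570
END_BP = 5434015
GENE_LENGTH_BP = 1446
PROTEIN_LENGTH_AA = 481


def span_length(start: int, end: int) -> int:
    """Feature length, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one, ASSUMING a complete CDS with one stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons should hold if the record is internally consistent.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions the span 5432570 to 5434015 covers 1446 bp, which corresponds to 482 codons, that is, 481 residues plus the stop codon.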
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 0.930305 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 0.505642 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGAAAAT ATTTGTTGCT AATACCCCTA CTGACTTCCT CATTCCTTAC TCAACGCGCC GACGCGCAGG CCCCAAAGCC GCAACGCCCG AATATCGTAT TTATCCTGGC CGACGACCTT GGTTACGGCG ACGTCGGTTT TAACGGACAG AAGCTCATCA AAACGCCCAA TATCGATAAA CTGGCGAAGG AGGGAATGAT CTTTAACCAA TTTTACGCCG GTACATCGGT GTGTGCGCCT TCGCGGTCGT CGCTGCTGAC GGGGCAGCAT ACCGGCCATA CGTATATCCG CGGCAATAAG GGTGTGGAGC CGGAAGGCCA GCAGCCTATT GCCGACTCGG TGACGACGCT GGCGGAGGTG CTCAAAAAAT CGGGGTACGT GACGGCGGCA TTTGGCAAGT GGGGGCTGGG GCCGGTTGGC TCGGAAGGCG ATCCCAATAA GCAGGGCTTC GATCGTTTTT ATGGTTACAA CTGCCAAAGC CTCGCGCACC GCTATTATCC GGAACACCTT TGGGATAATA GCAAAAAAAT ACTGTTGGAA GGCAACAAAG GCCTTATTCA TAACAAGGAA TACGCGCCCG ACCTGATCCA GAAAAAGGCG CTCAGCTTTG TGAATGCGCA GGATGGCAAG CAGCCTTTCT TCCTGTTTTT GCCCTACATT TTACCCCACG CCGAGCTGGT GGTGCCGGAC GACAGCCTTT TCAGATATTA TAAAGGTAAG TTCGAAGAAA AGCCGCACAA GGGCGCCGAC TATGGCCCGG GTGCTAACGG CGGCGGCTAT GCATCACAGG ACTTTCCGCA CGCGACTTTC GCGGCGATGG TGGCGCGCCT GGACCTTTAT GTAGGCCAGG TAATGAATGC ATTGAAGAAA AAAGGCCTTG ACAAAAATAC GCTGGTGATC TTCACGAGCG ACAACGGCCC GCACGTCGAA GGAGGTGCCG ATCCGAGATT TTTCAACAGC GGCGCCGGTT TCCGCGGCGT GAAGCGCGAT TTGTACGAAG GCGGCATTCG CGAGCCATTC GCAGCCCGCT GGCCGGCCGC GATCAAGCCG GGTTCGAAAA GCGATTACAT TGGCGCATTC TGGGATATTC TGCCCACTTT CGCCGAGCTG GCCAACGCGC CGGCCCCGCG TAACATCGAC GGTATTTCAT TTACCGATGC ATTGAAAGGC AAGGCGATTC AGAAAAAGCA CGATTACCTC TATTGGGAAT TTCATGAGCA AGGCGGCCGC CAGGCGGTTC GCCAGGGTAA CTGGAAGGCC GTCCGCCTGA AAGCCGCCGG AAATCCCGAT GCATTGGTAG AGCTCTACGA TCTTTCAAAA GACCCGCAGG AAAAGAATAA CCTCACCCCA CAGTTCCCCG AAAAAGCCAA GGAACTCGGC CAGATCATGA ACCGCGCGCA CGTTTCATCC GCGATTTTCC CGTTTGGCAG TCTGGCGACG AATTAA
|
Protein sequence | MRKYLLLIPL LTSSFLTQRA DAQAPKPQRP NIVFILADDL GYGDVGFNGQ KLIKTPNIDK LAKEGMIFNQ FYAGTSVCAP SRSSLLTGQH TGHTYIRGNK GVEPEGQQPI ADSVTTLAEV LKKSGYVTAA FGKWGLGPVG SEGDPNKQGF DRFYGYNCQS LAHRYYPEHL WDNSKKILLE GNKGLIHNKE YAPDLIQKKA LSFVNAQDGK QPFFLFLPYI LPHAELVVPD DSLFRYYKGK FEEKPHKGAD YGPGANGGGY ASQDFPHATF AAMVARLDLY VGQVMNALKK KGLDKNTLVI FTSDNGPHVE GGADPRFFNS GAGFRGVKRD LYEGGIREPF AARWPAAIKP GSKSDYIGAF WDILPTFAEL ANAPAPRNID GISFTDALKG KAIQKKHDYL YWEFHEQGGR QAVRQGNWKA VRLKAAGNPD ALVELYDLSK DPQEKNNLTP QFPEKAKELG QIMNRAHVSS AIFPFGSLAT N
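The gene and protein sequences above are displayed in space-separated blocks of ten characters. As a rough sketch of how the two relate, the snippet below strips that grouping and translates the first 30 bases and the last 6 bases of the gene sequence. The codon table is deliberately partial (only the codons that occur in these two excerpts; these assignments are the same in translation table 11 and the standard code), and the helper names are hypothetical.

```python
# Partial codon table: ONLY the codons needed for the two excerpts below.
# A full translation would need all 64 codons of translation table 11.
CODONS = {
    "ATG": "M", "AGA": "R", "AAA": "K", "TAT": "Y", "TTG": "L",
    "CTA": "L", "ATA": "I", "CCC": "P", "AAT": "N", "TAA": "*",
}

# Excerpts copied from the gene sequence above (first three blocks, last six bases).
GENE_START = "ATGAGAAAAT ATTTGTTGCT AATACCCCTA"
GENE_END = "AATTAA"


def strip_blocks(seq: str) -> str:
    """Remove the whitespace used for 10-character display blocks."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate an in-frame DNA excerpt; '*' marks a stop codon."""
    dna = strip_blocks(dna)
    if len(dna) % 3 != 0:
        raise ValueError("excerpt length is not a multiple of 3")
    return "".join(CODONS[dna[i:i + 3]] for i in range(0, len(dna), 3))


if __name__ == "__main__":
    print(translate(GENE_START))  # first ten residues of the protein
    print(translate(GENE_END))    # final residue followed by the stop
```

The first excerpt yields `MRKYLLLIPL`, matching the first block of the protein sequence, and the last six bases yield `N*`, matching the final residue `N` followed by the `TAA` stop codon.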