Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dfer_0997 |
Symbol | |
ID | 8224567 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dyadobacter fermentans DSM 18053 |
Kingdom | Bacteria |
Replicon accession | NC_013037 |
Strand | - |
Start bp | 1166055 |
End bp | 1167608 |
Gene Length | 1554 bp |
Protein Length | 517 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 644928858 |
Product | sulfatase |
Protein accession | YP_003085411 |
Protein GI | 255034790 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.224171 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.0910854 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATGAACA TCTCCATAGC CCGATGGCTC TGCTTTGCCG TCATGCTGAT CGTGTCCTTC GGCACGCGCC ACGCGCTCGC GCAATCCAAA CCGAATATCA TTGTTTTCCT CGTCGACGAT ATGGGCTGGC AGGATACCTC GGTGCCATTC TGGAATAAGC CGACCGATTT CAACCGCCGC TACCGTACGC CGAACATGGA ACGGCTTGCG CGCGAAGGCA TGAAGTTTAC CAATGCCTAC GCCATGCCCG TGTGTACGCC CACCCGCGTG AGCCTGATCA CCGGCGTGAA TGCGGCCCAC CACCGCGTGA CGCACTGGAC GTCGCCGGAT AAGGACAAAA ACACCGATTA TGCCGACAAA GCGCTCGAAG CCGTCGACTG GAACATCAAT GGTTTCAGTC CCGTTGCCGG TGTTCCGCAT ACTTTTCACG GCACAGCATT GCCCGAGGTA CTGCGGCAAA ACGGCTACTA CACCGTTCAC AGCGGCAAGG CGCATTTTGG GTCTGCCGGC ACGCCGGGTT CGGACCCTGT GAATCTTGGA TTTGAGATAA ACATCGCGGG TAGTTCCATT GGACATCCGG CCAGCTATTC GGGAAAGGCC AACTACGACA GCCCCGTGAA CGGCAAGCCT AACCGCAATG CCGTGCCCGG CCTGGAAGCC TACCACGGCA CGGACACATT CCTGAGCGAC GCCATCACGA CCGAAGCATT GAAGGCGATC GCTAAGCCGG TAGCAGAGAA AAAGCCGTTT TTTCTCTACC TCTCGCATTA CGCCGTCCAC ATTCCGCTCA CGGCCGATCC ACGCTTCCTG AACCGCTACC TCGAAGCCGG CCTGGACAGC ACCGAAGCCA AATACGCGGC ATTGGTGGAA GGAATGGACA AAAGCCTCGG CGATGTGCTC CGATACCTCG ATGAGCAGAA GATAGCCGAT AATACAGTGG TACTTTTCAT GTCCGACAAT GGCGGCCTGA GCACTTCACC CGCACGCGGC GGCAAGGCGT GGACGCATAA TCTGCCCTTG AAAGCCGGAA AAGGCTCCGT GTACGAAGGC GGCATCCGCG AGCCGATGCT CGTGCGGTGG CCCGGCGTAA CCAAAGCCGG CTCGGTAACG GAGCAATACG TGATTATTGA AGACTTTTTT CCAACCATTC TCGACATAGC TGGCGTGAAG AATGCCCGCA CCGTTCAGCA AGTGGACGGC AAGTCATTCC TGCCCATCCT GAAAAACCCC GCATTCAAAG ACGAACGCCG CGGACTGGTA TGGCACCACC CTAACCGCTG GATAGCCGCC GAAGGACCGA ATATCCATTA CGCCAGCGCA TTCCGCCAGG GCGACTGGAA GCTGATTTAC GATTACCGGC AGGCGAAACT GGAATTGTAC AACCTGCGTA CGGACATTGG CGAAGAACAT GACGTGGCTG CGTCCAATCC ATTGAAGGTG AAAGAATTGG CCAATCTGCT TTCCAGACAG CTGAAAACCT GGGGCGCACA ATGGCCGGTT TCCAAAAAAA CGGGCAAACC TGTTCCACTA CCGAACGAAC TGCCAGGCCT GTAA
|
Protein sequence | MMNISIARWL CFAVMLIVSF GTRHALAQSK PNIIVFLVDD MGWQDTSVPF WNKPTDFNRR YRTPNMERLA REGMKFTNAY AMPVCTPTRV SLITGVNAAH HRVTHWTSPD KDKNTDYADK ALEAVDWNIN GFSPVAGVPH TFHGTALPEV LRQNGYYTVH SGKAHFGSAG TPGSDPVNLG FEINIAGSSI GHPASYSGKA NYDSPVNGKP NRNAVPGLEA YHGTDTFLSD AITTEALKAI AKPVAEKKPF FLYLSHYAVH IPLTADPRFL NRYLEAGLDS TEAKYAALVE GMDKSLGDVL RYLDEQKIAD NTVVLFMSDN GGLSTSPARG GKAWTHNLPL KAGKGSVYEG GIREPMLVRW PGVTKAGSVT EQYVIIEDFF PTILDIAGVK NARTVQQVDG KSFLPILKNP AFKDERRGLV WHHPNRWIAA EGPNIHYASA FRQGDWKLIY DYRQAKLELY NLRTDIGEEH DVAASNPLKV KELANLLSRQ LKTWGAQWPV SKKTGKPVPL PNELPGL
|
| |