Gene Information |
Locus tag | Dfer_1638 |
Symbol | |
ID | 8225209 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dyadobacter fermentans DSM 18053 |
Kingdom | Bacteria |
Replicon accession | NC_013037 |
Strand | - |
Start bp | 1985065 |
End bp | 1986438 |
Gene Length | 1374 bp |
Protein Length | 457 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 644929493 |
Product | sulfatase |
Protein accession | YP_003086045 |
Protein GI | 255035424 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
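The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes the start and end positions are 1-based and inclusive, which is consistent with the reported gene length of 1374 bp. It also assumes an unspliced CDS whose final codon is a stop codon, as the "Is gene spliced | No" field and the sequence itself suggest.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Amino acid count for an unspliced CDS that ends in a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


length = gene_length(1985065, 1986438)
print(length)                  # 1374
print(protein_length(length))  # 457
```

Both values agree with the Gene Length and Protein Length fields of this record.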
|
|
Plasmid Coverage information |
Num covering plasmid clones | 32 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.132093 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTGAAAA GGTTAAGTAC ATGGGCGCTG GCCCTGCTGA TCGGCGCCCG GGCGATGGGA CAATCGCCGA ACATCGTTTT CATCCTCGCC GACGACCTCG GGTATGGCGA CATCGGCGCA CATGGCCAGA AGCTCCTCCG CACACCGAAT ATCGACGCCC TGGCCAAAGA GGGTATGATT TTCACCGACA TCCACGCAGG TGCGCCCGTG TGCTCGCCTT CCCGCAGCGT GCTCATCACG GGACTGCACA CGGGGCATAC CACCATCCGC GGTAATGCGA CCATCCGGGG CGGCATTGTC GGCAACAAAG GCAAACAAAC CGTTCGTCGT GCGAACCTCG CCGCCGGCGA TTTCACCGTC GGGAAGCTGA TGGCGCAGAG CGGGTACACC ACCGCGCTGA CCGGCAAATG GCACCTCGAT GGCTACGACA CGCTCGCCAC GCCCATTCAC CGCGGTTTCG ACCAGTTTTC GGGCTGGCTG ATTGCTTATC CCGGCACGTA TGCCAATGGT TACTGGCCCG CAAAACGGTA CGTCAATGGG GTGTTGAAAG ATGTTGAGCA AAATGAAAAT GGCCGGAAGG GCTATTACGC CGACGACCTC ACAACCGACG AAAGCCTGGC GTTTCTGGCC GCGCAGAAGG ATGCGAAGAA ACCTTTTGTG CTGATGATCA ACTATAACAG TCCGCATTCG CCGCTGGATG CCGCCGACAG CTCCGCGTAC AAAGACCGGG ATTGGCCGCA GGACATGAAA ATCTACGGCG CGCAGGTGCA TCACCTCGAT GAAAATGTGG GCAAGATCAA AAAATATCTG ACCGAGAGTG GTTTAGCTAA AAACACCATC GTGTTCTTCT GCTCCGACAA CGGTCCGCGC TCGGAAGGTA CGCCGCAGCA GACGGCCATC GCCGAGTTTT TCGATTCCAA CGGCCGGCTT CGCGGATATA AGCGGGATAT GTACGAAGGC GGCATCCGCG TGCCGATGGT CGTGTGGGCG CCGGGGATTG TGAAACCGGG CAGCGTGAGC AGCGAACCGG CCTATTTCGC GGACATTATG CCTACATTCG CCGATATTGC CGGATCGAAA GTTTCTTATA CGACCGACGG CGCGAGCGTG CTGGCGTCGA TCAAGGGGAA GGCAGCATGG CAGCCGCGCT TTTTGTATTG GGAGTTTTTT GAAAAAGGCT TTGAACAGGC TGTTCGTTAC GGCAAATGGA AAGCGGTGAA AGCCAAAGGA AAGCTGGAAC TGTACGACCT GGATAAGGAC ATCAGCGAAA CGAACGACGT GTCCGCGGAC AACCCGGCGA TTGTAGCGAA AATTGAAAAC TATCTGAAAA CCAGCAGGAC GGAATCGCCA TTCTGGCCGG TGGAGGGCAA ATGA
|
Protein sequence | MLKRLSTWAL ALLIGARAMG QSPNIVFILA DDLGYGDIGA HGQKLLRTPN IDALAKEGMI FTDIHAGAPV CSPSRSVLIT GLHTGHTTIR GNATIRGGIV GNKGKQTVRR ANLAAGDFTV GKLMAQSGYT TALTGKWHLD GYDTLATPIH RGFDQFSGWL IAYPGTYANG YWPAKRYVNG VLKDVEQNEN GRKGYYADDL TTDESLAFLA AQKDAKKPFV LMINYNSPHS PLDAADSSAY KDRDWPQDMK IYGAQVHHLD ENVGKIKKYL TESGLAKNTI VFFCSDNGPR SEGTPQQTAI AEFFDSNGRL RGYKRDMYEG GIRVPMVVWA PGIVKPGSVS SEPAYFADIM PTFADIAGSK VSYTTDGASV LASIKGKAAW QPRFLYWEFF EKGFEQAVRY GKWKAVKAKG KLELYDLDKD ISETNDVSAD NPAIVAKIEN YLKTSRTESP FWPVEGK
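The relationship between the two sequences above can be sketched in a few lines of Python. This is a minimal illustration, not the database's own pipeline: the names `CODON_TABLE` and `translate` are hypothetical. It relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard code (the tables differ in permitted start codons), and it assumes the gene sequence is given in coding orientation, which holds here because it begins with ATG even though the gene lies on the minus strand.

```python
# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO_ACIDS)
)


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base block spacing used in this record)
    is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three 10-base blocks of the gene sequence in this record.
print(translate("ATGCTGAAAA GGTTAAGTAC ATGGGCGCTG"))  # MLKRLSTWAL
```

The output matches the first ten residues of the protein sequence, and the final codons of the gene (GGC AAA TGA) give the terminal "GK" followed by a stop.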