Gene Information |
Locus tag | Dfer_2520 |
Symbol | |
ID | 8226092 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dyadobacter fermentans DSM 18053 |
Kingdom | Bacteria |
Replicon accession | NC_013037 |
Strand | - |
Start bp | 3099234 |
End bp | 3101057 |
Gene Length | 1824 bp |
Protein Length | 607 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 644930352 |
Product | sulfatase |
Protein accession | YP_003086903 |
Protein GI | 255036282 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
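
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes 1-based, inclusive coordinates (an assumption about this record's convention, though it does reproduce the listed gene length) and a single terminal stop codon that is not counted in the protein length. The function names are illustrative and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


# Values copied from the Start bp and End bp fields of this record.
length_bp = gene_length(3099234, 3101057)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1824 607
```

Both results agree with the Gene Length (1824 bp) and Protein Length (607 aa) fields.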
| |
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 0.541987 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 0.989615 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTATAAAC TAACCCTTAC AATTCTTCTG GCGGTGCTCA CCGCCGCCAT GTCCCCGGCC ACTGCGCAGG ACCGTCCGAA TATCCTCTGG ATTGTCAGCG AGGATAATAC CGTTCTGCTG GGCAGCTACG GCGATCAATT TGCCACCACG CCCAACCTGG ACCAGTTTGC GGCCGGAAGC ATCCGCTACA AAAATGCATT TTCGACGGCT CCCGTGTGTG CACCTTCGCG TAACACGCTC ATTACCGGCA TGTACCCGCC ATCGCTGGGC ACGGAGCACA TGCGGAGCGT GTACCCGTCG CCGGCATTCG TGAAGTTTTT CCCGAAATAC CTCCGGGAAG CGGGCTACTA TACCACCAAC AATGCCAAAA AGGATTACAA CACGCCCGAC CAAACCGACG CCTGGGACGA ATCGAGCAAC AAGGCGACTT ACAAGAACCG GAAACCGGGA CAGCCGTTTT TTGCGGTATT TAATCTGAAT GTGTCTCACG AAAGTTCGCT GCACGAGCCA TTGCCTGCAT TGAAGCACGA TCCCGAAAAA GTGCCGCTGC CGCCATATCA CCCGGCGACC CCGGAGCTGA AACACGACTG GGCGCAATAC TACGATAAGC TGGAAGAAAT GGACCGGCAA TTCGGGCGCT TATTGCAGGA ATTGAAGGAC GAAGGGCTGG CCGAAAATAC GATCGTTTTT TACTATGCCG ACAATGGCGG TGTGCTGGCG CGCAGCAAGC GGTTTATGTA TGAATCGGGT TTGCATGTGC CGCTAATTGT CCATTTGCCG CCGAAATACG CGCATTTGGC CAGCCAAAAA TCGGGTACGG TGTCGGACAG GCTGGTGACG TTCCTGGATT TCGCGCCTAC GGTGCTGAGT CTGGTGGATA TTAAAGTGCC GGAATATATG CAGGGAGGCG CATTTTTGGG CAAACAACAG AAGCCGGAGC CTCCGTATGC ATTCGGTTTC AGGGGCAGAA TGGACGAGCG GATCGATATG TCGCGGTCGG TGCGGGACAA GAAGTTTCGG TATATCCGGA ATTACCTGCC CAATAAAATT TATGGTCAAT ACCTGGAATA CCTGTGGCGT GCGCCGTCGG TGAAGTCGTG GGAAGAGCTC TACAAGGCTG GAAAACTGAA TGCCGTGCAG TCGAAATTCT GGGAAGCGAA GCCTGCCGAA GAACTTTTTG ACGTAGACGC CGATCCGCAC AATATCAAGA ACCTGGCGGA CGATCCGAAA TATAAAAAGG ATCTGGAAAG ATTGCGGAAG GCGAATGCGG AGTGGATGGC GAAGTACAAG GACGTAGGCT TTATCCCCGA AGCGATCATC TACGAAATCG CCAAAAAGAC TCCTTTGTAC GATTATGCGC GGAGCGGGCA ATACAATTTC GGGAAAATAG CCGCTACCGC CGACTTGGCG TCGTCACGCA CTGCTGCTCA CACGCAGGCG CTCATCAAAG CCCTGGCGGA TACTGATCCG TCGGTACGGT ACTGGGGCGC GACCGGCCTC ACGGTCTTGA AAGCAGCAGC AGGCAAAGAC GCTTTGCGGA AAGCGCTGAA AGACCCTGAA CCCGCCGTGC GCATTGCCGC CGCCGAAGCA CTCTACGTGA CCGGTGCCGA CAAGACCGCG GCCGTAGCGA CGCTGACCGA CGCATTGAAA AGCGATAATC CGTACGCCCG GCTGCAAGCC CTGAATGTGC TCGACCTGGC GGGCAAAGAC GCCGCTCCGG CCATTCCGGG CGCGGAGCAA ATCGCAGCAC AAAAGCCTGA AATGTTCGAT TACGACATTC GCGCTGCCAA AGTGCTGCTC 
AATAATTTCA AAAATTCAAA GTAA
|
Protein sequence | MYKLTLTILL AVLTAAMSPA TAQDRPNILW IVSEDNTVLL GSYGDQFATT PNLDQFAAGS IRYKNAFSTA PVCAPSRNTL ITGMYPPSLG TEHMRSVYPS PAFVKFFPKY LREAGYYTTN NAKKDYNTPD QTDAWDESSN KATYKNRKPG QPFFAVFNLN VSHESSLHEP LPALKHDPEK VPLPPYHPAT PELKHDWAQY YDKLEEMDRQ FGRLLQELKD EGLAENTIVF YYADNGGVLA RSKRFMYESG LHVPLIVHLP PKYAHLASQK SGTVSDRLVT FLDFAPTVLS LVDIKVPEYM QGGAFLGKQQ KPEPPYAFGF RGRMDERIDM SRSVRDKKFR YIRNYLPNKI YGQYLEYLWR APSVKSWEEL YKAGKLNAVQ SKFWEAKPAE ELFDVDADPH NIKNLADDPK YKKDLERLRK ANAEWMAKYK DVGFIPEAII YEIAKKTPLY DYARSGQYNF GKIAATADLA SSRTAAHTQA LIKALADTDP SVRYWGATGL TVLKAAAGKD ALRKALKDPE PAVRIAAAEA LYVTGADKTA AVATLTDALK SDNPYARLQA LNVLDLAGKD AAPAIPGAEQ IAAQKPEMFD YDIRAAKVLL NNFKNSK
|
| |
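
The sequences above are displayed in space-separated blocks of ten characters, so they need light cleaning before any computation. The following is a minimal sketch, assuming plain Python with no external libraries. The helper names are hypothetical, and the two fragments are the first and last blocks of the gene sequence as shown in this record. Passing the full gene sequence to `gc_content` is how one would compare against the listed GC content; that result is not asserted here.

```python
# Stop codons under translation table 11 (bacterial, archaeal, plant plastid).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean_sequence(raw: str) -> str:
    """Remove the display spacing and line breaks, and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First and last blocks of the gene sequence, copied from the record.
head = clean_sequence("ATGTATAAAC")
tail = clean_sequence("AATAATTTCA AAAATTCAAA GTAA")

print(head[:3])                  # start codon
print(tail[-3:] in STOP_CODONS)  # whether the final codon is a stop codon
print(gc_content(tail))          # GC fraction of the tail fragment only
```

The displayed gene sequence begins with ATG and ends with TAA, which suggests it is shown in coding orientation even though the gene lies on the minus strand of the replicon.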