Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dfer_1780 |
Symbol | |
ID | 8225351 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dyadobacter fermentans DSM 18053 |
Kingdom | Bacteria |
Replicon accession | NC_013037 |
Strand | + |
Start bp | 2191087 |
End bp | 2192532 |
Gene Length | 1446 bp |
Protein Length | 481 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 644929633 |
Product | sulfatase |
Protein accession | YP_003086185 |
Protein GI | 255035564 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 38 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 30 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAAACTT ACCTGCGGGG TTTCCTGCTA CTGACTCTCA TGCTGGCCGG CGGCTTTCTC CCCAAAGCCG CCGGGCAAAA GCCCAACATC ATTTTCATCA TCGGCGACGA CATCAGCTGG GATGACATCG GGGCTTATGG GAATGCCAAA ATAAAGACAC CGAACCTGGA CAAACTGGCT AAGGAAGGGC TCCGATTCAC CAACCTTTAC CTCACCGCCA GCTCGTGCAG CCCGAGCCGG ACGAGCATTC TCACAGGCCG GTACCCGCAC AATACCGGCG CGGCGGAACT GCATTCGCCC CTGCCCGCGC ATCTCGCGTA TTTCCCCGAA CTGCTTAAAA AACAGGGTTA CTTTGCCGCA TTGGCCGGAA AATGGCACGA GGGGCCTAAC ACACGCCGGG CATACGACAC ACTGCTCGTA GATAGGAGAG CCAATGGCGA AGGCGGTGAA GCACAATGGC TTAACCTGCT CAGGGCAAGG CCTAAGGACA AGCCGTTTTT CTTCTGGCTC GCGCCATTCG ATGCGCACAG GCCGTGGTCG GCGCGCACCG AGGGGCACCA GCATGATCCG CAAACTGAAA TAGTAGTTCC GCCGACGCTG GTCGATGACA GGGAAACGCG GCAGGACCTG GCGCATTATT ACAATGAAAT TTCGCAACTC GATCATTACG TTGGTCAGCT GCGCGCGGAG CTGGTGCGGC AGGGCGTGGC CGAGAACACG ATCATCATTT TCACTGCGGA TAATGGCCGT GCATTTCCGG GCAGCAAAAC GCGGTTGTAT GATGCGGGTG TGAAAACGCC GTTTATCGTG AACTGGCCTG CGGGTATCCG CCCCGGGCAG GTTTGCGAAA GCCTGGTGAG CAGCATCGAC ATCGCGCCGA CCTTGCTGGA ACTGGCCGGT ACCCAGCCCA CAGAATCGTT CCAGGGCCTC AGCTTTGCCC AGCTTTTGAA AAACCCGGAA AAAGCATTCC GCAAATATGT TTTTGCCGAA CACAACTGGC ACGATTATGC CGCTTACGAG CGCTCGGTAC GCAGCAAGGA TTTTTTGTAT ATCATCAATA AAAGGCCTGA GCTGGACAAC GGAGGGCCTA TTGACGCCAA CCAAAGCCCA TCGGCCAAAG CGCTGAAAGC GCCCGGCAAA CTGACCGTCC TGCAAAAGGA CGCATTGCTG AAACCCCGCC CCACAGAAGA ATTTTTCGAC AACCGGAAAG ACTCGCTGCA AACGCACAAT GCAATTGCCA ACAAATCCTA CGCCGCACAA CTGGCCGAAC ACCGGGCGAT ACTCGAACAA TGGCAGCAGG AAACCGGCGA TACCGAGCCG AAATCCATCA CGCCCGACTG GTACCATCGC GAAACGGGCG AGCCGGTGGC CAGCAACGGC CAGCGCGGAG AAATTCCGGG AAGCAGCAAA AAGGCAGACC ATGTCAACCG GAAAGGGCCA TTTTAG
|
Protein sequence | MQTYLRGFLL LTLMLAGGFL PKAAGQKPNI IFIIGDDISW DDIGAYGNAK IKTPNLDKLA KEGLRFTNLY LTASSCSPSR TSILTGRYPH NTGAAELHSP LPAHLAYFPE LLKKQGYFAA LAGKWHEGPN TRRAYDTLLV DRRANGEGGE AQWLNLLRAR PKDKPFFFWL APFDAHRPWS ARTEGHQHDP QTEIVVPPTL VDDRETRQDL AHYYNEISQL DHYVGQLRAE LVRQGVAENT IIIFTADNGR AFPGSKTRLY DAGVKTPFIV NWPAGIRPGQ VCESLVSSID IAPTLLELAG TQPTESFQGL SFAQLLKNPE KAFRKYVFAE HNWHDYAAYE RSVRSKDFLY IINKRPELDN GGPIDANQSP SAKALKAPGK LTVLQKDALL KPRPTEEFFD NRKDSLQTHN AIANKSYAAQ LAEHRAILEQ WQQETGDTEP KSITPDWYHR ETGEPVASNG QRGEIPGSSK KADHVNRKGP F
|
| |