Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dfer_2052 |
Symbol | |
ID | 8225624 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dyadobacter fermentans DSM 18053 |
Kingdom | Bacteria |
Replicon accession | NC_013037 |
Strand | - |
Start bp | 2500929 |
End bp | 2502560 |
Gene Length | 1632 bp |
Protein Length | 543 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 644929889 |
Product | sulfatase |
Protein accession | YP_003086440 |
Protein GI | 255035819 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 0.430392 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTAAAAC AATACATGAC CAGTGTGTTG CTAATGGCGT TCGCTTGCCT GGCGCCGGCA GCATTTTCCC AAAAAAAGCC GAACGTGATC GTGATATTGG CCGATGACCT CGGCTATGCA GACCTGGGCT GTTACGGAGG TGAAATCCCG ACCCCAAACC TCGACAAGCT GGCACAGAGC GGCGTTCGGT TTACCAATTT TTACAACACC GCCCGCTGCT GCCCCACGCG GGCGGCCCTG CTCACCGGCG TTTACAGCCA CCAGGCGGGG ATCGGCCACA TGATGGACGA CAAGGGCGCT GACCATCCCG CATACCGGGG GCAGCTCAAT CATAACAGCG TGACGATCGC CGAAGTGATG AAAGGCGCGG GCTATTTCAC GGCGATGAGC GGCAAATGGC ACGTCGGTCA CCAGCACGGC GTGTACCCCT CCAACCGAGG CTTCGATCGG TCGCTGCACG CGCCTGCGGG AGGTTTTTAC TATGCCGGGG GTAACAATGC GAAGCTTTTC CTCAACGGAC AAGAAGTTAC GAACGACTCG ACCGCATTAC CCAAAGACTG GTATTCGACC GATCTTTGGA CGAACTACGG CTTGCGTTTT ATCGACGAGG CGCTGGCTGA AAAGAAGCCG TTTATGCTCT ATCTGGCCCA CAATGCACCT CATTTTCCGT TGCAGGCACC CGAAGAGGAC ATTGTAAAGT TTCGTGGCAA ATACCTGAAA GGCTGGGAAA AACTCCGTCA GGAGCGATAT GAAAAGCAAA TTAAACTGGG ACTGATCGAC CCGTCCTGGA AGTTGCCGCC GATCAACCCC AATGTGAAGC GTTGGGATAG CCTTAGCGAC GATGAAAAGA AGCGATATGA CGACATCATG GCCATTTATG CTGCCGTGAT CTCGCGTCTC GACAAAAGCA TTGGTGACCT GGTGGATGGC TTGAAAAAGC GAGGTGTGTT TGATAATACC GTCATTCTGT TCGTATCCGA CAACGGCGGC AATGCGGAGC CAGGCATCGA GGGGCGTTAC CAAGGCGACA AGCCGGGGAA TGCCAAATCG ACCGTATTTC TGGGCCAGGG CTGGGCGGAG GCTGCATGTA CGCCGTTTTG GGCATACAAA CACCACACGC ACGAAGGCGG GATTTCGTCG CCGGGCATCG TGTCGTGGCC TGCGGGCATT CCTACTTCTC GAAATGGCAA GTTTGAGCGC CAACCGGCTC ATATCATTGA TATTATGGCA ACGCTCGTGG ATCTTGGAAA TGCGGGCTAT CCCACCACTT ATGCCGGGCA GCCGATTCAG CCGATGGAAG GTGCGAGCCT GAAACCCGCT TTCACCGGAA AGCCTATCAA CCGCAAGAAC CCGATTTTCT GGGAACACGA AGGTAACCGC GCGATCCGCG ATGGCAAATG GAAACTTGTG GCGGAAAAAA CGGAGAAATG GCAGTTGTAC GATGTGGAGC AGGATCGCAC AGAACTGAAC GACCAGTTTG ACAAACAACC CGATGTTGCG AAGAAGCTGG TAGCGAAGTA CGAAGCATGG TACAAGCGGG TTGGTGCTGA GGAGTATGAC AAGACTTTCA AATGGTTTTA TGATTACAAC AAAGCCAAGC AGGAGCCGGG AGCGGCAGGC AATGGGAAAT AG
|
Protein sequence | MLKQYMTSVL LMAFACLAPA AFSQKKPNVI VILADDLGYA DLGCYGGEIP TPNLDKLAQS GVRFTNFYNT ARCCPTRAAL LTGVYSHQAG IGHMMDDKGA DHPAYRGQLN HNSVTIAEVM KGAGYFTAMS GKWHVGHQHG VYPSNRGFDR SLHAPAGGFY YAGGNNAKLF LNGQEVTNDS TALPKDWYST DLWTNYGLRF IDEALAEKKP FMLYLAHNAP HFPLQAPEED IVKFRGKYLK GWEKLRQERY EKQIKLGLID PSWKLPPINP NVKRWDSLSD DEKKRYDDIM AIYAAVISRL DKSIGDLVDG LKKRGVFDNT VILFVSDNGG NAEPGIEGRY QGDKPGNAKS TVFLGQGWAE AACTPFWAYK HHTHEGGISS PGIVSWPAGI PTSRNGKFER QPAHIIDIMA TLVDLGNAGY PTTYAGQPIQ PMEGASLKPA FTGKPINRKN PIFWEHEGNR AIRDGKWKLV AEKTEKWQLY DVEQDRTELN DQFDKQPDVA KKLVAKYEAW YKRVGAEEYD KTFKWFYDYN KAKQEPGAAG NGK
|
| |