Gene Dfer_4501 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDfer_4501 
Symbol 
ID8228104 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDyadobacter fermentans DSM 18053 
KingdomBacteria 
Replicon accessionNC_013037 
Strand
Start bp5432570 
End bp5434015 
Gene Length1446 bp 
Protein Length481 aa 
Translation table11 
GC content55% 
IMG OID644932347 
Productsulfatase 
Protein accessionYP_003088867 
Protein GI255038246 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3119] Arylsulfatase A and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value0.930305 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.505642 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGAAAAT ATTTGTTGCT AATACCCCTA CTGACTTCCT CATTCCTTAC TCAACGCGCC 
GACGCGCAGG CCCCAAAGCC GCAACGCCCG AATATCGTAT TTATCCTGGC CGACGACCTT
GGTTACGGCG ACGTCGGTTT TAACGGACAG AAGCTCATCA AAACGCCCAA TATCGATAAA
CTGGCGAAGG AGGGAATGAT CTTTAACCAA TTTTACGCCG GTACATCGGT GTGTGCGCCT
TCGCGGTCGT CGCTGCTGAC GGGGCAGCAT ACCGGCCATA CGTATATCCG CGGCAATAAG
GGTGTGGAGC CGGAAGGCCA GCAGCCTATT GCCGACTCGG TGACGACGCT GGCGGAGGTG
CTCAAAAAAT CGGGGTACGT GACGGCGGCA TTTGGCAAGT GGGGGCTGGG GCCGGTTGGC
TCGGAAGGCG ATCCCAATAA GCAGGGCTTC GATCGTTTTT ATGGTTACAA CTGCCAAAGC
CTCGCGCACC GCTATTATCC GGAACACCTT TGGGATAATA GCAAAAAAAT ACTGTTGGAA
GGCAACAAAG GCCTTATTCA TAACAAGGAA TACGCGCCCG ACCTGATCCA GAAAAAGGCG
CTCAGCTTTG TGAATGCGCA GGATGGCAAG CAGCCTTTCT TCCTGTTTTT GCCCTACATT
TTACCCCACG CCGAGCTGGT GGTGCCGGAC GACAGCCTTT TCAGATATTA TAAAGGTAAG
TTCGAAGAAA AGCCGCACAA GGGCGCCGAC TATGGCCCGG GTGCTAACGG CGGCGGCTAT
GCATCACAGG ACTTTCCGCA CGCGACTTTC GCGGCGATGG TGGCGCGCCT GGACCTTTAT
GTAGGCCAGG TAATGAATGC ATTGAAGAAA AAAGGCCTTG ACAAAAATAC GCTGGTGATC
TTCACGAGCG ACAACGGCCC GCACGTCGAA GGAGGTGCCG ATCCGAGATT TTTCAACAGC
GGCGCCGGTT TCCGCGGCGT GAAGCGCGAT TTGTACGAAG GCGGCATTCG CGAGCCATTC
GCAGCCCGCT GGCCGGCCGC GATCAAGCCG GGTTCGAAAA GCGATTACAT TGGCGCATTC
TGGGATATTC TGCCCACTTT CGCCGAGCTG GCCAACGCGC CGGCCCCGCG TAACATCGAC
GGTATTTCAT TTACCGATGC ATTGAAAGGC AAGGCGATTC AGAAAAAGCA CGATTACCTC
TATTGGGAAT TTCATGAGCA AGGCGGCCGC CAGGCGGTTC GCCAGGGTAA CTGGAAGGCC
GTCCGCCTGA AAGCCGCCGG AAATCCCGAT GCATTGGTAG AGCTCTACGA TCTTTCAAAA
GACCCGCAGG AAAAGAATAA CCTCACCCCA CAGTTCCCCG AAAAAGCCAA GGAACTCGGC
CAGATCATGA ACCGCGCGCA CGTTTCATCC GCGATTTTCC CGTTTGGCAG TCTGGCGACG
AATTAA
 
Protein sequence
MRKYLLLIPL LTSSFLTQRA DAQAPKPQRP NIVFILADDL GYGDVGFNGQ KLIKTPNIDK 
LAKEGMIFNQ FYAGTSVCAP SRSSLLTGQH TGHTYIRGNK GVEPEGQQPI ADSVTTLAEV
LKKSGYVTAA FGKWGLGPVG SEGDPNKQGF DRFYGYNCQS LAHRYYPEHL WDNSKKILLE
GNKGLIHNKE YAPDLIQKKA LSFVNAQDGK QPFFLFLPYI LPHAELVVPD DSLFRYYKGK
FEEKPHKGAD YGPGANGGGY ASQDFPHATF AAMVARLDLY VGQVMNALKK KGLDKNTLVI
FTSDNGPHVE GGADPRFFNS GAGFRGVKRD LYEGGIREPF AARWPAAIKP GSKSDYIGAF
WDILPTFAEL ANAPAPRNID GISFTDALKG KAIQKKHDYL YWEFHEQGGR QAVRQGNWKA
VRLKAAGNPD ALVELYDLSK DPQEKNNLTP QFPEKAKELG QIMNRAHVSS AIFPFGSLAT
N