Gene Dfer_0997 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDfer_0997 
Symbol 
ID8224567 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDyadobacter fermentans DSM 18053 
KingdomBacteria 
Replicon accessionNC_013037 
Strand
Start bp1166055 
End bp1167608 
Gene Length1554 bp 
Protein Length517 aa 
Translation table11 
GC content57% 
IMG OID644928858 
Productsulfatase 
Protein accessionYP_003085411 
Protein GI255034790 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3119] Arylsulfatase A and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.224171 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.0910854 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATGAACA TCTCCATAGC CCGATGGCTC TGCTTTGCCG TCATGCTGAT CGTGTCCTTC 
GGCACGCGCC ACGCGCTCGC GCAATCCAAA CCGAATATCA TTGTTTTCCT CGTCGACGAT
ATGGGCTGGC AGGATACCTC GGTGCCATTC TGGAATAAGC CGACCGATTT CAACCGCCGC
TACCGTACGC CGAACATGGA ACGGCTTGCG CGCGAAGGCA TGAAGTTTAC CAATGCCTAC
GCCATGCCCG TGTGTACGCC CACCCGCGTG AGCCTGATCA CCGGCGTGAA TGCGGCCCAC
CACCGCGTGA CGCACTGGAC GTCGCCGGAT AAGGACAAAA ACACCGATTA TGCCGACAAA
GCGCTCGAAG CCGTCGACTG GAACATCAAT GGTTTCAGTC CCGTTGCCGG TGTTCCGCAT
ACTTTTCACG GCACAGCATT GCCCGAGGTA CTGCGGCAAA ACGGCTACTA CACCGTTCAC
AGCGGCAAGG CGCATTTTGG GTCTGCCGGC ACGCCGGGTT CGGACCCTGT GAATCTTGGA
TTTGAGATAA ACATCGCGGG TAGTTCCATT GGACATCCGG CCAGCTATTC GGGAAAGGCC
AACTACGACA GCCCCGTGAA CGGCAAGCCT AACCGCAATG CCGTGCCCGG CCTGGAAGCC
TACCACGGCA CGGACACATT CCTGAGCGAC GCCATCACGA CCGAAGCATT GAAGGCGATC
GCTAAGCCGG TAGCAGAGAA AAAGCCGTTT TTTCTCTACC TCTCGCATTA CGCCGTCCAC
ATTCCGCTCA CGGCCGATCC ACGCTTCCTG AACCGCTACC TCGAAGCCGG CCTGGACAGC
ACCGAAGCCA AATACGCGGC ATTGGTGGAA GGAATGGACA AAAGCCTCGG CGATGTGCTC
CGATACCTCG ATGAGCAGAA GATAGCCGAT AATACAGTGG TACTTTTCAT GTCCGACAAT
GGCGGCCTGA GCACTTCACC CGCACGCGGC GGCAAGGCGT GGACGCATAA TCTGCCCTTG
AAAGCCGGAA AAGGCTCCGT GTACGAAGGC GGCATCCGCG AGCCGATGCT CGTGCGGTGG
CCCGGCGTAA CCAAAGCCGG CTCGGTAACG GAGCAATACG TGATTATTGA AGACTTTTTT
CCAACCATTC TCGACATAGC TGGCGTGAAG AATGCCCGCA CCGTTCAGCA AGTGGACGGC
AAGTCATTCC TGCCCATCCT GAAAAACCCC GCATTCAAAG ACGAACGCCG CGGACTGGTA
TGGCACCACC CTAACCGCTG GATAGCCGCC GAAGGACCGA ATATCCATTA CGCCAGCGCA
TTCCGCCAGG GCGACTGGAA GCTGATTTAC GATTACCGGC AGGCGAAACT GGAATTGTAC
AACCTGCGTA CGGACATTGG CGAAGAACAT GACGTGGCTG CGTCCAATCC ATTGAAGGTG
AAAGAATTGG CCAATCTGCT TTCCAGACAG CTGAAAACCT GGGGCGCACA ATGGCCGGTT
TCCAAAAAAA CGGGCAAACC TGTTCCACTA CCGAACGAAC TGCCAGGCCT GTAA
 
Protein sequence
MMNISIARWL CFAVMLIVSF GTRHALAQSK PNIIVFLVDD MGWQDTSVPF WNKPTDFNRR 
YRTPNMERLA REGMKFTNAY AMPVCTPTRV SLITGVNAAH HRVTHWTSPD KDKNTDYADK
ALEAVDWNIN GFSPVAGVPH TFHGTALPEV LRQNGYYTVH SGKAHFGSAG TPGSDPVNLG
FEINIAGSSI GHPASYSGKA NYDSPVNGKP NRNAVPGLEA YHGTDTFLSD AITTEALKAI
AKPVAEKKPF FLYLSHYAVH IPLTADPRFL NRYLEAGLDS TEAKYAALVE GMDKSLGDVL
RYLDEQKIAD NTVVLFMSDN GGLSTSPARG GKAWTHNLPL KAGKGSVYEG GIREPMLVRW
PGVTKAGSVT EQYVIIEDFF PTILDIAGVK NARTVQQVDG KSFLPILKNP AFKDERRGLV
WHHPNRWIAA EGPNIHYASA FRQGDWKLIY DYRQAKLELY NLRTDIGEEH DVAASNPLKV
KELANLLSRQ LKTWGAQWPV SKKTGKPVPL PNELPGL