Gene Dfer_2521 details


Gene Information

Locus tag: Dfer_2521
Symbol:
ID: 8226093
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Dyadobacter fermentans DSM 18053
Kingdom: Bacteria
Replicon accession: NC_013037
Strand:
Start bp: 3101086
End bp: 3103398
Gene Length: 2313 bp
Protein Length: 770 aa
Translation table: 11
GC content: 55%
IMG OID: 644930353
Product: sulfatase
Protein accession: YP_003086904
Protein GI: 255036283
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG3119] Arylsulfatase A and related enzymes
TIGRFAM ID:
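
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends with a single stop codon; neither convention is stated in the record. The function names are hypothetical and chosen only for illustration.

```python
# Values copied from the Gene Information fields above.
START_BP = 3101086
END_BP = 3103398
GENE_LENGTH_BP = 2313
PROTEIN_LENGTH_AA = 770


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Codon count minus one, assuming a single terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP))            # 2313
    print(expected_protein_length(GENE_LENGTH_BP))  # 770
```

Under these assumptions both results agree with the listed Gene Length (2313 bp) and Protein Length (770 aa).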


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value: 0.673679
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATGGGAA AAATTTATCG TCCGTGGAAT AAATTATCGC TGGGTGCACT GGCCGTGCTC 
GCAGCGAGCG GCGTGCATGC GCAATATGCA CCGACGCCCG CTTACCAGGG CAAAATCGGT
AAAACGGTGG CGGAAACGCA GCAATCGTGG CCGGAGAAGA AAAAAGCCGC CAAAGGCTCG
CCCAACGTCG TGTGGATCTT GCTGGACGAC ATTGGTTACG GCGCCATCAG CACATTCGGC
GGGCTGATCA ACACCCCGAC GCTCGACAGC CTGGCCAACA ACGGCTTGCG CTACACCAAT
TTTCACACTA CCGCCATTTG CGCACCCACG CGGGCATCGC TGCTCACCGG CCGCAACCAG
CATTCGGTGC ATATGGGCCT ATTCCCGGAA ACGGCCATTG GCACGCCCGG CTACGACGCG
ATCATCCCGC TTGAAAAAGG TACCATCGCC GAAATCCTGA AAGACAACGG CTACAACACA
TTTGCATTGG GCAAATGGCA CATAACCCCG CTCGCGGACC TCACGCCTGC GGGGCCCTTC
AACCGCTGGC CGACAGGGCG CGGATTCGAG CAATTTTACG GGTTTCCATC GCGCGGTAGC
ATTGATCAGT GGCACCCCGA GCTCTGGGAA GGCACCCACC GCGAGCCCGA CCGGCAGGAT
GGCAAGCATT TCAACGAACT CATCGCCGAC CGGGCGATCA GCTATCTGGC GAGCCAGAAA
TCCGGCAGTC CCGACAAGCC GTTTTTCCTT TATGTGGCAA CCGGCGCGGG CCACTCGCCG
CATCAGGTTG CCAAAAAATG GTCGGATAAG TACAAGGGCA AGTTCGATGC CGGTTGGGAT
GCTTACCGGG AGCAGGTGCT GGCCAACCAG ATCAGGCTGG GCGTTGTGCC GAAAACCGCC
AAAGTGCCGC CGCGCAACCC GGGCGTGAAG GAATGGAGCT CGCTGGGCGC CGATGAGAAA
AAGCTCTACG CGCGTTTTAT GGAGACATAT GCGGGATTTT TGGAACAAAC AGACTACGAA
ATCGGGCGGG TGATTAACCA CATCCGGGAA ACGGGCGAAC TGGATAATAC CATCGTGATT
GTGTCGGTAG GGGACAATGG CGCGAGCAAG GAGGGGACTT TTGTGGGTAC GGTCAATAAT
TTCGGAACCG GCATTCCGGA GGAAGAGCGC CTGAAAAAGA ACATTGAGCA AATCGATCTC
ATCGGCACTG AGTTTTCCAA AGTAAACTAT CCGCTCGGCT GGGCCGCGGC AACCAACGTG
CCATTCCGCC ATTGGAAGCA GGATGCGAAT TCGGAAGGTG GTACGCACAA TCCGCTCATC
GTTTTTTATC CAAAGGGAAT CAAGGAAAAA GGAGGAATCC GCAACCAGTA CGGGCATGTT
TCCGACATTC TGCCTACCAC ACTCGAACTG CTGAACATCA AGGCGCCCGC CGTTTTGAAC
GGTATCAAAC AAGACCCGAT CGAGGGCACC AGCCTGGCTT ACAGCCTCAA TGACGCCGGC
GCGAAAAGCC GCCATACGGT CCAATACTAC GAGATCCGCG GCTCACGGTC GATTTACAAG
GACGGGTGGA AGGCAGGTTC TCTGCACGTG AAAGGCCAGG ATTTTGAAAA AGACAAATGG
GAGCTTTATA ATGTCAATGA AGATTTCAAC GAAACCAACG ACCTGGCAGC ATCCAATCCG
GGCAAGTTGA AGGAGCTGAA AGCATTGTTC GATAGTGAAG CTTTGAAATA CAACGTTTAC
CCGCTCAAAG ACGGAACGGA GCCATTTACA TTACCGACGG CCTATAACCA TGTGGACAGG
GTAGTGCTGT ACCCAGGCCA ATCGACGATC ATCGACATCG CCAGCCCGTT TGCATTGAAA
CGCTCGTTTT CTATCATCGC GGATGTGGAG TTGTCGGGCG TGCAGGCAGA AGGTGTGCTG
CTATCGAGAG GCGGAGCGGC GGGCGGGCTG AGCTTTTTTA TCCAAAATCA GAAACTGCAT
TTCACCTACG CCGTGGGCGA TGGCAACAAA TATGTGGTAA GTTCCGCAAA CGCCACGCTG
CCGGCCGGAA AGGCAGAACT CAAAGCAAGC GTGCAATATG ATCAGAATGG CGGCGGCGCG
GTTACTTTGT ATGTCAATAA TTCCAAGGTA GGGGAAGGGA CGCTTCCGAA AACCTCGGAT
GCATTGTACC TGCATGAAGG CGTGAACGTA GGTTTTGATG ACCTTACGCC GGTAGGGGAC
ACCTACAAAG TGCCTTTTGC ATTTACCGAT AAGATAAGGA AAGTAACCAT CGACCTCGCC
CCGGCGCAGC AGGCACATCT GGGTGCGAAA TAG
 
Protein sequence
MMGKIYRPWN KLSLGALAVL AASGVHAQYA PTPAYQGKIG KTVAETQQSW PEKKKAAKGS 
PNVVWILLDD IGYGAISTFG GLINTPTLDS LANNGLRYTN FHTTAICAPT RASLLTGRNQ
HSVHMGLFPE TAIGTPGYDA IIPLEKGTIA EILKDNGYNT FALGKWHITP LADLTPAGPF
NRWPTGRGFE QFYGFPSRGS IDQWHPELWE GTHREPDRQD GKHFNELIAD RAISYLASQK
SGSPDKPFFL YVATGAGHSP HQVAKKWSDK YKGKFDAGWD AYREQVLANQ IRLGVVPKTA
KVPPRNPGVK EWSSLGADEK KLYARFMETY AGFLEQTDYE IGRVINHIRE TGELDNTIVI
VSVGDNGASK EGTFVGTVNN FGTGIPEEER LKKNIEQIDL IGTEFSKVNY PLGWAAATNV
PFRHWKQDAN SEGGTHNPLI VFYPKGIKEK GGIRNQYGHV SDILPTTLEL LNIKAPAVLN
GIKQDPIEGT SLAYSLNDAG AKSRHTVQYY EIRGSRSIYK DGWKAGSLHV KGQDFEKDKW
ELYNVNEDFN ETNDLAASNP GKLKELKALF DSEALKYNVY PLKDGTEPFT LPTAYNHVDR
VVLYPGQSTI IDIASPFALK RSFSIIADVE LSGVQAEGVL LSRGGAAGGL SFFIQNQKLH
FTYAVGDGNK YVVSSANATL PAGKAELKAS VQYDQNGGGA VTLYVNNSKV GEGTLPKTSD
ALYLHEGVNV GFDDLTPVGD TYKVPFAFTD KIRKVTIDLA PAQQAHLGAK
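
Both sequences are printed in space-separated blocks of ten characters, so they need to be joined before any computation. The following is a minimal sketch of that cleanup plus a GC-fraction calculation of the kind that could underlie the GC content field; the helper names are hypothetical, and how the database itself computes or rounds GC content is not stated in the record.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C; raises on an empty sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


if __name__ == "__main__":
    # First row of the gene sequence, used only as a short demonstration.
    first_row = "ATGATGGGAA AAATTTATCG TCCGTGGAAT AAATTATCGC TGGGTGCACT GGCCGTGCTC"
    seq = clean_sequence(first_row)
    print(len(seq))
    print(round(gc_fraction(seq), 3))
```

Passing the full gene sequence through `clean_sequence` should yield a string whose length matches the Gene Length field, which is a quick check that no blocks were lost in copying.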