Gene Dfer_2520 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDfer_2520 
Symbol 
ID8226092 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDyadobacter fermentans DSM 18053 
KingdomBacteria 
Replicon accessionNC_013037 
Strand
Start bp3099234 
End bp3101057 
Gene Length1824 bp 
Protein Length607 aa 
Translation table11 
GC content55% 
IMG OID644930352 
Productsulfatase 
Protein accessionYP_003086903 
Protein GI255036282 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3119] Arylsulfatase A and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value0.541987 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.989615 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTATAAAC TAACCCTTAC AATTCTTCTG GCGGTGCTCA CCGCCGCCAT GTCCCCGGCC 
ACTGCGCAGG ACCGTCCGAA TATCCTCTGG ATTGTCAGCG AGGATAATAC CGTTCTGCTG
GGCAGCTACG GCGATCAATT TGCCACCACG CCCAACCTGG ACCAGTTTGC GGCCGGAAGC
ATCCGCTACA AAAATGCATT TTCGACGGCT CCCGTGTGTG CACCTTCGCG TAACACGCTC
ATTACCGGCA TGTACCCGCC ATCGCTGGGC ACGGAGCACA TGCGGAGCGT GTACCCGTCG
CCGGCATTCG TGAAGTTTTT CCCGAAATAC CTCCGGGAAG CGGGCTACTA TACCACCAAC
AATGCCAAAA AGGATTACAA CACGCCCGAC CAAACCGACG CCTGGGACGA ATCGAGCAAC
AAGGCGACTT ACAAGAACCG GAAACCGGGA CAGCCGTTTT TTGCGGTATT TAATCTGAAT
GTGTCTCACG AAAGTTCGCT GCACGAGCCA TTGCCTGCAT TGAAGCACGA TCCCGAAAAA
GTGCCGCTGC CGCCATATCA CCCGGCGACC CCGGAGCTGA AACACGACTG GGCGCAATAC
TACGATAAGC TGGAAGAAAT GGACCGGCAA TTCGGGCGCT TATTGCAGGA ATTGAAGGAC
GAAGGGCTGG CCGAAAATAC GATCGTTTTT TACTATGCCG ACAATGGCGG TGTGCTGGCG
CGCAGCAAGC GGTTTATGTA TGAATCGGGT TTGCATGTGC CGCTAATTGT CCATTTGCCG
CCGAAATACG CGCATTTGGC CAGCCAAAAA TCGGGTACGG TGTCGGACAG GCTGGTGACG
TTCCTGGATT TCGCGCCTAC GGTGCTGAGT CTGGTGGATA TTAAAGTGCC GGAATATATG
CAGGGAGGCG CATTTTTGGG CAAACAACAG AAGCCGGAGC CTCCGTATGC ATTCGGTTTC
AGGGGCAGAA TGGACGAGCG GATCGATATG TCGCGGTCGG TGCGGGACAA GAAGTTTCGG
TATATCCGGA ATTACCTGCC CAATAAAATT TATGGTCAAT ACCTGGAATA CCTGTGGCGT
GCGCCGTCGG TGAAGTCGTG GGAAGAGCTC TACAAGGCTG GAAAACTGAA TGCCGTGCAG
TCGAAATTCT GGGAAGCGAA GCCTGCCGAA GAACTTTTTG ACGTAGACGC CGATCCGCAC
AATATCAAGA ACCTGGCGGA CGATCCGAAA TATAAAAAGG ATCTGGAAAG ATTGCGGAAG
GCGAATGCGG AGTGGATGGC GAAGTACAAG GACGTAGGCT TTATCCCCGA AGCGATCATC
TACGAAATCG CCAAAAAGAC TCCTTTGTAC GATTATGCGC GGAGCGGGCA ATACAATTTC
GGGAAAATAG CCGCTACCGC CGACTTGGCG TCGTCACGCA CTGCTGCTCA CACGCAGGCG
CTCATCAAAG CCCTGGCGGA TACTGATCCG TCGGTACGGT ACTGGGGCGC GACCGGCCTC
ACGGTCTTGA AAGCAGCAGC AGGCAAAGAC GCTTTGCGGA AAGCGCTGAA AGACCCTGAA
CCCGCCGTGC GCATTGCCGC CGCCGAAGCA CTCTACGTGA CCGGTGCCGA CAAGACCGCG
GCCGTAGCGA CGCTGACCGA CGCATTGAAA AGCGATAATC CGTACGCCCG GCTGCAAGCC
CTGAATGTGC TCGACCTGGC GGGCAAAGAC GCCGCTCCGG CCATTCCGGG CGCGGAGCAA
ATCGCAGCAC AAAAGCCTGA AATGTTCGAT TACGACATTC GCGCTGCCAA AGTGCTGCTC
AATAATTTCA AAAATTCAAA GTAA
 
Protein sequence
MYKLTLTILL AVLTAAMSPA TAQDRPNILW IVSEDNTVLL GSYGDQFATT PNLDQFAAGS 
IRYKNAFSTA PVCAPSRNTL ITGMYPPSLG TEHMRSVYPS PAFVKFFPKY LREAGYYTTN
NAKKDYNTPD QTDAWDESSN KATYKNRKPG QPFFAVFNLN VSHESSLHEP LPALKHDPEK
VPLPPYHPAT PELKHDWAQY YDKLEEMDRQ FGRLLQELKD EGLAENTIVF YYADNGGVLA
RSKRFMYESG LHVPLIVHLP PKYAHLASQK SGTVSDRLVT FLDFAPTVLS LVDIKVPEYM
QGGAFLGKQQ KPEPPYAFGF RGRMDERIDM SRSVRDKKFR YIRNYLPNKI YGQYLEYLWR
APSVKSWEEL YKAGKLNAVQ SKFWEAKPAE ELFDVDADPH NIKNLADDPK YKKDLERLRK
ANAEWMAKYK DVGFIPEAII YEIAKKTPLY DYARSGQYNF GKIAATADLA SSRTAAHTQA
LIKALADTDP SVRYWGATGL TVLKAAAGKD ALRKALKDPE PAVRIAAAEA LYVTGADKTA
AVATLTDALK SDNPYARLQA LNVLDLAGKD AAPAIPGAEQ IAAQKPEMFD YDIRAAKVLL
NNFKNSK