Gene Dfer_2166 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDfer_2166 
Symbol 
ID8225738 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDyadobacter fermentans DSM 18053 
KingdomBacteria 
Replicon accessionNC_013037 
Strand
Start bp2648781 
End bp2650274 
Gene Length1494 bp 
Protein Length497 aa 
Translation table11 
GC content56% 
IMG OID644930003 
Productsulfatase 
Protein accessionYP_003086554 
Protein GI255035933 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3119] Arylsulfatase A and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value0.696706 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAACTCC TGAACCTGTT TTTACTGACG ATATCGATCA CCTGCACCGC GCAGGCGCAA 
AAAGCGCCCG ACAAGCTTCC GAACATCGTT TACATTTACG CCGACGACCT CGGCTACGGC
GAGCTCGGCT GCTACGGCCA GCAGAAGATC AAAACGCCGA ACCTCGACCG GCTCGCGAAA
GAAGGCATCC GGTTCACGCA GCACTACACG GGCACGCCTG TATGCGCGCC TGCCCGTGCC
ATGCTCATGA CGGGCAAGCA TGCGGGACAT TCCGCCATCC GCGGCAATTT CGAACTCGGC
GGCTTCCGGG ATGAGGAGGA ACGCGGGCAA ATGCCCCTGC CGGCCAACGA GTTGACCGTC
GCCGAGCTGC TCAAACAAAA AGGCTACGCC ACCGCGCTCA CGGGCAAATG GGGCATGGGA
ATGAACAACA CCGAAGGCAC GCCTACCCGG CAGGGCTTCG ACTACTATTA CGGCTACCTC
GATCAGAAGC AGGCGCACAA CCTCTACCCG TCGCATTTGT GGGAGAACGA TCGCTGGGAT
ACGCTCGCGC AGCCCTGGCA GGACATCCAT CGGAAGCTCG ATCCGGCCAA AGCCACCGAC
GCCGATTTCG AATCTTTCAA AGGAAAAGAA TATGCGCCGG CAAAAATGAC CGAAAAGGCA
TTAGCATTTA TCGACCGGAG CAAGGCCGGC CCGTTCTTCC TGTATATGCC CTACACGCTC
CCGCACGTAT CGTTGCAGGC CCCCGATGAA TATGTCAAAA AATACATCGG ACAGTTTGAT
GAGAAGCCTT ATTACGGTGA GAAGAATTAT GCGTCCACCA AATATCCGCT ATCGACTTAC
GCGTCCATGA TCACATTCCT GGACGACCAG GTCGGTATCA TTTTGGACAA ACTGAAAGCG
CTCGGTTTGG ACGATAACAC CATCGTGATG TTCAGCAGCG ACAATGGGGC CACTTTCAAT
GGCGGTGTAA ACCCGCAATT CTTCAACAGC GTGGCAGGGC TGCGCGGATT GAAAATGGAC
GTTTACGAAG GCGGGATCCG CGAGCCGTTT ATCGTCCGGT GGCCGGGGAA AATCAAACCA
GGGCGGGTGA GCGACCACGT TTCGGCCCAA TTCGACCTCA TGCCTACCCT GGCCGAGCTC
ACCGGACAAG CCTCGCCGCC CACCGACGGC ATTTCGTTCC TCCCGGAACT GCTAGGACAA
ACCAACCGGC AGAAAAAACA CGAATTCCTC TATTTTGAAT ACCCCGAAAA AGGCGGCCAG
ATTGCCGTCC GAATGGGTGA CTGGAAGGGC GTTAAAACCG ATTTACGGAA AAACCCCGGC
AACCCGTGGC AGCTTTTCAA CCTCAAAACC GACCGCAGCG AAAGCACCGA CGTCGCCGCC
AGCCACCCCG ATATTTTGAA AAAACTCGAC CAGATCGTCA AAAGAGAGCA TGAAGAACCG
GCGAATGCGG CGTGGCAGTT TGTGATGCCG GTGATCGCGG CGAGCAGGAA ATAG
 
Protein sequence
MKLLNLFLLT ISITCTAQAQ KAPDKLPNIV YIYADDLGYG ELGCYGQQKI KTPNLDRLAK 
EGIRFTQHYT GTPVCAPARA MLMTGKHAGH SAIRGNFELG GFRDEEERGQ MPLPANELTV
AELLKQKGYA TALTGKWGMG MNNTEGTPTR QGFDYYYGYL DQKQAHNLYP SHLWENDRWD
TLAQPWQDIH RKLDPAKATD ADFESFKGKE YAPAKMTEKA LAFIDRSKAG PFFLYMPYTL
PHVSLQAPDE YVKKYIGQFD EKPYYGEKNY ASTKYPLSTY ASMITFLDDQ VGIILDKLKA
LGLDDNTIVM FSSDNGATFN GGVNPQFFNS VAGLRGLKMD VYEGGIREPF IVRWPGKIKP
GRVSDHVSAQ FDLMPTLAEL TGQASPPTDG ISFLPELLGQ TNRQKKHEFL YFEYPEKGGQ
IAVRMGDWKG VKTDLRKNPG NPWQLFNLKT DRSESTDVAA SHPDILKKLD QIVKREHEEP
ANAAWQFVMP VIAASRK