Gene Dfer_2052 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDfer_2052 
Symbol 
ID8225624 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDyadobacter fermentans DSM 18053 
KingdomBacteria 
Replicon accessionNC_013037 
Strand
Start bp2500929 
End bp2502560 
Gene Length1632 bp 
Protein Length543 aa 
Translation table11 
GC content54% 
IMG OID644929889 
Productsulfatase 
Protein accessionYP_003086440 
Protein GI255035819 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3119] Arylsulfatase A and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value0.430392 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTAAAAC AATACATGAC CAGTGTGTTG CTAATGGCGT TCGCTTGCCT GGCGCCGGCA 
GCATTTTCCC AAAAAAAGCC GAACGTGATC GTGATATTGG CCGATGACCT CGGCTATGCA
GACCTGGGCT GTTACGGAGG TGAAATCCCG ACCCCAAACC TCGACAAGCT GGCACAGAGC
GGCGTTCGGT TTACCAATTT TTACAACACC GCCCGCTGCT GCCCCACGCG GGCGGCCCTG
CTCACCGGCG TTTACAGCCA CCAGGCGGGG ATCGGCCACA TGATGGACGA CAAGGGCGCT
GACCATCCCG CATACCGGGG GCAGCTCAAT CATAACAGCG TGACGATCGC CGAAGTGATG
AAAGGCGCGG GCTATTTCAC GGCGATGAGC GGCAAATGGC ACGTCGGTCA CCAGCACGGC
GTGTACCCCT CCAACCGAGG CTTCGATCGG TCGCTGCACG CGCCTGCGGG AGGTTTTTAC
TATGCCGGGG GTAACAATGC GAAGCTTTTC CTCAACGGAC AAGAAGTTAC GAACGACTCG
ACCGCATTAC CCAAAGACTG GTATTCGACC GATCTTTGGA CGAACTACGG CTTGCGTTTT
ATCGACGAGG CGCTGGCTGA AAAGAAGCCG TTTATGCTCT ATCTGGCCCA CAATGCACCT
CATTTTCCGT TGCAGGCACC CGAAGAGGAC ATTGTAAAGT TTCGTGGCAA ATACCTGAAA
GGCTGGGAAA AACTCCGTCA GGAGCGATAT GAAAAGCAAA TTAAACTGGG ACTGATCGAC
CCGTCCTGGA AGTTGCCGCC GATCAACCCC AATGTGAAGC GTTGGGATAG CCTTAGCGAC
GATGAAAAGA AGCGATATGA CGACATCATG GCCATTTATG CTGCCGTGAT CTCGCGTCTC
GACAAAAGCA TTGGTGACCT GGTGGATGGC TTGAAAAAGC GAGGTGTGTT TGATAATACC
GTCATTCTGT TCGTATCCGA CAACGGCGGC AATGCGGAGC CAGGCATCGA GGGGCGTTAC
CAAGGCGACA AGCCGGGGAA TGCCAAATCG ACCGTATTTC TGGGCCAGGG CTGGGCGGAG
GCTGCATGTA CGCCGTTTTG GGCATACAAA CACCACACGC ACGAAGGCGG GATTTCGTCG
CCGGGCATCG TGTCGTGGCC TGCGGGCATT CCTACTTCTC GAAATGGCAA GTTTGAGCGC
CAACCGGCTC ATATCATTGA TATTATGGCA ACGCTCGTGG ATCTTGGAAA TGCGGGCTAT
CCCACCACTT ATGCCGGGCA GCCGATTCAG CCGATGGAAG GTGCGAGCCT GAAACCCGCT
TTCACCGGAA AGCCTATCAA CCGCAAGAAC CCGATTTTCT GGGAACACGA AGGTAACCGC
GCGATCCGCG ATGGCAAATG GAAACTTGTG GCGGAAAAAA CGGAGAAATG GCAGTTGTAC
GATGTGGAGC AGGATCGCAC AGAACTGAAC GACCAGTTTG ACAAACAACC CGATGTTGCG
AAGAAGCTGG TAGCGAAGTA CGAAGCATGG TACAAGCGGG TTGGTGCTGA GGAGTATGAC
AAGACTTTCA AATGGTTTTA TGATTACAAC AAAGCCAAGC AGGAGCCGGG AGCGGCAGGC
AATGGGAAAT AG
 
Protein sequence
MLKQYMTSVL LMAFACLAPA AFSQKKPNVI VILADDLGYA DLGCYGGEIP TPNLDKLAQS 
GVRFTNFYNT ARCCPTRAAL LTGVYSHQAG IGHMMDDKGA DHPAYRGQLN HNSVTIAEVM
KGAGYFTAMS GKWHVGHQHG VYPSNRGFDR SLHAPAGGFY YAGGNNAKLF LNGQEVTNDS
TALPKDWYST DLWTNYGLRF IDEALAEKKP FMLYLAHNAP HFPLQAPEED IVKFRGKYLK
GWEKLRQERY EKQIKLGLID PSWKLPPINP NVKRWDSLSD DEKKRYDDIM AIYAAVISRL
DKSIGDLVDG LKKRGVFDNT VILFVSDNGG NAEPGIEGRY QGDKPGNAKS TVFLGQGWAE
AACTPFWAYK HHTHEGGISS PGIVSWPAGI PTSRNGKFER QPAHIIDIMA TLVDLGNAGY
PTTYAGQPIQ PMEGASLKPA FTGKPINRKN PIFWEHEGNR AIRDGKWKLV AEKTEKWQLY
DVEQDRTELN DQFDKQPDVA KKLVAKYEAW YKRVGAEEYD KTFKWFYDYN KAKQEPGAAG
NGK