Gene Dfer_1776 details


Gene Information

Locus tag: Dfer_1776
Symbol:
ID: 8225347
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Dyadobacter fermentans DSM 18053
Kingdom: Bacteria
Replicon accession: NC_013037
Strand:
Start bp: 2183107
End bp: 2184669
Gene Length: 1563 bp
Protein Length: 520 aa
Translation table: 11
GC content: 55%
IMG OID: 644929630
Product: sulfatase
Protein accession: YP_003086182
Protein GI: 255035561
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG3119] Arylsulfatase A and related enzymes
TIGRFAM ID:
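
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes one terminal stop codon (the gene sequence in this record ends in TAA). The function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 2183107
END_BP = 2184669
GENE_LENGTH_BP = 1563
PROTEIN_LENGTH_AA = 520


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def coding_codons(length_bp: int) -> int:
    """Amino-acid codons in a CDS, assuming exactly one terminal stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


print(span_length(START_BP, END_BP))   # 1563, matches Gene Length
print(coding_codons(GENE_LENGTH_BP))   # 520, matches Protein Length
```

Under these assumptions the three fields agree: 2184669 - 2183107 + 1 = 1563 bp, and 1563 / 3 = 521 codons, of which 520 encode amino acids.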


Plasmid Coverage information

Num covering plasmid clones: 33
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value: 0.791288
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATTAAAT ATAAAACCAA ACAGTTCGCG ATCCTGGGCA CCTCGCTGGC GGTGCTTTTG 
GCTGCTACCG GATGGCAATT TGCGCCCCAA ACCGAAAAAG CCGCAAAACC CAACATCGTG
ATCGTTAACC TCGACGACCT GGGCTACGGC GACGTGGGCG CATACGGCGC CACGGCATTG
AAAACGCCCA ATATGGACCG CATCGCAAAT GGAGGGATTC GGTTTACGAA CGGCTATGCC
ACTTCCTCCA CCTGCACGCC GAGCCGGTTT GCATTGGTGA CGGGCGTGTA TCCATGGCGC
AATAAAGAAG CTAAAATCCT GCCCGGCGAT GCCCCGCTGC TGATCGATAC CGCCCAGCAA
ACCATCCCGA AAGTGCTCAA AAAAGCAGGC TATGCGACCG CTATTGTCGG AAAATGGCAT
TTGGGGCTCG GAAACGGCGA TACCGACTGG AACAAGGAAG TGAAACCGGG ACCAAACCAG
CTCGGCTTCG ATTATTCCTA CATCCTCGCC GCTACCCAGG ACCGCGTGCC CACGGTTTAC
ATTGAAAATA CGCGCGTGGT AGGCCTCGAT CCGAACGATC CGATCCGGGT GAGTTACAAG
CAGAATTTCG AAGGCGAGCC GACGGGAAAA GACAATCCGG AGCTGCTGAA AATGAAATGG
CACCACGGTC ACGACCAGAG CATCGTGAAC GGCATTTCGC GCATCGGCTA CATGAAAGGC
GGCCAGAAAG CGAAGTGGAA CGATGAGGAA ATGGCCGATC TGTTTTTGAC CAAAGCACAG
CAGTTTATCA AAGACCATAA ATCCAAACCC TTTTTCCTGT ACTACGCCAT GCAGCAGCCG
CACGTGCCCC GCACGCCGCA CCCGCGTTTC AAGGGCGTTA CCGGCATGGG ACCAAGAGGC
GACGCTATCG CCGAAGCGGA CTGGTGCCTG GGCGAATTGC TGAATACGCT GGAAAAAGAA
GGCATCCTGG AAAATACCCT CATTATCTTC ACCAGCGACA ATGGCCCGGT GGTGAACGAC
GGCTACCACG ACGACGCGGT GGAAAAACTG GGCAAGCACA AACCCGCCGG ACCGCTACGT
GGCGGCAAAT ACAGCTTGTT TGAAGCCGGT GTGCGGGTTC CATTCATTAC CTATTGGAAA
GGCACGATCA AGCCTGCGGT ATCGGATGCG GTGGTTTGCC AGCTGGATCT GCTGAGCTCG
CTCGCGCACC TGACCGGACA GGAGGCGAAG GGCCTCGACA GCCGGAATTA CCTGGATGTA
TTTTTAGGTA AAACCCAAAA AGGCCGCAGC GAGCTGATCC TCGAAGCCAG CTCGCGCACG
GCATTGCGGC AGGGCGACTG GCTGATGATA CCGCCCTACA ATGGCCCGGC GATCAATAAA
ATGGTGAATA TCGAGCTTGG TAATGCGAAA GAATATCAGC TTTATAATCT AAAAACGGAC
ATTGGCCAGC AGCATAACCT GGCCAAATCG GAGCCGGAAA GGCTGAAAAA GCTCGTTACT
GCATTTGAGC AGCTGCAACA AGGCGGGGCA AAAAGGGAGA CGGAGACGAT CAAGCTCGAA
TAA
 
Protein sequence
MIKYKTKQFA ILGTSLAVLL AATGWQFAPQ TEKAAKPNIV IVNLDDLGYG DVGAYGATAL 
KTPNMDRIAN GGIRFTNGYA TSSTCTPSRF ALVTGVYPWR NKEAKILPGD APLLIDTAQQ
TIPKVLKKAG YATAIVGKWH LGLGNGDTDW NKEVKPGPNQ LGFDYSYILA ATQDRVPTVY
IENTRVVGLD PNDPIRVSYK QNFEGEPTGK DNPELLKMKW HHGHDQSIVN GISRIGYMKG
GQKAKWNDEE MADLFLTKAQ QFIKDHKSKP FFLYYAMQQP HVPRTPHPRF KGVTGMGPRG
DAIAEADWCL GELLNTLEKE GILENTLIIF TSDNGPVVND GYHDDAVEKL GKHKPAGPLR
GGKYSLFEAG VRVPFITYWK GTIKPAVSDA VVCQLDLLSS LAHLTGQEAK GLDSRNYLDV
FLGKTQKGRS ELILEASSRT ALRQGDWLMI PPYNGPAINK MVNIELGNAK EYQLYNLKTD
IGQQHNLAKS EPERLKKLVT AFEQLQQGGA KRETETIKLE
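
The relationship between the gene sequence, the protein sequence, and the GC content field can be illustrated with a short script. This is a minimal sketch rather than the method the database itself uses. It relies on the fact that translation table 11 assigns the same amino acid to each codon as the standard genetic code and differs in which start codons it permits. The helper names (`clean`, `gc_content`, `translate`) are hypothetical, and only the first line of the gene sequence is used as sample input.

```python
# Codon table in the conventional TCAG ordering. Amino-acid assignments
# are those of the standard code, which table 11 shares.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in tens."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First line of the gene sequence from this record.
first_line = "ATGATTAAAT ATAAAACCAA ACAGTTCGCG ATCCTGGGCA CCTCGCTGGC GGTGCTTTTG"
print(translate(first_line))  # MIKYKTKQFAILGTSLAVLL
print(f"{gc_content(first_line):.2f}")
```

The 20 residues printed correspond to the first two blocks of the protein sequence above (MIKYKTKQFA ILGTSLAVLL). Passing the full gene sequence to `gc_content` would give the value to compare against the GC content field.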