Gene Dfer_2233 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDfer_2233 
Symbol 
ID8225805 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDyadobacter fermentans DSM 18053 
KingdomBacteria 
Replicon accessionNC_013037 
Strand
Start bp2741036 
End bp2742571 
Gene Length1536 bp 
Protein Length511 aa 
Translation table11 
GC content52% 
IMG OID644930069 
Productsulfatase 
Protein accessionYP_003086620 
Protein GI255035999 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3119] Arylsulfatase A and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones39 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAAAT ACATTTTAAG TGTGATTGCC CTGCTGGGAA GCATTTCCAG CCCATTTGCC 
CAAACGGCGG AACGCCCGAA CATCGTCCTG ATTTTCTCGG ATGACCATGC CTACCAGTCG
ATCGGTGCGT ATGGCAACAA GATCGCCAAA ACCCCGAACA TCGACCGCAT TGCACGCGAA
GGCGCGATTT TGACCAATAA CCTCGTGGCA AACTCTATTT GTGGGCCCAG CCGGGCCACT
TTGCTTACCG GGAAATACAG CCACATGAAC GGCTACAAGA AAAACGACCG GACGCTGTTC
GACACCTCTC AGCTGCTCTT CCCGAAGGAG CTGCAAAAGA GCGGTTATCA AACGGCGTGG
GTCGGAAAAC TGCATTTGAA CAGCCTGCCG GTGGGTTTCG ACTATTGGAA CATCTTGCCG
GGGCAAGGCA TTTACTACAA TCCCGAGTTC ATCACCGCGC CGCACGACAC CACGCGGAAG
ATCGGCTACG TGGCGGACAT TATCACGCAA AGTTCCCTCG AATGGCTCGA CAAGCGCGAC
GACAAAAAGC CGTTTTTCCT GGTGGTAGGC CAAAAATCGG TGCACCGCGG ATGGCAGCCC
GATTTGCAGG ACCTGGGCGC TTACGACGAC ATTGATTTCC CGCTTCCCGA AACGTTTTAC
GACAATTACG AAGGCCGCGT AGCTGCGAAA GACCAGCAAA TGTCGATCGA AAAGACCATG
ACCCTGAAAC AGGATTTGAA AGTACACCTG GACTACGACC GCGTGCCGGG CTACAAATTC
TTCACCGACG CGCAGAAGAA AACCTTCCGC GACTACTATG ATAAAATCAG CAAGGAATTC
GACGACAAAA AGCTGACCGG CAAAGCATTG ACGGAATGGA AATACCAGCG CTACATGAAA
GACTACCTGG CAACGGCCAA TGGTTTAGAC CGGAACATCG GCAAAATCCT GGATTATCTT
GACAAAACCG GCCTATCGAA AAACACCGTC GTGATTTACG CCTCCGACCA AGGCTTCTAC
CTCGGCGAGC ACGGCTGGTT CGACAAGCGC TTTATCTACG AAGAATCGCT CAAAACGCCG
TTTGTGATCC GCTACCCGGG CGTGATCAAG CCGGGAACGA AGGTGGACAA CCTGATTTCG
AACATCGACT GGGCGCCGAC GATCCTCGAT TTGGCACATA CCAAAATCCC GTCGGAGATT
CAGGGAAAAT CGTTTCTGCC CCTGCTCGAC AAGAATGCAA AAGCCAGCAC GCCGTGGCGT
GACGCAGCGT ATTACCACTA TTACGAATTC CCTGACTTCC ATCACGTTTA CCCGCATTTT
GGCCTGAAAA CAAAACGCTA CAAACTGGTG AGATTTTACG GCGGCGCGGA TAGCTGGGAA
TTGTTCGACC TAGAAAAAGA TCCGCATGAA CTTAAAAACG TGTACGCCGA CAAGGCAAAT
GCGGCCGTGG TGAAAGACCT GAAAGAGAAG TTAAAAACGC TGATCGTTCA ATACAAGGAC
GACGAAGCGC TAGCGCTTTT CAATGCTGCC AAATAA
 
Protein sequence
MKKYILSVIA LLGSISSPFA QTAERPNIVL IFSDDHAYQS IGAYGNKIAK TPNIDRIARE 
GAILTNNLVA NSICGPSRAT LLTGKYSHMN GYKKNDRTLF DTSQLLFPKE LQKSGYQTAW
VGKLHLNSLP VGFDYWNILP GQGIYYNPEF ITAPHDTTRK IGYVADIITQ SSLEWLDKRD
DKKPFFLVVG QKSVHRGWQP DLQDLGAYDD IDFPLPETFY DNYEGRVAAK DQQMSIEKTM
TLKQDLKVHL DYDRVPGYKF FTDAQKKTFR DYYDKISKEF DDKKLTGKAL TEWKYQRYMK
DYLATANGLD RNIGKILDYL DKTGLSKNTV VIYASDQGFY LGEHGWFDKR FIYEESLKTP
FVIRYPGVIK PGTKVDNLIS NIDWAPTILD LAHTKIPSEI QGKSFLPLLD KNAKASTPWR
DAAYYHYYEF PDFHHVYPHF GLKTKRYKLV RFYGGADSWE LFDLEKDPHE LKNVYADKAN
AAVVKDLKEK LKTLIVQYKD DEALALFNAA K