Gene Dde_0462 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDde_0462 
Symbol 
ID3756563 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfovibrio desulfuricans subsp. desulfuricans str. G20 
KingdomBacteria 
Replicon accessionNC_007519 
Strand
Start bp472423 
End bp473985 
Gene Length1563 bp 
Protein Length520 aa 
Translation table11 
GC content57% 
IMG OID637781322 
Productsulfatase, putative 
Protein accessionYP_386958 
Protein GI78355509 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3119] Arylsulfatase A and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCTTCAG AATCCATAAA GAACGTCATC TTCATCATGC TGGATACTCT GCAGTTCAAC 
TATCTGGGCT GCTACGGCAA TAAAGAGGTG AAGACGCCTA ATCTGGACCG TTTTGCCGGG
GAAGGCTTTC TTTTTGAAAA CGCCTACAGC GAAGGGCTGC CGACCATCCC TGTGCGCCGT
GCGCTGATGA CCGGACGCTA CACGCTGCCT TACGGCGGCT GGAAGCCGCT GGACCCCGAG
GATACCACGC TGACCGATAT TCTGTGGTGC CGCGAGGTGC AGACGGCTCT GGTATACGAT
ACGCCGCCCA TGCGGCTGCC CAAATACGGC TATTCGCGCG GGTTTGACTA TGTGCGGTTC
TGCAACGGGC ACGAACTGGA TCACGAGACC TTCAGCAAGG TACCGCTTTC TGAAAAGTTC
AAGCCGGAAG ACTATGTTTC GCCCAACTGG CTTACGCGTG ATGCGGACGG CGAACTGGAC
AGCTCCGGCA AATCGCTGCT GCGCGAGACG GAATGCTATC TGCGTCAGCG GCAGAAGTGG
CAGTCGGACG CGGACAACTA TGTGGCTGTG GTTGCCCGCG AAAGTGATGA CTGGCTGCGG
ACCAAGCGCG ATCCCGAACG TCCGTTCTTT TTGTGGGTTG ATTCCTTTGA CCCGCACGAA
CCGTGGGATC CGCCTTCGGT GTGGGAAGGC CGCCCCTGCC CGTACGATCC CGACTACACG
GGCAATCCGC TGCTGCTGGC GCCGTGGTCC GAAGTGGCCG GTGTGCTGAC TGAAGAAGAA
TGCAGGCATA TCCGCGCCCT GTACGCCGAA AAGGTGACTC TGGTGGACAA GTGGATGGGC
CGTCTGTTTG ATTCTCTGAA AGAACAGGGG CTGTGGGACA ACACCATGGT TGTGGTCACA
TCGGACCACG GACAGCCCAT GGGCGAAGGC GAACACGGCC ACGGTATCAT GCGCAAGTGC
CGTCCGTGGC CCTATGAAGA GCTGGTGCAT GTGCCCCTGA TGGTGCATGT CCCCGGTCTG
AAAGGCGGTA AGCGTATCAG CAGTTTTGTG CAGAACGTGG ACATATCGGC CACCATCATG
GACGCGCTCG GATACTATAA TACGGAAGCC CTGCATGATG CGGGGCACGA AAGCATAAGC
ACCTACGATG CGGAAGATAT GCACGGCATC AGCCTGCTTC CGGTCATGCG CGGAGAAACG
CAGACCGTGC GCGATTTTGC CATTGCCGGG TACTATGGCA TGTCATGGTC TATTATAACA
CATGATTACA GCTATATACA TTGGATAACG CGCGAGATCG ACACAGACTC CATGAACAAG
ATATTCTACG ACGGTTCCGG CAAGGGCGGT AACGCCGGCA GGCAGTCCGC CCAGCTGGAG
GTCAAGGAAG AGATGTGGAC CTGTGTGCCC GGTGCCGAGG TGGCATTGCC GCAGCAGGAC
GAGCTGTACG ACCGGAAAAA CGATCCGTTC CAGCTTAACA ACATCATCGC CGAACAGCCG
GAAAAAGCGA AAGAGCTGCT GCAGCAGCTC AAGCTGTACA TCGGCGAGCT GAGAACAACC
TGA
 
Protein sequence
MASESIKNVI FIMLDTLQFN YLGCYGNKEV KTPNLDRFAG EGFLFENAYS EGLPTIPVRR 
ALMTGRYTLP YGGWKPLDPE DTTLTDILWC REVQTALVYD TPPMRLPKYG YSRGFDYVRF
CNGHELDHET FSKVPLSEKF KPEDYVSPNW LTRDADGELD SSGKSLLRET ECYLRQRQKW
QSDADNYVAV VARESDDWLR TKRDPERPFF LWVDSFDPHE PWDPPSVWEG RPCPYDPDYT
GNPLLLAPWS EVAGVLTEEE CRHIRALYAE KVTLVDKWMG RLFDSLKEQG LWDNTMVVVT
SDHGQPMGEG EHGHGIMRKC RPWPYEELVH VPLMVHVPGL KGGKRISSFV QNVDISATIM
DALGYYNTEA LHDAGHESIS TYDAEDMHGI SLLPVMRGET QTVRDFAIAG YYGMSWSIIT
HDYSYIHWIT REIDTDSMNK IFYDGSGKGG NAGRQSAQLE VKEEMWTCVP GAEVALPQQD
ELYDRKNDPF QLNNIIAEQP EKAKELLQQL KLYIGELRTT