Gene Dvul_2553 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDvul_2553 
Symbol 
ID4664189 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfovibrio vulgaris DP4 
KingdomBacteria 
Replicon accessionNC_008751 
Strand
Start bp2980469 
End bp2982031 
Gene Length1563 bp 
Protein Length520 aa 
Translation table11 
GC content61% 
IMG OID639820802 
Productsulfatase 
Protein accessionYP_967996 
Protein GI120603596 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3119] Arylsulfatase A and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.083937 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCAAAGA AAGACAACAT CAAGAACGTC ATCTTCATCA TGCTCGACAC GTTGCAGTTC 
AACTATCTTG GCTGCTACGG CAACGATGTG GTGAAGACGC CGAACCTCGA CAAGTTCGCC
CAAAACGGCT TCCTGTTCGA GAACGCCTAC AGCGAAGGGC TGCCCACCAT CCCCGTGCGG
CGCGCCATCA TGACCGGCCG TTTCACGCTG CCCTACAGCG GCTGGCGTCC GCTGACCACC
GAAGACACCT CCATCACGGA CATGCTCTGG TGCCGTGAGG TGCAGACAGC GCTGGTGTAT
GACACCCCCC CCATGCGTCT TCCCAAGTAC GGCTACTCTC GCGGGTTCGA CTACGTACGT
TTCTGCAACG GTCACGAACT GGACCACGAG ACCTTCTGCA ACGTGCCGCT TGACGAAGAG
TTCAAGGCCG AGGACTACCT TTCGCCCAAC TGGCTGAAGA AGGATGAGAA CGGCGAATAC
GACTCGTCGA GCAAGTCGCT CATCCGCGAG ACGGAATGCT ACCTGCGCCA GCGCCAGTTC
TGGGCTTCCG ATGCGGACAA CTACGCCTCC GTGGTCATCT CCGAGGCCGA CAACTGGCTG
AAGATGAAGC GCAACCCGCA GCGCCCCTTC TTCCTGTGGC TCGACTCGTT CGACCCGCAT
GAGCCCTGGG ACCCGCCGTC AATGTGGGAG AAGAAGCCCT GCCCCTACGA CCCCGACTAC
ACGGGCAACC CGCTGCTGCT CGCCCCGTGG ACCGAAATCG ACGGTGTGAT GACCGAAGAG
GAATGCGCCC ACATCCGCGC CCTCTACGCG GAGAAGGTGA CCCTCGTCGA CAAATGGCTC
GGCAAACTGT TCGACTCGCT CAAGGCGCAG GGGCTGTGGG ACGACACCAT GATCGTCATC
ACCTCCGACC ACGGACAGCC CATGGGCAGC GGCGAACACG GTCATGGACT GATGCGCAAA
TGCCGTCCGT GGCCCTACGA AGAACTGGTG CACGTGCCCC TGCTCATCCG GGTCCCCGGT
CTTGAGGGTG GCAAGCGCAT ATCGTCGTTC GTACAGAACG TGGACATCAC CGCCACGGTG
GTCGATGGGC TGGGCATGGG GCTCGAAGCC CTTGCCGAGG CCGGGCACGA AGGCATCACC
ACCTATGCAG GCGACGACAT GCACGGTATC AGCCTGTTGC CCGTCATGCG CGGCGAGACC
GACAAGGTGC GCGATTTCGC CATCGCGGGC TATTACGGCA TGTCATGGTC CATCATCGAT
CACGACTACA GCTACATCCA CTGGCTGCAG AGGGAGATCG ACACGGATTC CATGAACAAG
GTCTTCTACG ACGGCTCCGG CAAGGGCGGC AACGCCGGTG CCCAGTCTGC CAAGCTGGAG
ATGAAGGAAG AGATGTGGAC CTGCGTGCCG GGGGCCGAAG TATCCGTCCC CCACACCGAC
GAGCTGTACG ACAGGCGGAA CGACCAGTTC CAGATGAAGA ACCTCATCGG TGAGCAGCCG
GAAAAGGCCA AGGAACTTCT GCAGAAGCTC AAGCTCTTCA TCGGCGAGCT GCGCACGTCG
TAG
 
Protein sequence
MSKKDNIKNV IFIMLDTLQF NYLGCYGNDV VKTPNLDKFA QNGFLFENAY SEGLPTIPVR 
RAIMTGRFTL PYSGWRPLTT EDTSITDMLW CREVQTALVY DTPPMRLPKY GYSRGFDYVR
FCNGHELDHE TFCNVPLDEE FKAEDYLSPN WLKKDENGEY DSSSKSLIRE TECYLRQRQF
WASDADNYAS VVISEADNWL KMKRNPQRPF FLWLDSFDPH EPWDPPSMWE KKPCPYDPDY
TGNPLLLAPW TEIDGVMTEE ECAHIRALYA EKVTLVDKWL GKLFDSLKAQ GLWDDTMIVI
TSDHGQPMGS GEHGHGLMRK CRPWPYEELV HVPLLIRVPG LEGGKRISSF VQNVDITATV
VDGLGMGLEA LAEAGHEGIT TYAGDDMHGI SLLPVMRGET DKVRDFAIAG YYGMSWSIID
HDYSYIHWLQ REIDTDSMNK VFYDGSGKGG NAGAQSAKLE MKEEMWTCVP GAEVSVPHTD
ELYDRRNDQF QMKNLIGEQP EKAKELLQKL KLFIGELRTS