Gene Dvul_0021 details

Gene Information

Locus tag: Dvul_0021
Symbol:
ID: 4662593
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfovibrio vulgaris DP4
Kingdom: Bacteria
Replicon accession: NC_008751
Strand:
Start bp: 32099
End bp: 33673
Gene Length: 1575 bp
Protein Length: 524 aa
Translation table: 11
GC content: 58%
IMG OID: 639818214
Product: sulfatase
Protein accession: YP_965472
Protein GI: 120601072
COG category: [R] General function prediction only
COG ID: [COG2194] Predicted membrane-associated, metal-dependent hydrolase
TIGRFAM ID:
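
The coordinates and lengths listed above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon; the function and constant names are illustrative and not part of the record.

```python
# Values copied from the gene record above.
START_BP = 32099
END_BP = 33673
GENE_LENGTH_BP = 1575
PROTEIN_LENGTH_AA = 524


def span_length(start, end):
    """Length of a 1-based, inclusive coordinate range (assumed convention)."""
    return end - start + 1


def expected_protein_length(gene_length_bp):
    """Number of codons in the CDS minus one stop codon.

    Assumes the reported gene length includes the terminal stop codon.
    """
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Both values should agree with the record's reported lengths.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 33673 - 32099 + 1 gives 1575 bp, and 1575 / 3 - 1 gives 524 aa, matching the reported values.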


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGAGCTT CCCGCTTCGG TTCCGTCGTG TTCGTCGTCA TCTCCCTGCT TGCAGCCCTA 
TGCTACAACC CGCAGCTTTA TGCGACCGGT GTGTCTCCGC TGGCTGCCGT GCCCGTGCTG
TTGCTGCACC TTGCCCTTGC GGTCACTTTC GGGGGTATTC CCGGACTTCG CTGGGGATAT
TTCCTCTTTC TCCACGTTGC GGGGTGTGCC CTCACCTATT TCTACACACT GTTCGACGTG
TCGATGAGCT ACGACACGCT GGCATGGCTG CTTGAAACCA ACCGGGCGGA AGTCGATAGT
TTTGTCACAT GGCCGCTCTT TCTGTTGCTG TGCGCAGGGA TAGCGGTCAG CCTTGTCCAC
AGCTCTCTCT CGCAGTCATT CAGGGCGAGT CTCGCCGGAA GGCGGTATGC GGCGATGGCA
GGACTGCTTT TGCTGTGTGC ACTCGTGAGT GCCGAAGGGG GACGTGCCGC GTTCAAGCAT
CTTGGCGGCA AGGCATCCTA TGAAGCGCTT TCCATGAGTC GCATCGTTCC CGTCTCCGTC
GTCAAGGCTG CTGCGGCGTA TGTCAGGGAA GAGCGCCGGT CCGACACGCT CGCGGCGCTG
CCAGACCCGG CGGATGCCGC TTCGTCGCTG GTTATCCCGG ACCATGAGAA GCCGGTGGTG
GTGTTCATCA TCGGTGAAAG CGCCCGCGCC GATCATTTTG GCATCAACGG CTACGAGCGG
GATACGACGC CGCGTCTTTC TGTCGAAAGG AACATCATCA ATTACGGGGT CTGCCGGTCG
TTTGCGAACA CGACGAGGAT TTCCCTCATC GGCATACTGA CCGATGCCAC CATTGAAGAC
AGGACGCCCC GGCATGGTTC CTTCATCAGC CTCTTCAACA AGCACGGATA CGATACGGCT
TTCTTTTCGC GCCAGAACAG GCTCGGTCGT AGCGGTCACC TCACAGACGC CCTTGTCTCA
GGTGCGGGCA CTGTGGAGTA CCTCAAGGGC GTCGACCGGG ACCTTGTGAC CCGCCTGCAG
GCGTTCGTGG CAAATGCAGG GGGCAAACTT GTCGTGCTCC ATACGCAGGG GTCGCATTTT
TCATACAACC AGCAGTACGC GGCGGAACAT CGTCGCTTCA TGCCCGATAC GTATAGTAAT
GAGGCGATGC CGCGCGACAT CGCGAATGTC ATTAATGCCT ATGACAACAG CATCGTGAAG
ACCGATGCGA TGATCGCAGA GACGATCGAT GTGCTGCGCG ACCGCAATGC CGTCGTTCTG
TATACGTCAG ATCATGGCGA GTCATTGGGT GAGAACGGTG TCTTCTTCCA CGGGACGAAG
AAGGTGGCCG ATGAACAGTA CGAGGTCCCT CTTTTCCTGT GGTACAGTGA CGTATACGAA
AAGAGCCGCC CCGATGTGGT CGCCAGATTG CGTCGTGTGC GCGGGACACC GGTCACGCAT
GACTTCATCT ATCATACACT TCTCGGACTT GGTGGCATCC GTTCCACCAT TGCCTCGGCA
AGCCACGACC TTTCGGGAGT CACGGCACAG CCGCTACGGG CGTCGGGTGA TGAGGCTGTC
GCGTTGACAG AGTGA
 
Protein sequence
MRASRFGSVV FVVISLLAAL CYNPQLYATG VSPLAAVPVL LLHLALAVTF GGIPGLRWGY 
FLFLHVAGCA LTYFYTLFDV SMSYDTLAWL LETNRAEVDS FVTWPLFLLL CAGIAVSLVH
SSLSQSFRAS LAGRRYAAMA GLLLLCALVS AEGGRAAFKH LGGKASYEAL SMSRIVPVSV
VKAAAAYVRE ERRSDTLAAL PDPADAASSL VIPDHEKPVV VFIIGESARA DHFGINGYER
DTTPRLSVER NIINYGVCRS FANTTRISLI GILTDATIED RTPRHGSFIS LFNKHGYDTA
FFSRQNRLGR SGHLTDALVS GAGTVEYLKG VDRDLVTRLQ AFVANAGGKL VVLHTQGSHF
SYNQQYAAEH RRFMPDTYSN EAMPRDIANV INAYDNSIVK TDAMIAETID VLRDRNAVVL
YTSDHGESLG ENGVFFHGTK KVADEQYEVP LFLWYSDVYE KSRPDVVARL RRVRGTPVTH
DFIYHTLLGL GGIRSTIASA SHDLSGVTAQ PLRASGDEAV ALTE
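
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It assumes the sequences are pasted as shown, in space-separated blocks of ten characters, and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code. The helper names (`clean`, `translate`, `gc_percent`) are illustrative and not part of the record.

```python
from itertools import product

# Standard codon assignments (shared by translation table 11),
# listed in T, C, A, G order for each codon position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove whitespace and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate a nucleotide sequence, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks unknown codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(seq):
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


if __name__ == "__main__":
    # First three blocks of the gene sequence above.
    print(translate("ATGAGAGCTT CCCGCTTCGG TTCCGTCGTG"))  # MRASRFGSVV
```

The first ten codons translate to MRASRFGSVV, which matches the start of the protein sequence above. Passing the full gene sequence to `gc_percent` gives a value that can be compared with the reported GC content.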