Gene Dvul_1003 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDvul_1003 
Symbol 
ID4663081 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfovibrio vulgaris DP4 
KingdomBacteria 
Replicon accessionNC_008751 
Strand
Start bp1234457 
End bp1235902 
Gene Length1446 bp 
Protein Length481 aa 
Translation table11 
GC content66% 
IMG OID639819227 
Productglycoside hydrolase family 3 protein 
Protein accessionYP_966451 
Protein GI120602051 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1472] Beta-glucosidase-related glycosidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.861392 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTGTTC CTGTGGTGCC GGGTTACTGT TCGTCGCCCT GTGGGGGGCT TTCACCGCAG 
AGGCACAACA GAGGTGGTCT GCGCTGGCTG GCGTCTACAT TCGCCGTGCT GCTGGCGTTG
TGCCTCACCG TGGCCTCTGC GCAATGCGCC TCTGCCAACG CCACGGTCGA AGCCATGGCG
GGACAGATGC TCATGCTCGG CTTCCGTGGG GCCGAACCTG CGGATGCAGG CCCCATCCTT
CGCAGCATCG CCGCAGGGCA TGTGGGTGGC GTCATCCTGT TCGACAGGGA TATGGACCCG
TCGGTGCGGG TGCGCAACAT CGTCTCCAAA GAGCAGTTGA GCCGTCTGAC GGGGGCGTTG
CAGGCGGCGG CCCCGGTTCC CCTGTTCATC GCCGTCGACC AGGAGGGCGG AAGGGTGCGA
CGCCTGAAGC CCGAGTACGG GTTCTTCGCC TATCCTTCTG CGGCGTCTCT CGGCAAGGGC
AGCCCGGAGG ATACGCGTCG CATGGCGTCC ACGCTCGCCA CGGAGATGGC GGAGGTGGGA
CTCAACGTGG ATTTCGGGCC GGTGGTCGAC CTTGCCGTCA ATCCGTCGAA CCCTGTCATA
GCCCGCCTCG AACGCAGTTA TGGCAGCGAC CCGTGCCGCG TGGCGTCCCA TGCCGCAGCT
TTCGTCAACG GGCTGGCATG GCGCGGTGTT GTGGCTTCTC TCAAGCATTT TCCGGGGCAT
GGCAGTTCGT TGCAGGACTC GCATCTTGGC GTCACCGACA TCTCGTCCAC ATGGCGGCGC
GAAGAACTCG GCCCCTATGC CCTTCTCCTG CGCGACGACT GGGCGGGCAT GGTCATGGTG
GGTCATCTCT ACAACAACCG CATCGATGCG GCGCATCCTG CCACGCTGTC ACAACGCACC
ATCGACGGAC TGCTGCGGCG CGACCTCGGC TGGAAGGGCG TCGTGGTGAC GGACGACCTG
CAGATGGGCG CCATCACCGC CCGTTATTCC CTCGACGAGA CCGTGCGTCT CGCCGTGGAG
GCTGGGGCCG ACATCCTGCT CTTCGGCAAC AATCTCGTAT GGGACGAGGG GCTTGCCGAG
AAGGTCCATG CCACGCTCGT GCGTCTCGTG CGTGAGGGCA AGGTGTCCGA ACAAAGGCTG
CGGCAATCGT GGGAACGTAT CATGCGACTC AAGTCCGTGC TTACGGTGGC ATCACCCGGC
GCCATCGCGC AGGGAGATTC TTCAGGCACT GCCATCGCCC CCGTCCCTGC CGTGCCCGAC
CCGACTGGCG CGCCGCGCCC CCTGCGTCTC ACTCTGCCCA TGCCGCGCGG CGATGAGCCG
TTGTGCCGCG ACCGCGACAC GCTGCAACGC GACCCCTTCG ACGACAGACA GGGTGCGCGC
TACGGGCGCG GTACGGGCAT CGGCATCGGC GTCGGGACCG GCACTGGCAT CGGCATCGGG
CAGTGA
 
Protein sequence
MTVPVVPGYC SSPCGGLSPQ RHNRGGLRWL ASTFAVLLAL CLTVASAQCA SANATVEAMA 
GQMLMLGFRG AEPADAGPIL RSIAAGHVGG VILFDRDMDP SVRVRNIVSK EQLSRLTGAL
QAAAPVPLFI AVDQEGGRVR RLKPEYGFFA YPSAASLGKG SPEDTRRMAS TLATEMAEVG
LNVDFGPVVD LAVNPSNPVI ARLERSYGSD PCRVASHAAA FVNGLAWRGV VASLKHFPGH
GSSLQDSHLG VTDISSTWRR EELGPYALLL RDDWAGMVMV GHLYNNRIDA AHPATLSQRT
IDGLLRRDLG WKGVVVTDDL QMGAITARYS LDETVRLAVE AGADILLFGN NLVWDEGLAE
KVHATLVRLV REGKVSEQRL RQSWERIMRL KSVLTVASPG AIAQGDSSGT AIAPVPAVPD
PTGAPRPLRL TLPMPRGDEP LCRDRDTLQR DPFDDRQGAR YGRGTGIGIG VGTGTGIGIG
Q