Gene Dvul_0224 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDvul_0224 
Symbol 
ID4662352 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfovibrio vulgaris DP4 
KingdomBacteria 
Replicon accessionNC_008751 
Strand
Start bp272431 
End bp273726 
Gene Length1296 bp 
Protein Length431 aa 
Translation table11 
GC content64% 
IMG OID639818420 
Productextracellular solute-binding protein 
Protein accessionYP_965675 
Protein GI120601275 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCCTAT TCCGCAAAAC CTGCGCACTG GTGGCGGCTT CGCTGCTGTC GCTGACCCTG 
ATGGCGGGCA CCGCGCTCGC CGAAAAGGTC AATCTCACCT TCTACTTCCC GGTCTCCGTC
GGCGGCCCCA TCACCAAGAT TGTCGAGGGC ATGACAGAGC AGTTCATGAA GGAACACCCC
GACATCAACA TCACCCCCGT CTACGCAGGC ATCTACCGCG AGACGCTCAC CAAGGCGCTC
ACGGCCCTGC GTGGCGGTGA ACCGCCCCAT GTTGCCGTGC TGCTCTCCAC CGACATGTAC
ACCCTCATCG ACGAGGATGC CATCGTCCCC TACGACAGCA TCATGAAGCC CGAAGATATG
GCCTTCACCA AGGCGTTCTT CCCCGGCTTC ATGAGCAACA GCCAGACCGG CGGCAAGACG
TGGGGCATTC CCTTCCAGCG CTCGACCATC GTCATGTACT GGAACAAGGA GGCCTTCAAG
GCTGCGGGCC TCGACCCTGA CAAGGCCCCT GCCAACTGGC AGGAACTGGT CGCCATGGGC
AAGAAGCTCA CCGTCAAGGA CGAAAGCGGC AAGGTCACCC AGTGGGGCGT CGCCATCCCG
TCCACCGGCT ATGCCTACTG GATGCTGCAG GCCCTCGCCA TCCAGAACGG CGTGGAACTC
ATGAACGCCG AAGGCACCAA GACCTACTTC GACGACCCCA AGGCCATCGA AGCCCTCACC
TTCCTCGTCG ACCTCGCCGG CAAGCACGGC GTGTCGCCCT CCGGCACCAT CGACTGGGCC
ACCACCCCGC GTGACTTCTT CGAGCGCAAG ACCGCCATCA TGTGGACCAC CACCGGCAAC
CTGACCAACG TCCGCAAGAA CGCGCAATTC CCCTTCGGTG TGGGCATGCT GCCCGCCAAC
ACCCGCCCCG GTTCGCCCAC GGGCGGCGGC AACTTCTACA TCTTCAAGAA GAGCACCCCC
GCCGAACGTC AGGCCGCCGT CACCTTCGTG AAGTGGATGA CCAGCGCCGA ACGCGCAGCC
CAGTGGGGTA TCGACACCGG CTATGTGGCG GTGCGCCCCG ATGCATGGGA GACCAAGGCC
ATGAAGGACT ACGTGGCCTC CTTCCCCTAC GCCGCCATCG CCCGCGACCA GCTGGCCCAC
GGCGTGGCCG AGCTCTCCAC CCACGACAAC CAGCGCGTGA CCAAGGCGCT TGACGACGCC
ATTCAGGCCG CCGTCACCGG TTCCAAGACA CCTGCCGAAG CACTCAAGGC AGCCCAGAAG
GAAGCCGAGC GCATCCTGCG CCGTTACGCG AAGTAG
 
Protein sequence
MALFRKTCAL VAASLLSLTL MAGTALAEKV NLTFYFPVSV GGPITKIVEG MTEQFMKEHP 
DINITPVYAG IYRETLTKAL TALRGGEPPH VAVLLSTDMY TLIDEDAIVP YDSIMKPEDM
AFTKAFFPGF MSNSQTGGKT WGIPFQRSTI VMYWNKEAFK AAGLDPDKAP ANWQELVAMG
KKLTVKDESG KVTQWGVAIP STGYAYWMLQ ALAIQNGVEL MNAEGTKTYF DDPKAIEALT
FLVDLAGKHG VSPSGTIDWA TTPRDFFERK TAIMWTTTGN LTNVRKNAQF PFGVGMLPAN
TRPGSPTGGG NFYIFKKSTP AERQAAVTFV KWMTSAERAA QWGIDTGYVA VRPDAWETKA
MKDYVASFPY AAIARDQLAH GVAELSTHDN QRVTKALDDA IQAAVTGSKT PAEALKAAQK
EAERILRRYA K