Gene DvMF_2968 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDvMF_2968 
Symbol 
ID7174910 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfovibrio vulgaris str. 'Miyazaki F' 
KingdomBacteria 
Replicon accessionNC_011769 
Strand
Start bp3743181 
End bp3744251 
Gene Length1071 bp 
Protein Length356 aa 
Translation table11 
GC content64% 
IMG OID643541501 
Productextracellular solute-binding protein family 1 
Protein accessionYP_002437373 
Protein GI218888052 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0687] Spermidine/putrescine-binding periplasmic protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones78 
Fosmid unclonability p-value0.897922 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGAAGA AGATGCTGTT GCTGCTGTCG ATGTTCACGG TGCTGCTGCT GCCCTGCCTG 
GGCAATGCCG CGCCCAAGGA ATTCCGCCTG CTGACCTGGA AGGGGTACGC CCCGGCGGAG
CTGGTGGAAA AGTTCGAGAA GGAAACCGGC TACAAGGTGC AGGTGACCTA TTCGAACAAC
GAGGAAATGA TCGCCAAGCT GCGCGCCACG CGCGGCGGCG GGTTCGACCT TGCCCAGCCC
AGCCAGGACC GCATTTCTTC CGTGCAGGAA AGCTTCGGCC TGTACCAGCC CATCGACTTC
GGCCGCATCG AGGCAGCCCG GTTCATTCCC TCCATGCTCG ACGCGGTGAA GAAGAACACC
CTGGTCAAGG GCAAGTCGTA CGCCGTGCCG TTCTGCTGGG GCACCGACGG CCTGATCGTG
AACCGCAAGT TCGCCCCCGA TGCCAAAAGC TTCGCCGACC TGCTGGACGC CAAGTACGCG
GGCCGCGCCA GCTACCGCCT GAAGCGCCCC ACCCTCATCG CGCAGGCCTT CGGCATGGGC
ATCGACCCCT TCAAGCTGTA CGCCGATGAA GCCGCCTACC AGAAGATGCT GGACCAGGTG
GAAGGCAAGC TCATTGCCGC CAAGGGCGTG GTGAAGAACT ACTGGACCAA CGGCGACGCG
CTGCTGGAAT CCATGCGTTC GGGCGAAGTG CACATCGCCC AGGCCTGGGA CAACGGCGGC
TTCAAGCTGC ACGCGGAAAA CCCCGACATC GACTTTGTGG CGCCCACCAC CGGCGCGCTG
GGCTGGATCG ACACCTTCGC CATCCCCGCC AAGGCCGACA ACGCGGACGC CGCGTACAAG
TGGATCAACT TCATGATGCG GCCCGAAAAC GCGGCAGTGT TCACCAACGC CGAAGACACC
CCCACCGCCG CCGTGGGCGT GGGCGAACGC CTGAAGCCCA CCTTCCGCGC AGACTTCGAA
CGCTGCTACC CGCAGCAGGT CATCGACAAC ATCAAGTGGT ACCCGCCCGT GCCCGCCAAG
CTCGAAGCCA TGGAAGGCAA GGCGCTGGAC CGGGTGAAGG CCGCCCAGTA G
 
Protein sequence
MTKKMLLLLS MFTVLLLPCL GNAAPKEFRL LTWKGYAPAE LVEKFEKETG YKVQVTYSNN 
EEMIAKLRAT RGGGFDLAQP SQDRISSVQE SFGLYQPIDF GRIEAARFIP SMLDAVKKNT
LVKGKSYAVP FCWGTDGLIV NRKFAPDAKS FADLLDAKYA GRASYRLKRP TLIAQAFGMG
IDPFKLYADE AAYQKMLDQV EGKLIAAKGV VKNYWTNGDA LLESMRSGEV HIAQAWDNGG
FKLHAENPDI DFVAPTTGAL GWIDTFAIPA KADNADAAYK WINFMMRPEN AAVFTNAEDT
PTAAVGVGER LKPTFRADFE RCYPQQVIDN IKWYPPVPAK LEAMEGKALD RVKAAQ