Gene DvMF_2733 details

Gene Information

Locus tag: DvMF_2733
Symbol:
ID: 7174672
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfovibrio vulgaris str. 'Miyazaki F'
Kingdom: Bacteria
Replicon accession: NC_011769
Strand:
Start bp: 3458659
End bp: 3459879
Gene Length: 1221 bp
Protein Length: 406 aa
Translation table: 11
GC content: 65%
IMG OID: 643541266
Product: von Willebrand factor type A
Protein accession: YP_002437140
Protein GI: 218887819
COG category: [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG4961] Flp pilus assembly protein TadG
TIGRFAM ID:
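
The start and end coordinates, gene length, and protein length listed above agree with one another, and this can be checked with a little arithmetic. The following is a minimal sketch, assuming the coordinates are inclusive (the page does not state this) and that the CDS ends in a single stop codon that is not counted in the protein length. The function name `expected_lengths` is illustrative only.

```python
def expected_lengths(start_bp: int, end_bp: int) -> tuple:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes inclusive coordinates and one terminal stop codon that is
    excluded from the protein length.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1  # drop the stop codon
    return gene_len, protein_len


# Start and end values taken from the record above.
gene_len, protein_len = expected_lengths(3458659, 3459879)
print(gene_len, protein_len)  # 1221 406
```

The result matches the Gene Length (1221 bp) and Protein Length (406 aa) fields.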


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 79
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCCATGC TGATGGCCGT GCTGCTGCCC GTGGTGCTGG GCCTTGCCGG CCTTGGCATC 
GATTCGGGCA TGCTCTACCT CGCGCACAAC CGCCTGCAGG GGGCCGTGGA TGCGGCAGCC
CTGGCGGGCA GTCTGGAACT TCCCTACGAC CCGCAACTGG ACAAGGGGCT GGTGAAGGGC
GCCGTGAACC AGTACATGGC CGCCAACTAC CCCGCCGCCG TGCTGAAGGG CGTCACCCCC
GGCACCGAGG AACGCAGCGT CACCGTGAAG GCCGAGGCCA CCGTGGACAC CATCTTCATG
GGTGCCCTTG GCATCGGGTC CAGCACGGTG CGCGCCCAGG CCACCGCCGG GTACAACAAC
CTGGAAGTGG TCTTCGTCAT CGACAACACC GGCTCCATGA AGGGCACGGC CATCCAGCAG
GCCAACGCGG CCGCCACCCA GCTTGCCGAA CTGATCATGC CCGACGGCAT GGAAACCTCG
GTCAAGGTGG GGCTGGTGCC CTTCCGGGGC AAGGTGCACA TTCCCGCGGG CGTGGACGGC
CTGGCCGACG GCTGCCGCAA CGCCGACGGC ACCCTGGCGC CCTCGTGGAT ACTGGAAGAG
TACAAGCAGA CCAAGTACCG CTACCCCACG GGTTCGTCAC TCAACGTGCC CAAGGGCACC
TGCGACAGCA TTCCGCGCGT GCAGGCCCTG ACCAGCAACC GCACCACCAT CGTCAGCGCC
ATCGCCAAGC AGGACGCCCT GGGCGATGCC TCGGGCACCG TCATCTCCGA AGGCATCAAG
TGGGGGCGCC ATGTGCTCAC TCCCGAGGCG CCGTTCACCC AGGGCTCGTC CAACAAGGAC
ATGCGCAAGG TGATGATCGT GCTGACCGAC GGCGATACCG AGGACGGCAA GTGCGGCGGC
AACTACGCCC TGAACTACAC GCCCAACGCC TACTGGACCA ACGCCTACTA CGGCATGTTC
GACATGAACA CTCACTGCGA GAACGGCGGC AAGCTGAACG CGGCCATGCT GAGCGAGGCG
CAGATCGCCA AGGACAAGGG CATAGAGATC TTTGCCATCC GCTACGGCGA CTCCGACTCC
ACGGACATCA GCCTGATGAA GGCCATCGCC TCCAGCAAGG CGGGCACGGA CGACCACTAC
TACAACGCGC CCTCTGCCTA CGATCTTGAA GAAATCTTCA AGAAGATCGG TCGGCAGCTT
GGCTGGCGGT TGCTGCGCTA G
 
Protein sequence
MAMLMAVLLP VVLGLAGLGI DSGMLYLAHN RLQGAVDAAA LAGSLELPYD PQLDKGLVKG 
AVNQYMAANY PAAVLKGVTP GTEERSVTVK AEATVDTIFM GALGIGSSTV RAQATAGYNN
LEVVFVIDNT GSMKGTAIQQ ANAAATQLAE LIMPDGMETS VKVGLVPFRG KVHIPAGVDG
LADGCRNADG TLAPSWILEE YKQTKYRYPT GSSLNVPKGT CDSIPRVQAL TSNRTTIVSA
IAKQDALGDA SGTVISEGIK WGRHVLTPEA PFTQGSSNKD MRKVMIVLTD GDTEDGKCGG
NYALNYTPNA YWTNAYYGMF DMNTHCENGG KLNAAMLSEA QIAKDKGIEI FAIRYGDSDS
TDISLMKAIA SSKAGTDDHY YNAPSAYDLE EIFKKIGRQL GWRLLR
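
The protein sequence follows from the gene sequence by codon-wise translation. The sketch below is a rough illustration, not the database's own pipeline. It builds the codon table by hand, on the understanding that translation table 11 assigns the same amino acids to codons as the standard table and differs only in which start codons are permitted. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical. `gc_fraction` can be applied to the full gene sequence to compare against the GC content field.

```python
from itertools import product

# Codon-to-amino-acid assignments in TCAG order; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a DNA sequence codon by codon, ignoring a trailing partial codon."""
    seq = clean(seq)
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a non-empty sequence."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


# First 30 bases and last 21 bases of the gene sequence above.
print(translate("ATGGCCATGC TGATGGCCGT GCTGCTGCCC"))  # MAMLMAVLLP
print(translate("GGCTGGCGGT TGCTGCGCTA G"))           # GWRLLR*
```

The two fragments reproduce the first ten and last six residues of the protein sequence. The trailing `*` is the TAG stop codon, which is not part of the 406 aa protein.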