Gene Dvul_1104 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDvul_1104 
Symbol 
ID4662810 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfovibrio vulgaris DP4 
KingdomBacteria 
Replicon accessionNC_008751 
Strand
Start bp1342923 
End bp1344185 
Gene Length1263 bp 
Protein Length420 aa 
Translation table11 
GC content63% 
IMG OID639819333 
Productvon Willebrand factor, type A 
Protein accessionYP_966551 
Protein GI120602151 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG4961] Flp pilus assembly protein TadG 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.753254 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.1698 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGAGCCC TGTCGGCCCT TCTCCGGCGG CAGAAGGGTT CCATGGCGAC GCTGGTCGCC 
GTGCTGCTGC CAGTCGTCCT CGGGCTTGTG GGCCTCGGCA TCGACTCGGG CATGCTCTAC
CTGTCCCACA GCCGCCTGCA GGCCGCCGTG GATGCTGCCG CCCTCGCCGG CAGCCTGCAA
CTGCCCTACG ACCCGCAGCT GGACAAGGGA CTCGTGCGCG GGGCCGTCAC GCAGTACATG
GATGCCAACT ACCCCGAAGC CTCTCTCAAC GGGGTGACTC CGGGCACAGA GGAACGCAGT
GTCACGGTCA CCGCCACCGC CACCGTGCCC ACCATCTTCA TGAACGCGCT CGGCATCGGT
TCCAGCGAGG TGCACGCCAA GGCCACTGCC GGATACAACA AGCTGGAGGT CGTCTTCGTC
ATCGACAACT CCGGTTCCAT GAAGGGCACC CCCATCCAGC AGACCAACAG CGCGGCCTCG
CAGCTTGTGG AACTCATCAT GCCCGAGGGC ATGATGACGT CGGTCAAGGT GGGGCTGGTG
CCCTTCCGCG GCAAGGTGCA CCTGCCAGCC GGTGTGGACG GGCTTCCCGA CGGCTGCCGC
AACGCCGACG GGACGCTGAA CCCCAGCTGG CTGCACGAAG AGTACTTCAA GACGTCATAC
CGCTATCCCT CAGGCTCGTC ACTGAACGTG CCCAAGAACA CGTGCACCAG CATTCCCCGC
GTGCAGGGAC TGACTGAAGA CCGCGAGACA ATCCTCACCG CCATATCGAA GCAGAACGGC
CTTGGTGACG CCTCGGGGAC GGTCATATCC GAAGGGCTGA AATGGGGACG TCACGTGCTC
ACGCCCGAGG CACCGTTCAC CGAAGGCTCA TCGGCCAAGG ACATCCGCAA GGTCATCATC
GTGCTCACCG ATGGTGATAC CGAAGACGGA AAGTGCGGAG GCAGCTACGC CATCAACTAC
ACCCCCAACG CCTACTGGAC CAACGCCTTC TACGGCATGC TGGACATGAC GTCGCACTGC
GAGAACGGGG GCAAGCTCAA TGCCGCCATG CTCGAAGAGG CGCGCAAGGT GAAGGAGGCG
GGTATCGAGG TGTTCGCCAT ACGCTTCGGC GATTCAGACA GTGTCGACGT CTCGCTCATG
AAGAGCATCG CGTCCAGCAA GGCTGGGACC AACGACCATT ACTACGACGC GCCCTCGGCC
TACGACATCG ACGACGTGTT CAAGAAGATC GGCCGACAGC TCGGCTGGAG ACTGCTGCGC
TAG
 
Protein sequence
MRALSALLRR QKGSMATLVA VLLPVVLGLV GLGIDSGMLY LSHSRLQAAV DAAALAGSLQ 
LPYDPQLDKG LVRGAVTQYM DANYPEASLN GVTPGTEERS VTVTATATVP TIFMNALGIG
SSEVHAKATA GYNKLEVVFV IDNSGSMKGT PIQQTNSAAS QLVELIMPEG MMTSVKVGLV
PFRGKVHLPA GVDGLPDGCR NADGTLNPSW LHEEYFKTSY RYPSGSSLNV PKNTCTSIPR
VQGLTEDRET ILTAISKQNG LGDASGTVIS EGLKWGRHVL TPEAPFTEGS SAKDIRKVII
VLTDGDTEDG KCGGSYAINY TPNAYWTNAF YGMLDMTSHC ENGGKLNAAM LEEARKVKEA
GIEVFAIRFG DSDSVDVSLM KSIASSKAGT NDHYYDAPSA YDIDDVFKKI GRQLGWRLLR