Gene Dvul_2686 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDvul_2686 
Symbol 
ID4662904 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfovibrio vulgaris DP4 
KingdomBacteria 
Replicon accessionNC_008751 
Strand
Start bp3128332 
End bp3130263 
Gene Length1932 bp 
Protein Length643 aa 
Translation table11 
GC content68% 
IMG OID639820933 
Productglycosyl transferase family protein 
Protein accessionYP_968125 
Protein GI120603725 
COG category[R] General function prediction only 
COG ID[COG1216] Predicted glycosyltransferases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.137899 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATGCCAG ACATGTCCAC CAGACGTGCC CGTCAGCCTG CCACGGCGTG CCTTTCCGCT 
GGTGTCCCCC TGTGCTACAC CCGACAAAAC GGTGGAGCCT TCATGTCTGT ATCCCTGCTT
GCGCACTGGC GCACCCTCGC CCCGTCGATA CGTAGCCGAC TGCTTCATGG CAGCGTGGGC
AGCGCACATT GTCTGCGTCT CGCCTCGCGC TGTCTCACCG TAGCCGGGGC GTCCGGAGTC
GGTAGCGGTG ACGCACAGAC CGCCTTTCGC CTTGGCCGTG CCCTGCTGCT TGCCGCATGG
GAAGAAGACC CCTGCAACGG TCAACTGGCC TCGCAGGTGC TGGCCCTGCA CGGGCGTGTC
CCGTGGCTGG GTGAGGCCAC TGCACGGTTG CTTGCCGCCG TGGCGGGGGC GTGGCATGCG
CCTGCAGACC TCGGCCCGCT GGAACGGCTT GCCGCCGCAG GCGACTGGCA GGGGGCTTTG
GATCTTGCCG CAGGGCATGT CGTACGCGCT GCGGATGCCG GAACCTGCGA TCTTTTCTGG
TTGCGGCAGG CGCTGGTGTG TGCCGAACTT TCGGGCGATG CGGCTTGGGG TGCAGACCTC
GCCGCGCGGG CACTGGGCGA TGTACCGGCA CCGGAAATGT TTCGCCGCGG CGGGCTCGCA
CCCCTTGCGC CGCTTGCTGC CTACCTCGAA GGGTGCCGAC TCGCCACGTC AGGCAACGCT
GCCGACTTCG CCCGCGCGCT GGGCCTGTTC CGGGGGTGTT CCGCGCTGGT GGGTGACCAT
GTGACGGTCC CGGAGGGCGC ATGGCTGGCA CCCGTGGAAA GGGCCGGTCA CTGCATGGCG
CGTCTGGGGG CTCGTGACGG CGCGCTTTCC CTGTGGCGCG TGGTGCTTGC GGCGCGCCCG
TGGCATGTGA GCCTCATACT GCGGGCGCAC GATGTGGCGC AGGGGTACGA CCAGCCCGCG
CCCACCCCGC CGGGAACCAC CGCGGCCCTG CTCTATTCGT GGAACAAGGC CGAAGAACTC
GATGAAGCAC TACAGGCGCT GGCAGCATCG CTGGATGACA TCACCGTGGT CGCCTGCCTC
GACAACGGTT CCACAGATGC CACGGGCGAT GTCATGCGTG TGTGGGGCGA CCGCATGGGG
CGCGACAGGT TCGTGCCCGT GCGCCTTGCC GTCAACGTGG GGGCCCCGGC AGCCCGCAAC
TGGCTGATGC AGTTGCCGCA GGTGGCGGCG TGCGAGTACG CCGCCTATCT CGACGATGAC
GCCGCCGTGC CAGCCGACTG GCTGCGTCAC TTCGCCCGTG CCGTGGCCGT GCGGCCCGAT
GCCGCAGCGT GGGGCTGCCG TGTGGTGGAT TGGCACTCGC CTGCACTGGT GCAGTCTGCG
GCGTTGCACA TGGTGCCCGC CTTCGAGGTG CGCGATGTGG CGGGAGGTGC CGAGGCCGGA
ACAGGTGAAG ACGGCCTGCC CGACGCAGAA GCCGTCTATG CGCCCACACT GGCGCACGGG
TTGCCTTTCA CCGTCAGCGA CCTGCACTGC CAGACCACCG ACCTTGGCAG GTTCGACCAC
ATCCGACCGT GCGTCTCGGT GACGGGGTGC TGCCATCTCT TCCGTACGCG CGACCTTCTC
GCACGCGGCG GGTTCGCGCT CTCTCTCTCC CCCTCGCAGT ACGACGACCT CGAACACGAC
TTGCGCGCCG CCCGCGACGG ACGTCTGGCC TGTTACGACG GCTTTTTCGC CGTGCGCCAC
AAGAAGCGCA CCGGCAAGGC CGCCCGCATG TCCGGGGCGC AGTACGGCAA CGGGCTGGGC
AATCGCTACA AGTTGCATGG CATGTATGAT GCGCCCGCCG TGGCCCGCAT CCACCGCATG
GAACTCGAAG CTCTGGAACA GGATTTGCTT GAACGCATGG CGCGACTCGA TGTCCTGCAC
AGACGGGGCT GA
 
Protein sequence
MMPDMSTRRA RQPATACLSA GVPLCYTRQN GGAFMSVSLL AHWRTLAPSI RSRLLHGSVG 
SAHCLRLASR CLTVAGASGV GSGDAQTAFR LGRALLLAAW EEDPCNGQLA SQVLALHGRV
PWLGEATARL LAAVAGAWHA PADLGPLERL AAAGDWQGAL DLAAGHVVRA ADAGTCDLFW
LRQALVCAEL SGDAAWGADL AARALGDVPA PEMFRRGGLA PLAPLAAYLE GCRLATSGNA
ADFARALGLF RGCSALVGDH VTVPEGAWLA PVERAGHCMA RLGARDGALS LWRVVLAARP
WHVSLILRAH DVAQGYDQPA PTPPGTTAAL LYSWNKAEEL DEALQALAAS LDDITVVACL
DNGSTDATGD VMRVWGDRMG RDRFVPVRLA VNVGAPAARN WLMQLPQVAA CEYAAYLDDD
AAVPADWLRH FARAVAVRPD AAAWGCRVVD WHSPALVQSA ALHMVPAFEV RDVAGGAEAG
TGEDGLPDAE AVYAPTLAHG LPFTVSDLHC QTTDLGRFDH IRPCVSVTGC CHLFRTRDLL
ARGGFALSLS PSQYDDLEHD LRAARDGRLA CYDGFFAVRH KKRTGKAARM SGAQYGNGLG
NRYKLHGMYD APAVARIHRM ELEALEQDLL ERMARLDVLH RRG