Gene Dvul_3072 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDvul_3072 
Symbol 
ID4662002 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfovibrio vulgaris DP4 
KingdomBacteria 
Replicon accessionNC_008741 
Strand
Start bp158078 
End bp159724 
Gene Length1647 bp 
Protein Length548 aa 
Translation table11 
GC content69% 
IMG OID639813992 
Productsugar transferase 
Protein accessionYP_961271 
Protein GI120586926 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG2148] Sugar transferases involved in lipopolysaccharide synthesis 
TIGRFAM ID[TIGR03013] sugar transferase, PEP-CTERM system associated
[TIGR03025] exopolysaccharide biosynthesis polyprenyl glycosylphosphotransferase 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTCAGGC ATCCCCTCCT CCATCTGCTC GGCGATTCAC TGCGTGTGGC GCTGGCACTC 
GGCATCGTGG CCCTGTGGCC GCTGTCGGGT GAACTGGCCA TGCCGCCCGT GACGGACATG
CTGGTGCTGG CCGCCGTGCT GCTGGTCTGC GCCGCCGTGA CCGACGTGCA TGAGCAGCGG
GCAGGGGGGA CGATGCCGGG CATCGGCGGT ACGGTACTGG CGGTGCTGGC GGCGTCGGTG
CTGCTGGTGC CGCTGCACGC CGAGGCGGGG GTGCTGTCGC GGCATGAGCT GACGGCCATG
TGGGTGCTGC TGTGGTTCGC CCTGTTGCGG GCGGGCAGCA CCGTGGCCGC GTCGTTCTGG
CGGTTGTTCC CCTCGCTGGC GCATGGCGTG CTGGTGCTTG GCGACGACGA GGGCATCCGC
ACGGCGGCGG GCCTTGCCGA GGCCAGCGAA GGCCGCTTCA GGGTGAAGAC CACGGTGCGC
TGGCACTCGG GCCGCCCGGT GGGCACGGGG TACCAGGCGC AGGCGTCATC CGAACGGGCG
CAGCCGTCGT CCGACCAGGC GCAGCCGTCA TCCGAACGGG CGCAGCCGTC GTCCGAACGG
GCGCAGCCGT CAGCAGGACA GCAGGCGTCG TCCGACCGGG CGCAGTCATC ATGCTATCAG
GCGCAGCCGT CATCAGGACA GGCCTGCCAG ACCCATCTCC GTGCGCAGGG TCTCGGGCTG
CCCCAAGACC CGCACTGCGC CGATGTCACG GGCGTCGTGC CGGATGCGTC GCACCCCGTT
CGTGGTGCCA TTCCCGGTTC CGTCCCCGCC ACCGGTTCAT GTGGAGACGA CGCCGAATGG
CTGGCGAACC TCGCCCGCCG CGAGAAGGTG CGTACCATCG TGGTCAGCCT CGCGGAACGT
CGCGGCAGCT TCCCGGTGGA CGCGGTGATG CGCTGCCGTC TGCGCGGGGT GCGGGTGCTC
GACGCCTCCA CCTTCTACGA GATGGTCACG CGCAAGCTCA ACGTCGAGCA CATCACCCCC
GGCTGGCTCA TCTTCGCCCC GGGCTTCGGC GGCAGCAGGC TGTGCGATGC CGGACGCCGG
GCGGCGGACA TCGCCCTCGC GCTGGTGGCG TCCCTTCTCG TCGCGCCCTT CATCCCCTTC
GTGGCCCTTG CCGTCAGGCT CGACTCGCCG GGCCCGGTGC TCTTCCGTCA GGTGCGGGTG
GGGCGGGGCG GCAGGCCCTT CACCCTCTAC AAGTTCAGGA CCATGCGGCA GGACGCGGAG
AAGGACACCG GCCCGGTCTG GGCGCGGGCC AACGACAGCC GCGTGACCCG CTTCGGGGCC
TTCATGCGGC GCTGCCGCAT CGACGAGTTG CCGCAACTCT TCAACGTGCT GCGCGGCGAC
ATGGGCTTCA TCGGCCCGCG GCCCGAGCGC CCCGAGTTCG TGGCTGAACT GTCGCGCGAG
GTGCCCTTCT ACGAACAGCG GCACGCCGTG CGTCCGGGAC TCACCGGGTG GGCGCAGGTG
CGCTTCCCCT ACGGGGCGAG CAAGGCCGAT GCACTGGAGA AACTGCGGTA CGATCTCTAC
TACATCAAGA ATCGTGCCTT CATGCTGGAT ATGGAGATCA TTGCAAGGAC GTTTTCCGTC
GTACTTTCAG GATCGGGAGC AAGGTAG
 
Protein sequence
MFRHPLLHLL GDSLRVALAL GIVALWPLSG ELAMPPVTDM LVLAAVLLVC AAVTDVHEQR 
AGGTMPGIGG TVLAVLAASV LLVPLHAEAG VLSRHELTAM WVLLWFALLR AGSTVAASFW
RLFPSLAHGV LVLGDDEGIR TAAGLAEASE GRFRVKTTVR WHSGRPVGTG YQAQASSERA
QPSSDQAQPS SERAQPSSER AQPSAGQQAS SDRAQSSCYQ AQPSSGQACQ THLRAQGLGL
PQDPHCADVT GVVPDASHPV RGAIPGSVPA TGSCGDDAEW LANLARREKV RTIVVSLAER
RGSFPVDAVM RCRLRGVRVL DASTFYEMVT RKLNVEHITP GWLIFAPGFG GSRLCDAGRR
AADIALALVA SLLVAPFIPF VALAVRLDSP GPVLFRQVRV GRGGRPFTLY KFRTMRQDAE
KDTGPVWARA NDSRVTRFGA FMRRCRIDEL PQLFNVLRGD MGFIGPRPER PEFVAELSRE
VPFYEQRHAV RPGLTGWAQV RFPYGASKAD ALEKLRYDLY YIKNRAFMLD MEIIARTFSV
VLSGSGAR