Gene Dvul_0195 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDvul_0195 
Symbol 
ID4662509 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfovibrio vulgaris DP4 
KingdomBacteria 
Replicon accessionNC_008751 
Strand
Start bp238542 
End bp239702 
Gene Length1161 bp 
Protein Length386 aa 
Translation table11 
GC content67% 
IMG OID639818391 
Productglycosyl transferase, group 1 
Protein accessionYP_965646 
Protein GI120601246 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCCCCC CGGTCGTCTG CTTCTGCAAC ACCAACCCCG GCTGGGGCGG CGGCGAGAAA 
TGGCACCTTG AGGCGGCCAT CGCCCTTGCC CATCGTGGAC GGCGTGTGCT GCTCATGGCC
CACCCGGCAG GACGCCTCCA TGCCGAGGCG TCACGACTGG CAGCCACCCT CCCCGCCCAT
CTGCCCGGAC TGCGCGTCCT TCCGTTGCAG GTCGGCAGGC TCACGTTCCT CAATCCGGGC
GCCATCGTCC GTATCGCGCA TGTCCTGCAC AGGGAGAAGG TCGACAGCCT CGTGCTGGGC
CTGACCTCCG ACCTCAAGGC CGTGGGCCCT GCCGCGCGCC TCGCCGGAGT GCGTCAGGTG
TTCTATCGCC GGGGCAGCGC GCTGCCCATA CGCAACACGG CCTTCAACCG TCTGCTCTAT
GGGCGCGTCA TCAATGGACT CATCGTCAAC TCGCAAGAGA CCCGCCGGCT GGCGCTGGTG
AACAATGCGG GACTCATCCC CGAAGAGCGC ATCCACCTGC TTCACAACGG CATCGACGCC
ACGGGGTTCG ACGCCGCGCT CAAGAAGGCC AGCCCCGCCT ACAGGGCCGG CGGACATACG
CTGGTCATCG GCAATGCGGG GCGCCTCAAC AGGCAGAAGG GGCAGCACCA CCTGCTGCAC
ATGGCGCGTC TTCTGGCTGA CGAGGGGCTG GACTTCAGGC TTGTCATCGC GGGAGAGGGC
GAACGGAGAC AGGAGCTTGA GACGCTGGCG CGAACGCTTG GCGTTTCGGG GCATGTGGTC
TTTGCGGGGT TTCTCGCCGA CCTTGCGCCT TTCTGGAAGA GTCTGGACGT CTTCGTGCTC
AGTTCGCACT GGGAGGGCTT CGGCTATGTG CTTGCGGAGG CCATGCTGGC AGAAGTGCCC
GTGGTGTCCT TCGACGTGAG CAACATCCCC GAACTCGTGC AGGATGGCAC CAACGGCCTG
CTGGTGCCCG GCCCGGACGC GGCACCGGAA GGCGACGCCG CCCCCGCAGC GGGGCTTGCC
CGCGCCGTCA TGACCATGGC TGCGTCGCAG GACCTGCGTT GTCGCATGGG AGCGGCGGGC
AGGGCGCACG CCCTCGCCAA ATATGCGCAA GAGTCCTGCA TGGACGCACT GGAGAGCATC
CTCGGTAGCG CACCCCGGTA G
 
Protein sequence
MTPPVVCFCN TNPGWGGGEK WHLEAAIALA HRGRRVLLMA HPAGRLHAEA SRLAATLPAH 
LPGLRVLPLQ VGRLTFLNPG AIVRIAHVLH REKVDSLVLG LTSDLKAVGP AARLAGVRQV
FYRRGSALPI RNTAFNRLLY GRVINGLIVN SQETRRLALV NNAGLIPEER IHLLHNGIDA
TGFDAALKKA SPAYRAGGHT LVIGNAGRLN RQKGQHHLLH MARLLADEGL DFRLVIAGEG
ERRQELETLA RTLGVSGHVV FAGFLADLAP FWKSLDVFVL SSHWEGFGYV LAEAMLAEVP
VVSFDVSNIP ELVQDGTNGL LVPGPDAAPE GDAAPAAGLA RAVMTMAASQ DLRCRMGAAG
RAHALAKYAQ ESCMDALESI LGSAPR