Gene Dvul_3065 details


Gene Information

Locus tag: Dvul_3065
Symbol:
ID: 4662001
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfovibrio vulgaris DP4
Kingdom: Bacteria
Replicon accession: NC_008741
Strand:
Start bp: 149855
End bp: 150901
Gene Length: 1047 bp
Protein Length: 348 aa
Translation table: 11
GC content: 71%
IMG OID: 639813985
Product: hypothetical protein
Protein accession: YP_961264
Protein GI: 120586919
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG5653] Protein involved in cellulose biosynthesis (CelD)
TIGRFAM ID: [TIGR03019] FemAB-related protein, PEP-CTERM system-associated
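
The coordinates and lengths in this record are mutually consistent, which a little arithmetic confirms. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon (neither is stated explicitly in the record). The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(149855, 150901)
print(length, protein_length_aa(length))  # prints: 1047 348
```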


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0204721
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCTGACA TGGAGGTCAG GGGCGTCGAC CCCAACGCCC CGCAAGAGGC GGCCCTGTGG 
GACGCCTACG TGGCGGCCCA TGCGGAGTCC ACGGGCTATC ACCGCATGGG CTGGACGCGG
GTGGCGCAAC GGGCCTTCGG GCACGCGGCG TATCCCCTCG CGGCCTTCGA CGGCGGACGC
ATCGCCGGGG TGCTGCCCCT CGTCCACATC CGCAGTCGCC TCTTCGGGCG CTTTCTCGTC
TCGCTGCCCT TCGTCAACTA CGGCGGGCTG CTGGCGGACT CCGCCGAGGC GGCGCAGGCG
CTCATCGACG AGGCCGAGGG GCTGCTGCGG CGCACCGGGG CGGGCAGCAT CGAACTGCGG
CACGTGGGGC CGCCGCGCCT CGGGCTTTCC GCCAAGTCGC ACAAGGTGAC CATGCTCCTC
GACCTGCCGG ACGACCCCGA CACCCTGTGG CGCGGCCTGC GCGACAAGGT GCGCAATCAG
GTGCGCAAGG CGGGCAAGTC GGGCCTCACC GTGGAACAGG GCGGCGCGGG GCTGCTTGGG
CCGTTCTACG ACGTGTTCGC CGTCAACATG CGCGACCTCG GCACGCCGGT GTACTCGCGG
CGCTTCTTCG AGACCATCAT GGACGAATTC CCCGGCGCCA CGCGCATCGT CGCCGTGCGC
GACGGAGACG CCGTGGTGGC GGCAGCCCTC TGCTACACGC ACGGCAACAC CTTCGAGGTG
CCGTGGGCCT CGTCGCTGCG CACCCACCGT GACCGCTGCC CCAACAACCT CATGTACTGG
CACTGCATGG AGACGGCGTG CCGTGAAGGG TTCACCGTGT TCGACTTCGG GCGTTCGTCG
CGCGACAGCG GCCCGTGGCG CTTCAAGGCG CAGTGGGGCG CGCGCGAGGT GCCCCTCAGC
TGGGAGTACC TGCTGGCCGA CGGCGCACCC CTGCCCGACC TCAACCCGTC CAGCGCCCGC
TTCAGCCTCG CCGTGCGGGT GTGGCGGCAT CTGCCCGTGG CCCTCACGCG GTTCATCGGC
CCGCACATCG TCAGGAGCAT CCCATGA
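
The gene sequence is displayed in space-separated blocks of 10 bases, so it needs cleaning before any computation. Below is a minimal sketch of removing the display whitespace and computing a GC percentage. It assumes the GC content listed in the record is a plain G+C fraction of the gene sequence, which the record does not state. The helper names are hypothetical.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for 10-base display blocks."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First display line of the gene sequence, used here as a small example.
first_line = "ATGTCTGACA TGGAGGTCAG GGGCGTCGAC CCCAACGCCC CGCAAGAGGC GGCCCTGTGG"
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```

Passing the full multi-line sequence to `clean_sequence` and then to `gc_percent` gives a value that can be compared against the GC content listed in the record.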
 
Protein sequence
MSDMEVRGVD PNAPQEAALW DAYVAAHAES TGYHRMGWTR VAQRAFGHAA YPLAAFDGGR 
IAGVLPLVHI RSRLFGRFLV SLPFVNYGGL LADSAEAAQA LIDEAEGLLR RTGAGSIELR
HVGPPRLGLS AKSHKVTMLL DLPDDPDTLW RGLRDKVRNQ VRKAGKSGLT VEQGGAGLLG
PFYDVFAVNM RDLGTPVYSR RFFETIMDEF PGATRIVAVR DGDAVVAAAL CYTHGNTFEV
PWASSLRTHR DRCPNNLMYW HCMETACREG FTVFDFGRSS RDSGPWRFKA QWGAREVPLS
WEYLLADGAP LPDLNPSSAR FSLAVRVWRH LPVALTRFIG PHIVRSIP
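
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The sketch below builds the codon table from the standard compact amino acid string. Translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs only in its permitted start codons, which does not matter here because the gene starts with ATG. The names `CODON_TABLE` and `translate` are illustrative, and the code assumes the sequence is given in reading frame 1 on the coding strand.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (standard assignments,
# shared by translation table 11 for elongation).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop display spaces and line breaks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last display lines of the gene sequence (both start in frame).
print(translate("ATGTCTGACA TGGAGGTCAG GGGCGTCGAC CCCAACGCCC CGCAAGAGGC GGCCCTGTGG"))
# prints: MSDMEVRGVDPNAPQEAALW
print(translate("CCGCACATCG TCAGGAGCAT CCCATGA"))
# prints: PHIVRSIP
```

Both outputs match the corresponding start and end of the protein sequence listed above.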