Gene Dvul_3070 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDvul_3070 
Symbol 
ID4661951 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfovibrio vulgaris DP4 
KingdomBacteria 
Replicon accessionNC_008741 
Strand
Start bp155769 
End bp157271 
Gene Length1503 bp 
Protein Length500 aa 
Translation table11 
GC content65% 
IMG OID639813990 
Productlipopolysaccharide biosynthesis 
Protein accessionYP_961269 
Protein GI120586924 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG3206] Uncharacterized protein involved in exopolysaccharide biosynthesis 
TIGRFAM ID[TIGR03007] polysaccharide chain length determinant protein, PEP-CTERM locus subfamily 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGACCCG GAGGACGTGA ACCCCAGCAG GCGCCGGGCC GCGAGTCCGG CAGCCTCGAT 
GTCAGGCGTT ACCTCGTGCT GCTGCGCGAG AGCATCGTGA CCTTCTGCGC CATCGCCGTG
GGGGTGACCC TGCTCGGTGT GGCGGTCAGC TATGTGTTGC CCAAACGCTA CGAGGCGCAC
TCCTCCGTGT CGGTCGAGCA GAACGTGGTC AACGAACTGG TGAAGGGCAT CGCCATCACC
CCGTCGCTTG AAGCGAAACT GCGCATCCTC AAGGTGTCCA TCCTGAGTCG CAAGATGCTG
CTCGCCGTCA TCCGCGACCT CGACATGGAC CTCGGCAGGC AGGGCGAACA GCTTGAGATA
CTCATCGAGA ACACGCGCAA GAACGTCGAG ATCAGCTACG AGGAGAAGAA GGGCCTCTTC
TACATCCGCT ATCGCGACGC GTCGCCGGAA CGGGCGCGTG ACTTCGTCAA CGCGCTCACC
CGGCGCTACA TCGAGGAGAG TACGGCCTCG AAGCGCGAAG AGTCGTACGA GGCCACGCGC
TTCCTCTCCG ACCAGATAGA GGTGTTCCAG AAGCGCATCG AGGCCGCGCA GAAGGCCATC
GACGCCTTCA AGTCCGAGAA GGGCATGATC CTCAGCATGA ACGAGAGCAT CCTGCGCGAG
GAGATCAAGG AGACGGAACA CCGCCTCGAA GAGACGCGCA TCCGCAAGAA CGAGCGGCTG
GCCCAGCTCG GCATCCTCGA GAAGGGGACG GGCGGCGGGC GGCTTGCCGA GAAGGAGGCG
GCCTACAAGG CGCTTCTGGG AACCTACACC GCGCAGCACC CCGACGTGGT GCGCGCCAAG
GCCGAACTCG ACGCCCTGCG CGCCAGCGGG GGCGGGGGCG GCGGGCGCAA GGGCGGCGTC
GACTACCAGC GCATCAAGGT CGAACTCGAA TCGCTGAACG AGATAGAGCG CATCCAGCAA
GAGCTCATCG AGAAGGACAA GCGCCTGTTG CAGGAACTGC CCGCAGTGCA GACCGAACTG
CAGGCCCTGC AACAGGCCCG CAAGAACGAG ACGCTCATCT ACGAACAGCT GGTGACGCGC
TACGGGCAGT CCGAAGTCTC GAAGCAGATG GAATTGCAGG ACAAGGCCGT GAGCCTTCGC
ATCATCGACC CCGCCATCCT GCCCATACGC CCGAGCACCC CCAACAGGCC GCTCATCATG
CTGGCGGGGC TGCTGCTTGG CGGGGCCATC GGCGCGGGGT GGGTCATCCT CTCCGACCAG
CTGTTCCGCA AGCTGCGCTC GGTGGAGGAC CTCACCGCCA TGGGTGTCGT GGTGCTCGGC
GCGTTGCCCC GCATCGCCTC GCCCGACGAC GCGCGCATCG GGCGGCGCAG GCAGGTCGCC
CTTGCCATGT CGGTGACGGT GGCCCTGTTC GTGGTGGGGC TGGCAGCCGC GGAATACAGC
GGTTTCGAGG CGTTCGACGG CGTGTTCGCC CGTGTGCGCA ACATCGTCTC CAACTGGTTG
TGA
 
Protein sequence
MRPGGREPQQ APGRESGSLD VRRYLVLLRE SIVTFCAIAV GVTLLGVAVS YVLPKRYEAH 
SSVSVEQNVV NELVKGIAIT PSLEAKLRIL KVSILSRKML LAVIRDLDMD LGRQGEQLEI
LIENTRKNVE ISYEEKKGLF YIRYRDASPE RARDFVNALT RRYIEESTAS KREESYEATR
FLSDQIEVFQ KRIEAAQKAI DAFKSEKGMI LSMNESILRE EIKETEHRLE ETRIRKNERL
AQLGILEKGT GGGRLAEKEA AYKALLGTYT AQHPDVVRAK AELDALRASG GGGGGRKGGV
DYQRIKVELE SLNEIERIQQ ELIEKDKRLL QELPAVQTEL QALQQARKNE TLIYEQLVTR
YGQSEVSKQM ELQDKAVSLR IIDPAILPIR PSTPNRPLIM LAGLLLGGAI GAGWVILSDQ
LFRKLRSVED LTAMGVVVLG ALPRIASPDD ARIGRRRQVA LAMSVTVALF VVGLAAAEYS
GFEAFDGVFA RVRNIVSNWL