Gene Dvul_2032 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDvul_2032 
Symbol 
ID4662468 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfovibrio vulgaris DP4 
KingdomBacteria 
Replicon accessionNC_008751 
Strand
Start bp2366308 
End bp2368644 
Gene Length2337 bp 
Protein Length778 aa 
Translation table11 
GC content61% 
IMG OID639820275 
Productorganic solvent tolerance protein 
Protein accessionYP_967475 
Protein GI120603075 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1452] Organic solvent tolerance protein OstA 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTCTCG GCAGAAAGAC ACGCGCCGTC GCGGCGGCGT TGATGGCGTT TGTGTGTTGC 
TGCGTGGCCG AGGCCACCAC CCCCGCCCCG GTGCTCGCCT CGGCGAGCGT GCTCGAGATG
CGCACCGACG ATGCCGAGAC CGTGACATGG CACCTCACTG CGGACAACCT CTCCACACTC
AACGAGAGCA AGATCCTCGA AGCCACCGGC CAGGTCGCCC TGCGGCGTGG TAACGAATTT
TTCAAGGCCG ACTACGCCCG CTACTACTCC ACGACCAACT GGGTCTACCT CAAGGGCAAC
GTCGAGGTGT TCACCGGCAC CGAGACCATA CGTGCCGAAG AAGCCGAATT CGACCTGCGC
AGCCGCACGG GATGGATGCA GAAGGGCGAG ATATTCATGG AAGGCCCGCA CATCTATTTC
ACGGGTGAGC GGGTCACCAA GAAATGGGGC GACTACTACA CCTTTGAAAA GGCCAAGGTG
ACCTCGTGCC CTCCCGACGG TGAGGCGTGG TCCATGAATG CCGAGCAGGC CGTGGTCGAG
ATCGACGGCT ACGCCCAACT CTTCGGTGCC ACATTCGATG TGGCCGACAC CAGTCTTGCC
TACTCGCCCT ACATGATACT GCCCGCCAAG CGTACCCGCC AGACGGGTCT TCTCATGCCG
GAGTACGGTA TGAGTACCCG CCGGGGCGTG TACTACAACC AGCCGTTCTA CTGGGCAGTC
GACGACCGGC GAGACGTCAC CATCAACGAG TACTGGATGG AGAAACGCGG CTTCATGCAC
GGCGTGGAGT ACCGCTCGCG CGAGGCCTCG GACACCGCCA TGTGGATGCG CTTCGACTGG
CTTGACGACC TCACGACCGT CAAGAACGAC GCCGACGACC CCATCGCCAA GGACGGCCTG
GTACGCACCA ACAGCGAGCG ATTCTGGCTG CGCGGCATGA CCGAAGGCAG ACTCGGCGAC
CCCGACTGGC GATACAAGCT CGACCTCGAC TACGTGTCGG ACCAGAACTT CCTGCGTGAG
TTCAACAGCG GCATGTCAGG CTTCGGGAAA TCCCGTTCAC AACTCTTCGA ACTGTTCGGC
CGCGACATCC GTGAACTTGA CCGCAACCGC ATCTCGCAGG GCATGGTCTA CCGTGAATGG
GACAGGGCCA CAGTGGCCCT GAGCGCCCGC TACGAGCAGG ACCCTTCGCT CGGGCACGGC
AACAAGAACT ACAGCGCCGA CACCACCGTC CAGCGCCTGC CCCAACTTGA CGTGTTCCTG
CACAAGGGCA GGGCCTTCGA GGAAATGCCC CTCGAAATCG AGGCCGAAGC CCAGACGGTG
TATTTCCACC GCCGCAACGG CACGCAGGGC GGACGTACCG AAATAGCGCC ACGTGTGACG
CTGCCCATGA ACAGCCGTTA CGGTTCCGTC ATCGCCAGCA TGGGCTGGCG ACAGACCATG
TATAATACGG AACGAGCGAA GGGACTCGAT GGCGAACCGG CACCCACCGG TGACTACCGC
TCACTGCCTG ACTACAACGT CGCGGCGTTC ACCGAAGTGG GCCGCGTGTG GGACCTTGAA
GACGGCGCCC TCGACCCCGC CACAGAAGAG GTCGGCACAC GCCACTGGAA GTCCGTCTAC
CACCGGGTGC AACCCCGTAT CGAGTACCGC AATATCGCCA ACGTCGATCA GGACCGTAAC
CCCTACTACG ACGACAGCGA CAGAATCGGC CCCAGAAGCG AACTCGTCTA CTCCGTCACC
AATATCCTCA CCCGCAAGCG TGGCACGGTG GTGCCTGCAC GCGACCCTGA CAGCGAAGAG
GAGTACCAGA AGGTCGCCTA CGACTATCTC GACGTGATAC GTTGGCGTCT TGAAAGCGGT
TACGACCTGC GCGAGATGGA CCGCAACGAC GACCTCGAAG ACTATCCGCG CCGTCCTTTC
ATGGACATGA TCTCCGACCT TCGGTTCGGC GTGTTCGACT GGCTGAGCAT CTACACGCGC
AGCGACTGGT CGCCCTATCT TGGAGACTTC ACGCGGCACG ACCAGGGCGT GACCTTCACA
TCACAGCGGT GGGGCTATAT CAGCACGGGG TACAGCTACA GACCGAAGCT GGACGAATAC
AACCGGAAAC GCGCCTATAC AGGCAACCTC GCCACGCTCA CCGCCGCCGT CAACATGTGG
GGGCCATGGT CGGCCGCCGG GCACATGTAC TATGACTTCG TGCGTCAGGA AGGCTACGAA
CGCGCTTTCG ACCTCATCTA CAGCAACCCC TGCTACAAGT TCATCCTGCG CTACACCTAC
GACGTCTATG ACGAAGGGAT AGAATTCCTC GTGGAGTTGC CCGGACTCAC GAACTGA
 
Protein sequence
MTLGRKTRAV AAALMAFVCC CVAEATTPAP VLASASVLEM RTDDAETVTW HLTADNLSTL 
NESKILEATG QVALRRGNEF FKADYARYYS TTNWVYLKGN VEVFTGTETI RAEEAEFDLR
SRTGWMQKGE IFMEGPHIYF TGERVTKKWG DYYTFEKAKV TSCPPDGEAW SMNAEQAVVE
IDGYAQLFGA TFDVADTSLA YSPYMILPAK RTRQTGLLMP EYGMSTRRGV YYNQPFYWAV
DDRRDVTINE YWMEKRGFMH GVEYRSREAS DTAMWMRFDW LDDLTTVKND ADDPIAKDGL
VRTNSERFWL RGMTEGRLGD PDWRYKLDLD YVSDQNFLRE FNSGMSGFGK SRSQLFELFG
RDIRELDRNR ISQGMVYREW DRATVALSAR YEQDPSLGHG NKNYSADTTV QRLPQLDVFL
HKGRAFEEMP LEIEAEAQTV YFHRRNGTQG GRTEIAPRVT LPMNSRYGSV IASMGWRQTM
YNTERAKGLD GEPAPTGDYR SLPDYNVAAF TEVGRVWDLE DGALDPATEE VGTRHWKSVY
HRVQPRIEYR NIANVDQDRN PYYDDSDRIG PRSELVYSVT NILTRKRGTV VPARDPDSEE
EYQKVAYDYL DVIRWRLESG YDLREMDRND DLEDYPRRPF MDMISDLRFG VFDWLSIYTR
SDWSPYLGDF TRHDQGVTFT SQRWGYISTG YSYRPKLDEY NRKRAYTGNL ATLTAAVNMW
GPWSAAGHMY YDFVRQEGYE RAFDLIYSNP CYKFILRYTY DVYDEGIEFL VELPGLTN