Gene Dvul_1151 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDvul_1151 
Symbol 
ID4662787 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfovibrio vulgaris DP4 
KingdomBacteria 
Replicon accessionNC_008751 
Strand
Start bp1404581 
End bp1406509 
Gene Length1929 bp 
Protein Length642 aa 
Translation table11 
GC content63% 
IMG OID639819381 
ProductHEAT repeat-containing PBS lyase 
Protein accessionYP_966598 
Protein GI120602198 
COG category[C] Energy production and conversion 
COG ID[COG1413] FOG: HEAT repeat 
TIGRFAM ID[TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.181025 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGGAAG AACAGCATAT CCTAGAGGTC TTGCAGAGCG ACGACGCCGA GCAGGTCCGC 
GAGGCGGCCT ACTCGGCAGG AGAACTGCGC CTGGAGGCAG CCGTCCCCCA TCTTGTGCGC
CATTTGCAAA GTCAGAACAT CGGGGTGCAA GAAGCTGTGG ACAACGCCCT TCGCAAGATC
GGCGGCGCGG CTGCCGTGGC GGGGGTCATA CCCCTTCTGC GCTCGGACGA TGCGCCCATC
CGCAACATAT CCATGGATAT CCTGCGTGAC ATCGGGCGTG ACGAATTCGA TGCGCTCAAG
CAATTGCTTC ATGACGAAGA CCCGGACATC CGCATCTTCG CCTCTGACAT CCTCGGCACC
TCCGGCAGTA TCCTCGCCGT GCCCGCACTG TGCGAGGCGC TCCTGCGCGA CCCCGAGGTG
AACGTACGCT ATCAGGCGGC GGTGAGCCTT GGTACGCTGG CCTTCCCCGA GGCGGCCGAC
TGCCTCAACA AGGCCATGCA GGACGAGGAA TGGGTGCAGT TCTCCGTCAT CGAGGCCCTC
ATCAAGATTC GCGCAGAATC TTCGGTCAAC GCCCTCGTGA AGGCTCTCGA TTCCACCTCC
GACCTCGTGG CGTCCATGAT CGTCGACGCG CTCGGCGAGA TGGGCAACCT CAAGGCCGTG
CCGCTTCTGC TCAAGCGTAT CGAGAAGTCA CCGACGCCGC TGCGGAACAA GATCGTGCGG
GCCATCGTCT CCATCCTCGG CAAGAAGTCG CTCTCGCTTC TGGGCGAAAG GGAACAGCAG
CGTCTTGGCG CCTACCTGCT TGCCGCACTT GATGACGAAG ACGAGGAAGT GCAGGACGCC
GCCATGGTCG GGCTTGCCAG CCTCGGGCAG GCAGAGGCGA CCGTGGCCGT GCTGCGGCTT
GCGGCACGAC TTGACCCCGA CCGTGACCAT GAGCGCCTCG AAGCCGCCAT TCGCTGCATC
GGTGGCATCG GGTTCAACGA AGCGGTGGAA GAGGCCCTCA GCGACGAGAA CGAATCCACC
ATCCTCATGG TGGTTGAAGC GGCCGCGGAT ATGAATCCCG CCGAGATCGT CCCGGCCCTG
AAACGGATAT TCTGGGACAA GGGGCGCGAC GCGCAGCGGG CCATCGCCAC GCAACTGGCG
CGCATCGCAA GCCTTGAGGA CGTGGACTTC TTCCTCGACC TGCTCGAACG GGCCGAAGAC
GCCCATGTCA TCAAGGCTGC ACTGCACTTC CTCGGACACC GCGCGAAGCC AGCGGGCGTG
GGCGAACGCA TGCTCGCCCT GCTCGACCAT CCCTACGACG ATGTCAAAGA GGCCGCGCTC
GAAGCCTGCA TCGCGCTACA GGATGTCTCA CTCTGCGCGA GCTTCAGGGA GCGTTTCCAT
TCCGGAGACC CGTTGCAGCG CATGATGGCG ACCTATGCCA TGGGGCGGCT TGACGTCGAC
GGCAACCTCG ACGTGCTGCG TGAGGCGCTG GTGGATGATG TGCCTGACGT GCGCAAGGTG
GCGCTAGAGG CTCTTGCCGG AACGTGTCCC ATCACGCCCG AGCATCTGGA ACTCGTCGTG
CCGCTCATGC ACGACCCCGT ACGTGAAGTG CGGCTCGCCC TCGTCGACCT TTTCGGGGCG
TGCCCTGAGG AATCGGTCAT CGGCCACCTG CTGGAGGCGC TGGATGACGA GGATGACTGG
GTCCGGGTGC GTGCCATCGA AGCCCTCGGT CGTCTGGGGC GGAAGGAATG CGTGACGCCG
CTCATCGAAC GGGTGGATGG CGCAGGCAAC CTCGTGTTGC TCAAGATCAT CGAAGCCCTT
GGTTCCATTG GCGGCAACAT GGCGTTCCGG GCACTTCTCG CGCTCATGGA GCATGAAGAC
CCCGAGATTC AGCAGGCTGC CGAAGATGCT GTCGCTCGTC TGGGCGAACA AGACAGCGAG
GGTGAGTGA
 
Protein sequence
MSEEQHILEV LQSDDAEQVR EAAYSAGELR LEAAVPHLVR HLQSQNIGVQ EAVDNALRKI 
GGAAAVAGVI PLLRSDDAPI RNISMDILRD IGRDEFDALK QLLHDEDPDI RIFASDILGT
SGSILAVPAL CEALLRDPEV NVRYQAAVSL GTLAFPEAAD CLNKAMQDEE WVQFSVIEAL
IKIRAESSVN ALVKALDSTS DLVASMIVDA LGEMGNLKAV PLLLKRIEKS PTPLRNKIVR
AIVSILGKKS LSLLGEREQQ RLGAYLLAAL DDEDEEVQDA AMVGLASLGQ AEATVAVLRL
AARLDPDRDH ERLEAAIRCI GGIGFNEAVE EALSDENEST ILMVVEAAAD MNPAEIVPAL
KRIFWDKGRD AQRAIATQLA RIASLEDVDF FLDLLERAED AHVIKAALHF LGHRAKPAGV
GERMLALLDH PYDDVKEAAL EACIALQDVS LCASFRERFH SGDPLQRMMA TYAMGRLDVD
GNLDVLREAL VDDVPDVRKV ALEALAGTCP ITPEHLELVV PLMHDPVREV RLALVDLFGA
CPEESVIGHL LEALDDEDDW VRVRAIEALG RLGRKECVTP LIERVDGAGN LVLLKIIEAL
GSIGGNMAFR ALLALMEHED PEIQQAAEDA VARLGEQDSE GE