Gene Information |
Locus tag | Dvul_1151 |
Symbol | |
ID | 4662787 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfovibrio vulgaris DP4 |
Kingdom | Bacteria |
Replicon accession | NC_008751 |
Strand | + |
Start bp | 1404581 |
End bp | 1406509 |
Gene Length | 1929 bp |
Protein Length | 642 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 639819381 |
Product | HEAT repeat-containing PBS lyase |
Protein accession | YP_966598 |
Protein GI | 120602198 |
COG category | [C] Energy production and conversion |
COG ID | [COG1413] FOG: HEAT repeat |
TIGRFAM ID | [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type |
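The length fields in the record above can be cross-checked against the coordinates. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (which matches the reported 1929 bp) and that the unspliced CDS ends with a stop codon. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose final codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


# Coordinates taken from the record: Start bp 1404581, End bp 1406509.
length = gene_length_bp(1404581, 1406509)
print(length, protein_length_aa(length))  # 1929 642
```

Both values agree with the Gene Length and Protein Length rows of the record.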
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.181025 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCGGAAG AACAGCATAT CCTAGAGGTC TTGCAGAGCG ACGACGCCGA GCAGGTCCGC GAGGCGGCCT ACTCGGCAGG AGAACTGCGC CTGGAGGCAG CCGTCCCCCA TCTTGTGCGC CATTTGCAAA GTCAGAACAT CGGGGTGCAA GAAGCTGTGG ACAACGCCCT TCGCAAGATC GGCGGCGCGG CTGCCGTGGC GGGGGTCATA CCCCTTCTGC GCTCGGACGA TGCGCCCATC CGCAACATAT CCATGGATAT CCTGCGTGAC ATCGGGCGTG ACGAATTCGA TGCGCTCAAG CAATTGCTTC ATGACGAAGA CCCGGACATC CGCATCTTCG CCTCTGACAT CCTCGGCACC TCCGGCAGTA TCCTCGCCGT GCCCGCACTG TGCGAGGCGC TCCTGCGCGA CCCCGAGGTG AACGTACGCT ATCAGGCGGC GGTGAGCCTT GGTACGCTGG CCTTCCCCGA GGCGGCCGAC TGCCTCAACA AGGCCATGCA GGACGAGGAA TGGGTGCAGT TCTCCGTCAT CGAGGCCCTC ATCAAGATTC GCGCAGAATC TTCGGTCAAC GCCCTCGTGA AGGCTCTCGA TTCCACCTCC GACCTCGTGG CGTCCATGAT CGTCGACGCG CTCGGCGAGA TGGGCAACCT CAAGGCCGTG CCGCTTCTGC TCAAGCGTAT CGAGAAGTCA CCGACGCCGC TGCGGAACAA GATCGTGCGG GCCATCGTCT CCATCCTCGG CAAGAAGTCG CTCTCGCTTC TGGGCGAAAG GGAACAGCAG CGTCTTGGCG CCTACCTGCT TGCCGCACTT GATGACGAAG ACGAGGAAGT GCAGGACGCC GCCATGGTCG GGCTTGCCAG CCTCGGGCAG GCAGAGGCGA CCGTGGCCGT GCTGCGGCTT GCGGCACGAC TTGACCCCGA CCGTGACCAT GAGCGCCTCG AAGCCGCCAT TCGCTGCATC GGTGGCATCG GGTTCAACGA AGCGGTGGAA GAGGCCCTCA GCGACGAGAA CGAATCCACC ATCCTCATGG TGGTTGAAGC GGCCGCGGAT ATGAATCCCG CCGAGATCGT CCCGGCCCTG AAACGGATAT TCTGGGACAA GGGGCGCGAC GCGCAGCGGG CCATCGCCAC GCAACTGGCG CGCATCGCAA GCCTTGAGGA CGTGGACTTC TTCCTCGACC TGCTCGAACG GGCCGAAGAC GCCCATGTCA TCAAGGCTGC ACTGCACTTC CTCGGACACC GCGCGAAGCC AGCGGGCGTG GGCGAACGCA TGCTCGCCCT GCTCGACCAT CCCTACGACG ATGTCAAAGA GGCCGCGCTC GAAGCCTGCA TCGCGCTACA GGATGTCTCA CTCTGCGCGA GCTTCAGGGA GCGTTTCCAT TCCGGAGACC CGTTGCAGCG CATGATGGCG ACCTATGCCA TGGGGCGGCT TGACGTCGAC GGCAACCTCG ACGTGCTGCG TGAGGCGCTG GTGGATGATG TGCCTGACGT GCGCAAGGTG GCGCTAGAGG CTCTTGCCGG AACGTGTCCC ATCACGCCCG AGCATCTGGA ACTCGTCGTG CCGCTCATGC ACGACCCCGT ACGTGAAGTG CGGCTCGCCC TCGTCGACCT TTTCGGGGCG TGCCCTGAGG AATCGGTCAT CGGCCACCTG CTGGAGGCGC TGGATGACGA GGATGACTGG GTCCGGGTGC GTGCCATCGA AGCCCTCGGT CGTCTGGGGC GGAAGGAATG CGTGACGCCG CTCATCGAAC GGGTGGATGG CGCAGGCAAC CTCGTGTTGC TCAAGATCAT CGAAGCCCTT 
GGTTCCATTG GCGGCAACAT GGCGTTCCGG GCACTTCTCG CGCTCATGGA GCATGAAGAC CCCGAGATTC AGCAGGCTGC CGAAGATGCT GTCGCTCGTC TGGGCGAACA AGACAGCGAG GGTGAGTGA
|
Protein sequence | MSEEQHILEV LQSDDAEQVR EAAYSAGELR LEAAVPHLVR HLQSQNIGVQ EAVDNALRKI GGAAAVAGVI PLLRSDDAPI RNISMDILRD IGRDEFDALK QLLHDEDPDI RIFASDILGT SGSILAVPAL CEALLRDPEV NVRYQAAVSL GTLAFPEAAD CLNKAMQDEE WVQFSVIEAL IKIRAESSVN ALVKALDSTS DLVASMIVDA LGEMGNLKAV PLLLKRIEKS PTPLRNKIVR AIVSILGKKS LSLLGEREQQ RLGAYLLAAL DDEDEEVQDA AMVGLASLGQ AEATVAVLRL AARLDPDRDH ERLEAAIRCI GGIGFNEAVE EALSDENEST ILMVVEAAAD MNPAEIVPAL KRIFWDKGRD AQRAIATQLA RIASLEDVDF FLDLLERAED AHVIKAALHF LGHRAKPAGV GERMLALLDH PYDDVKEAAL EACIALQDVS LCASFRERFH SGDPLQRMMA TYAMGRLDVD GNLDVLREAL VDDVPDVRKV ALEALAGTCP ITPEHLELVV PLMHDPVREV RLALVDLFGA CPEESVIGHL LEALDDEDDW VRVRAIEALG RLGRKECVTP LIERVDGAGN LVLLKIIEAL GSIGGNMAFR ALLALMEHED PEIQQAAEDA VARLGEQDSE GE
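The sequences above are printed in space-separated blocks of ten characters. The sketch below shows one way to strip that blocking, compute a GC fraction, and translate the nucleotide sequence. It assumes that translation table 11 uses the same codon-to-residue assignments as the standard genetic code and differs only in which start codons are permitted. All names (`clean_sequence`, `gc_fraction`, `translate`) are hypothetical helpers written for illustration.

```python
# Codon table in the usual TCAG ordering; assumed valid for translation
# table 11, whose codon assignments match the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(blocked: str) -> str:
    """Remove the spaces and line breaks used to print the sequence in blocks."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    residues = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        residues.append(residue)
    return "".join(residues)


# First three blocks of the gene sequence in this record.
first_blocks = "ATGTCGGAAG AACAGCATAT CCTAGAGGTC"
print(translate(clean_sequence(first_blocks)))  # MSEEQHILEV
```

The translated prefix matches the first block of the protein sequence. Applying `gc_fraction` to the full cleaned gene sequence would give a value to compare with the GC content row.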