Gene Information |
Locus tag | Dvul_1549 |
Symbol | |
ID | 4662551 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfovibrio vulgaris DP4 |
Kingdom | Bacteria |
Replicon accession | NC_008751 |
Strand | + |
Start bp | 1839178 |
End bp | 1840248 |
Gene Length | 1071 bp |
Protein Length | 356 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 639819782 |
Product | sigma-70 region 2 domain-containing protein |
Protein accession | YP_966993 |
Protein GI | 120602593 |
COG category | [K] Transcription |
COG ID | [COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32) |
TIGRFAM ID | [TIGR02392] alternative sigma factor RpoH; [TIGR02937] RNA polymerase sigma factor, sigma-70 family |
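The length fields in this record can be cross-checked against the coordinates. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the CDS ends with a single stop codon. The record does not state either convention, though its numbers are consistent with both. The function names are hypothetical and only illustrate the arithmetic.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS, excluding one terminal stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


# Values taken from the record above (Dvul_1549).
length = gene_length(1839178, 1840248)
print(length)                  # 1071
print(protein_length(length))  # 356
```

Both results agree with the Gene Length (1071 bp) and Protein Length (356 aa) fields.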
| |
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.00000354786 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCAAAC GCCCGAAAGC GCAACACGAA GACATCGCCG ATGAGAAGCA CGTCACCGTA GTCGACGACG TCGAGATCAT CGACGGCCCG GACGATTCGG CTGACGACAC CGAAGATACC GAAGATTTCG TCGACGAAGA CGATGTCATA GACATCGGCG ACGACGACCA CGCGCCCGAC ACGTTCCACC TCAACGCTCC TGCCACGGTA TCCACCGGGA AGGACAGCCT GCACCTCTAC CTGCGCGAGA TAAGCCGCTT TCCCATGCTC AAGCCCGAAG AGGAGTATGA GCTGGCGAAG CGTGTCCGCG AAACGGGCGA CGGTGATGCG GCCTTTCGCC TCGTCTCGTC GCACCTGCGT CTCGTGGTGA AGATCGCCAT GGACTTCCAG CGGCGCTGGA TGCAGAACGT GCTCGACCTC ATCCAGGAGG GCAACGTCGG CCTCATGCGC GCGGTGAACA AGTTCGATCC CGAAAAGGGC ATCAAGTTCT CGTATTACGC CGCCTTCTGG ATCAAGGCCT ACATCCTCAA GTTCATCATG GACAACTGGC GGATGGTCAA GATCGGCACC ACGCAGGCGC AACGCAAGCT GTTCTACAAT CTCAACAAGG AACGACAGAA GCTCATCCTG CAGGGCTACG ACCCGGACGC AGCCACCCTG TCGGAACGCC TGAACGTGAC CAAGGAACAG GTCGTGGAGA TGGAACAGCG CCTCGACGCT TCCGACGTGT CACTCGACAT CCCGGTGGGT GACGAGGGCG GCGGGGCTTC GCGCATGGAC TTCCTGCCCG CACTCGGCCC CGGCATCGAG GACGCACTGT CGAACCATGA GATTGCCAGC ATGGTGCAAA ACCGTCTGCA ATCCATCATT CCCAAGCTTT CCGACAAGGA AGTGGACATC CTGCAGAACA GGCTTCTTTC TGAAGAACCA GTCACCTTGC GCGAGATTGG CGAGAAATAC GACATCACCC GTGAACGCGT CCGCCAGATA GAGGCGCGTC TGCTGCAAAA GATACGCGAC CACCTGTTCA AGGAAATCAA GGACTTTTCA TCCGACTGGA TCAACCAGTA G
|
Protein sequence | MTKRPKAQHE DIADEKHVTV VDDVEIIDGP DDSADDTEDT EDFVDEDDVI DIGDDDHAPD TFHLNAPATV STGKDSLHLY LREISRFPML KPEEEYELAK RVRETGDGDA AFRLVSSHLR LVVKIAMDFQ RRWMQNVLDL IQEGNVGLMR AVNKFDPEKG IKFSYYAAFW IKAYILKFIM DNWRMVKIGT TQAQRKLFYN LNKERQKLIL QGYDPDAATL SERLNVTKEQ VVEMEQRLDA SDVSLDIPVG DEGGGASRMD FLPALGPGIE DALSNHEIAS MVQNRLQSII PKLSDKEVDI LQNRLLSEEP VTLREIGEKY DITRERVRQI EARLLQKIRD HLFKEIKDFS SDWINQ
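The protein sequence can be related back to the gene sequence by translating codon by codon. The following is a minimal sketch, not the database's own pipeline. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the tables differ only in permitted start codons, which does not matter here because the gene begins with ATG). The names `CODON_TABLE` and `translate` are hypothetical, and only the first three and last three 10-base blocks of the gene sequence are used to keep the example short.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (third base varies fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, ignoring whitespace, up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 and last 21 bases of the gene sequence above (both are in frame).
head = "ATGACCAAAC GCCCGAAAGC GCAACACGAA"
tail = "TCCGACTGGA TCAACCAGTA G"
print(translate(head))  # MTKRPKAQHE
print(translate(tail))  # SDWINQ
```

The two outputs match the first ten and last six residues of the protein sequence. Passing the full gene sequence to `translate` would allow the same comparison over the whole record.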