Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | VC0395_A0044 |
Symbol | rpoD |
ID | 5136802 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Vibrio cholerae O395 |
Kingdom | Bacteria |
Replicon accession | NC_009457 |
Strand | - |
Start bp | 41202 |
End bp | 43067 |
Gene Length | 1866 bp |
Protein Length | 621 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 640531504 |
Product | RNA polymerase sigma factor RpoD |
Protein accession | YP_001216017 |
Protein GI | 147674012 |
COG category | [K] Transcription |
COG ID | [COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32) |
TIGRFAM ID | [TIGR02393] RNA polymerase sigma factor RpoD, C-terminal domain [TIGR02937] RNA polymerase sigma factor, sigma-70 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.00416651 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGATCAAA ATCCGCAGTC ACAGCTTAAA CAACTTGTCC TTCGCGGCAA GGAACAGGGC TATCTGACCT ACGCCGAAGT AAATGACCAC TTGCCTGCTG AAATCGTTGA TTCAGAGCAG GTGGAAGATA TTATTCAGAT GATTAATGAC ATGGGCATTA AGGTGGTGGA AACCGCACCT GATGCCGATG ATCTTGCCCT CAGCGATGAT ACTACCATCA CTGACGAAGA TGCTGCTGAA GCAGCCGCCG CGGCACTTTC TAGCGTAGAG AGTGAGATTG GCCGCACCAC CGATCCAGTA CGTATGTATA TGCGTGAAAT GGGTACGGTT GAGCTTTTGA CACGTGAAGG TGAAATCGAT ATTGCCAAGC GCATTGAAGA TGGTATTAAC CAAGTTCAAA GTGCGATTGC TGAGTATCCT GGAACCATCC CTTATATTCT TGAGCAGTTT GATCGTGTTC AGGCCGAAGA GCTACGTCTC ACTGACCTGA TTTCAGGTTT CGTTGACCCT AACGACATGG AAACCGAAGC GCCAACCGCG ACTCACATCG GTTCTGAGCT TTCTGAAGCG GATCTCGCGG ATGAAGATGA TGCTGTCGTC GAAGATGAAG ACGAAGATGG CGACGGTGAA AGCAGCGACA GCGAAGAAGA AGTCGGTATC GACCCTGAAC TGGCTCGTGA GAAATTCAAT GAACTGCGCG GTAAGTTCCA AAACCTGCAA TTAGCGGTTA ATGAATTTGG TCGTGACAGT CATCAAGCTT CTGAAGCGTC AGACTTAGTG CTGGATATCT TCCGTGAATT CCGCCTAACA CCAAAGCAAT TCGACCACTT GGTTGAAACT CTGCGCACTT CAATGGATCG TGTTCGCACC CAAGAACGTT TGGTAATGAA AGCGGTAGTT GAAGTCGCGA AGATGCCGAA GAAATCGTTC ATCGCCCTAT TTACAGGCAA TGAATCGAAT GAAGAGTGGC TGGATAAAGT CCTTGCTTCT GACAAGCCTT ACGTAGCGAA AGTCCGTGAG CAAGAAGAAG AGATCCGCCG TTCAATTCAG AAACTACAAA TGATCGAGCA AGAGACATCA CTGTCTGTTG AACGCATCAA AGACATCAGC CATCGTATGT CAATCGGTGA GGCGAAAGCT CGCCGTGCGA AGAAAGAGAT GGTTGAAGCA AACTTACGTC TGGTAATTTC GATTGCTAAG AAATACACCA ACCGTGGTCT GCAATTCTTG GATCTGATCC AAGAAGGTAA CATCGGTTTG ATGAAAGCCG TCGATAAGTT TGAATACCGT CGTGGTTATA AATTCTCAAC TTATGCGACT TGGTGGATCC GTCAGGCAAT CACTCGTTCT ATTGCTGACC AAGCACGTAC GATTCGTATT CCAGTACACA TGATCGAGAC GATCAATAAA CTGAATCGAA TTTCGCGCCA AATGCTGCAA GAGATGGGTC GTGAGCCACT GCCTGAAGAA TTGGCAGAAC GCATGCAAAT GCCAGAAGAC AAAATCCGCA AAGTGCTGAA AATTGCCAAA GAACCCATCT CCATGGAAAC ACCGATTGGT GATGATGAAG ATTCGCATCT GGGTGATTTT ATCGAGGATA CCACCCTCGA GCTGCCACTG GATTCTGCGA CCGCAACCAG CCTTAAAGCT GCCACTCGCG ATGTATTAGC AGGCCTAACA CCTCGTGAAG CGAAAGTGCT ACGTATGCGT TTCGGTATCG ATATGAATAC CGACCATACT CTGGAAGAAG TGGGCAAGCA GTTTGATGTA ACTCGTGAAC GTATTCGTCA GATTGAAGCA AAAGCACTGC GTAAACTGCG TCATCCAAGC CGCTCAGAAG TTCTGCGCAG CTTCTTGGAT GAATAA
|
Protein sequence | MDQNPQSQLK QLVLRGKEQG YLTYAEVNDH LPAEIVDSEQ VEDIIQMIND MGIKVVETAP DADDLALSDD TTITDEDAAE AAAAALSSVE SEIGRTTDPV RMYMREMGTV ELLTREGEID IAKRIEDGIN QVQSAIAEYP GTIPYILEQF DRVQAEELRL TDLISGFVDP NDMETEAPTA THIGSELSEA DLADEDDAVV EDEDEDGDGE SSDSEEEVGI DPELAREKFN ELRGKFQNLQ LAVNEFGRDS HQASEASDLV LDIFREFRLT PKQFDHLVET LRTSMDRVRT QERLVMKAVV EVAKMPKKSF IALFTGNESN EEWLDKVLAS DKPYVAKVRE QEEEIRRSIQ KLQMIEQETS LSVERIKDIS HRMSIGEAKA RRAKKEMVEA NLRLVISIAK KYTNRGLQFL DLIQEGNIGL MKAVDKFEYR RGYKFSTYAT WWIRQAITRS IADQARTIRI PVHMIETINK LNRISRQMLQ EMGREPLPEE LAERMQMPED KIRKVLKIAK EPISMETPIG DDEDSHLGDF IEDTTLELPL DSATATSLKA ATRDVLAGLT PREAKVLRMR FGIDMNTDHT LEEVGKQFDV TRERIRQIEA KALRKLRHPS RSEVLRSFLD E
|
| |