Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SeHA_C3465 |
Symbol | rpoD |
ID | 6487951 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Heidelberg str. SL476 |
Kingdom | Bacteria |
Replicon accession | NC_011083 |
Strand | + |
Start bp | 3368557 |
End bp | 3370404 |
Gene Length | 1848 bp |
Protein Length | 615 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 642743594 |
Product | RNA polymerase sigma factor RpoD |
Protein accession | YP_002047209 |
Protein GI | 194451693 |
COG category | [K] Transcription |
COG ID | [COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32) |
TIGRFAM ID | [TIGR02393] RNA polymerase sigma factor RpoD, C-terminal domain [TIGR02937] RNA polymerase sigma factor, sigma-70 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.268708 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 51 |
Fosmid unclonability p-value | 0.00491171 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGAGCAAA ACCCGCAGTC ACAGCTGAAA CTTCTTGTCA CCCGTGGTAA GGAGCAAGGC TATCTGACCT ATGCTGAGGT CAATGACCAT CTGCCGGAAG ATATCGTCGA TTCAGATCAA ATTGAAGATA TCATCCAAAT GATCAACGAC ATGGGTATTC AGGTAATGGA AGAAGCGCCT GATGCCGATG ATCTGCTGCT GGCTGAAAAT ACCACCAGCA CCGATGAAGA TGCGGAAGAA GCTGCTGCAC AAGTTCTGTC CAGTGTTGAG TCTGAAATCG GTCGTACGAC TGACCCGGTA CGCATGTATA TGCGTGAAAT GGGCACTGTT GAACTGTTGA CCCGGGAAGG CGAAATCGAC ATCGCTAAAC GTATCGAAGA CGGGATCAAC CAGGTTCAAT GCTCCGTTGC CGAATACCCG GAAGCCATTA CCTATCTGCT GGAACAGTAC GATCGCGTTG AGGCTGAAGA GGCTCGTTTG TCCGATCTTA TCACCGGCTT TGTCGATCCG AACGCGGAAG AAGAGATGGC GCCGACCGCA ACTCACGTCG GTTCTGAACT CTCCCAGGAA GACCTGGATG ATGACGAAGA CGAAGATGAA GAAGACGGCG ACGATGACGC CGCCGATGAC GACAACAGCA TTGACCCTGA ACTGGCACGC GAAAAATTCG CTGAGCTGCG CGCGCAATAC GTCGTTACGC GCGACACCAT CAAAGCGAAA GGCCGCAGCC ATGCTGCCGC GCAGGAAGAG ATTCTGAAGC TGTCTGAAGT GTTCAAACAG TTCCGTCTGG TACCGAAGCA ATTCGACTAT CTGGTCAACA GTATGCGCGT GATGATGGAT CGCGTGCGTA CCCAGGAACG TCTGATCATG AAGCTCTGCG TCGAGCAGTG CAAAATGCCG AAGAAGAACT TTATCACGCT GTTTACCGGT AATGAAACCA GCGAAACCTG GTTCAATGCC GCTATCGCGA TGAACAAACC GTGGTCGGAA AAACTGCACG ATGTCGCGGA AGAAGTGCAA CGCTGCCTGC AAAAACTGCG GCAGATTGAA GAAGAGACCG GTCTGACCAT CGAACAGGTG AAAGACATCA ACCGTCGCAT GTCCATCGGG GAAGCGAAAG CCCGCCGTGC GAAGAAAGAG ATGGTTGAAG CGAACTTGCG TCTGGTTATT TCTATCGCTA AGAAATACAC CAACCGTGGC TTGCAATTCC TTGATCTGAT TCAGGAAGGC AACATCGGTC TGATGAAAGC GGTAGATAAG TTCGAATACC GTCGCGGCTA CAAATTCTCC ACCTATGCAA CCTGGTGGAT CCGTCAGGCG ATCACCCGTT CTATCGCCGA TCAGGCGCGC ACCATCCGTA TTCCGGTGCA TATGATTGAG ACCATCAACA AGCTCAACCG TATTTCTCGC CAGATGCTGC AAGAGATGGG CCGCGAGCCA ACGCCGGAAG AGCTGGCTGA ACGGATGCTG ATGCCGGAAG ATAAAATTCG TAAGGTGCTA AAGATTGCCA AAGAGCCAAT CTCCATGGAA ACGCCGATCG GCGACGATGA AGATTCGCAT CTGGGTGATT TCATCGAGGA TACCACCCTC GAGCTGCCGC TGGACTCTGC CACTACCGAG AGCCTGCGTG CCGCCACTCA CGACGTTTTG GCTGGCCTGA CCGCTCGTGA AGCGAAAGTG CTGCGTATGC GTTTCGGTAT CGATATGAAC ACCGACCACA CGCTGGAAGA AGTGGGTAAA CAGTTCGATG TTACCCGCGA ACGTATCCGT CAGATCGAAG CGAAGGCGCT GCGTAAACTG CGCCACCCGA GCCGTTCTGA AGTGCTGCGC AGCTTCCTCG ACGATTAA
|
Protein sequence | MEQNPQSQLK LLVTRGKEQG YLTYAEVNDH LPEDIVDSDQ IEDIIQMIND MGIQVMEEAP DADDLLLAEN TTSTDEDAEE AAAQVLSSVE SEIGRTTDPV RMYMREMGTV ELLTREGEID IAKRIEDGIN QVQCSVAEYP EAITYLLEQY DRVEAEEARL SDLITGFVDP NAEEEMAPTA THVGSELSQE DLDDDEDEDE EDGDDDAADD DNSIDPELAR EKFAELRAQY VVTRDTIKAK GRSHAAAQEE ILKLSEVFKQ FRLVPKQFDY LVNSMRVMMD RVRTQERLIM KLCVEQCKMP KKNFITLFTG NETSETWFNA AIAMNKPWSE KLHDVAEEVQ RCLQKLRQIE EETGLTIEQV KDINRRMSIG EAKARRAKKE MVEANLRLVI SIAKKYTNRG LQFLDLIQEG NIGLMKAVDK FEYRRGYKFS TYATWWIRQA ITRSIADQAR TIRIPVHMIE TINKLNRISR QMLQEMGREP TPEELAERML MPEDKIRKVL KIAKEPISME TPIGDDEDSH LGDFIEDTTL ELPLDSATTE SLRAATHDVL AGLTAREAKV LRMRFGIDMN TDHTLEEVGK QFDVTRERIR QIEAKALRKL RHPSRSEVLR SFLDD
|
| |