Gene SeD_A3567 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeD_A3567 
SymbolrpoD 
ID6871473 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Dublin str. CT_02021853 
KingdomBacteria 
Replicon accessionNC_011205 
Strand
Start bp3423448 
End bp3425295 
Gene Length1848 bp 
Protein Length615 aa 
Translation table11 
GC content53% 
IMG OID642786555 
ProductRNA polymerase sigma factor RpoD 
Protein accessionYP_002217192 
Protein GI198242489 
COG category[K] Transcription 
COG ID[COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32) 
TIGRFAM ID[TIGR02393] RNA polymerase sigma factor RpoD, C-terminal domain
[TIGR02937] RNA polymerase sigma factor, sigma-70 family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.942992 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones55 
Fosmid unclonability p-value0.234837 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAGCAAA ACCCGCAGTC ACAGCTGAAA CTTCTTGTCA CCCGTGGTAA GGAGCAAGGC 
TATCTGACCT ATGCTGAGGT CAATGACCAT CTGCCGGAAG ATATCGTCGA TTCAGATCAA
ATTGAAGATA TCATCCAAAT GATCAACGAC ATGGGTATTC AGGTAATGGA AGAAGCGCCT
GATGCCGATG ATCTGCTGCT GGCTGAAAAT ACCACCAGCA CCGATGAAGA TGCGGAAGAA
GCTGCTGCAC AAGTTCTGTC CAGTGTTGAG TCTGAAATCG GTCGTACGAC TGACCCGGTA
CGCATGTATA TGCGTGAAAT GGGCACTGTT GAACTGTTGA CCCGGGAAGG CGAAATCGAC
ATCGCTAAAC GTATCGAAGA CGGGATCAAC CAGGTTCAAT GCTCCGTTGC CGAATACCCG
GAAGCCATTA CCTATCTGCT GGAACAGTAC GATCGCGTTG AGGCTGAAGA GGCTCGTTTG
TCCGATCTTA TCACCGGCTT TGTCGATCCG AACGCGGAAG AAGAGATGGC GCCGACCGCA
ACTCACGTCG GTTCTGAACT CTCCCAGGAA GACCTGGATG ATGACGAAGA CGAAGATGAA
GAAGACGGCG ACGATGACGC CGCCGATGAC GACAACAGCA TTGACCCTGA ACTGGCACGC
GAAAAATTCG CTGAGCTGCG CGCGCAATAC GTCGTTACGC GCGACACCAT CAAAGCGAAA
GGCCGCAGCC ATGCTGCCGC GCAGGAAGAG ATTCTGAAGC TGTCTGAAGT GTTCAAACAG
TTCCGTCTGG TACCGAAGCA ATTCGACTAT CTGGTCAACA GTATGCGCGT GATGATGGAT
CGCGTGCGTA CCCAGGAACG TCTGATCATG AAGCTCTGCG TCGAGCAGTG CAAAATGCCG
AAGAAGAACT TTATCACGCT GTTTACCGGT AACGAAACCA GCGAAACCTG GTTCAACGCC
GCTATCGCGA TGAACAAACC GTGGTCGGAA AAACTGCACG ATGTCGCGGA AGAAGTGCAA
CGCTGTCTGC AAAAACTGCG GCAGATTGAA GAAGAGACCG GTCTGACCAT CGAACAGGTG
AAAGACATCA ACCGTCGAAT GTCCATCGGG GAAGCGAAAG CCCGCCGTGC GAAGAAAGAG
ATGGTTGAAG CGAACTTACG TCTGGTTATT TCTATCGCTA AGAAATACAC CAACCGTGGC
TTGCAGTTCC TTGATCTGAT TCAGGAAGGC AACATCGGTC TGATGAAAGC GGTAGATAAG
TTTGAATACC GTCGCGGCTA CAAGTTCTCC ACCTATGCAA CCTGGTGGAT CCGTCAGGCG
ATCACCCGTT CTATCGCCGA TCAGGCGCGC ACCATCCGTA TTCCGGTGCA TATGATTGAG
ACCATCAACA AGCTCAACCG TATTTCTCGC CAGATGCTGC AAGAGATGGG CCGCGAGCCA
ACGCCGGAAG AGCTGGCTGA ACGGATGCTG ATGCCGGAAG ATAAAATTCG TAAGGTGCTG
AAGATTGCCA AAGAGCCAAT CTCCATGGAA ACGCCGATCG GCGACGATGA AGATTCGCAT
CTGGGTGATT TCATCGAGGA TACCACCCTC GAGCTGCCGC TGGACTCTGC CACTACCGAG
AGCCTGCGTG CCGCCACTCA CGACGTTTTG GCTGGCCTGA CCGCTCGTGA AGCGAAAGTG
CTGCGTATGC GTTTCGGTAT CGATATGAAC ACCGACCACA CGCTGGAAGA AGTGGGTAAA
CAGTTCGATG TTACCCGCGA ACGTATCCGT CAGATCGAAG CGAAGGCGCT GCGTAAACTG
CGCCACCCGA GCCGTTCTGA AGTGCTGCGC AGCTTCCTCG ACGATTAA
 
Protein sequence
MEQNPQSQLK LLVTRGKEQG YLTYAEVNDH LPEDIVDSDQ IEDIIQMIND MGIQVMEEAP 
DADDLLLAEN TTSTDEDAEE AAAQVLSSVE SEIGRTTDPV RMYMREMGTV ELLTREGEID
IAKRIEDGIN QVQCSVAEYP EAITYLLEQY DRVEAEEARL SDLITGFVDP NAEEEMAPTA
THVGSELSQE DLDDDEDEDE EDGDDDAADD DNSIDPELAR EKFAELRAQY VVTRDTIKAK
GRSHAAAQEE ILKLSEVFKQ FRLVPKQFDY LVNSMRVMMD RVRTQERLIM KLCVEQCKMP
KKNFITLFTG NETSETWFNA AIAMNKPWSE KLHDVAEEVQ RCLQKLRQIE EETGLTIEQV
KDINRRMSIG EAKARRAKKE MVEANLRLVI SIAKKYTNRG LQFLDLIQEG NIGLMKAVDK
FEYRRGYKFS TYATWWIRQA ITRSIADQAR TIRIPVHMIE TINKLNRISR QMLQEMGREP
TPEELAERML MPEDKIRKVL KIAKEPISME TPIGDDEDSH LGDFIEDTTL ELPLDSATTE
SLRAATHDVL AGLTAREAKV LRMRFGIDMN TDHTLEEVGK QFDVTRERIR QIEAKALRKL
RHPSRSEVLR SFLDD