Gene SNSL254_A3471 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSNSL254_A3471 
SymbolrpoD 
ID6486865 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Newport str. SL254 
KingdomBacteria 
Replicon accessionNC_011080 
Strand
Start bp3365730 
End bp3367577 
Gene Length1848 bp 
Protein Length615 aa 
Translation table11 
GC content53% 
IMG OID642738758 
ProductRNA polymerase sigma factor RpoD 
Protein accessionYP_002042478 
Protein GI194445988 
COG category[K] Transcription 
COG ID[COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32) 
TIGRFAM ID[TIGR02393] RNA polymerase sigma factor RpoD, C-terminal domain
[TIGR02937] RNA polymerase sigma factor, sigma-70 family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones88 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAGCAAA ACCCGCAGTC ACAGCTGAAA CTTCTTGTCA CCCGTGGTAA GGAGCAAGGC 
TATCTGACCT ATGCTGAGGT CAATGACCAT CTGCCGGAAG ATATCGTCGA TTCAGATCAA
ATTGAAGATA TCATCCAAAT GATCAACGAC ATGGGTATTC AGGTAATGGA AGAAGCGCCT
GATGCCGATG ATCTGCTGCT GGCTGAAAAT ACCACCAGCA CCGATGAAGA TGCGGAAGAA
GCTGCTGCAC AAGTTCTGTC CAGTGTTGAG TCTGAAATCG GTCGTACGAC TGACCCGGTA
CGCATGTATA TGCGTGAAAT GGGCACTGTT GAACTGTTGA CCCGGGAAGG CGAAATCGAC
ATCGCTAAAC GTATCGAAGA CGGGATCAAC CAGGTTCAAT GCTCCGTTGC CGAATACCCG
GAAGCCATTA CCTATCTGCT GGAACAGTAC GATCGCGTTG AGGCTGAAGA GGCTCGTTTG
TCCGATCTTA TCACCGGCTT TGTCGATCCG AACGCGGAAG AAGAGATGGC GCCGACCGCA
ACTCACGTCG GTTCTGAACT CTCCCAGGAA GACCTGGATG ATGACGAAGA CGAAGATGAA
GAAGACGGCG ACGATGACGC CGCCGATGAC GACAACAGCA TTGACCCTGA ACTGGCACGC
GAAAAATTCG CTGAACTGCG CGCGCAATAC GTCGTTACGC GCGACACCAT CAAAGCGAAA
GGCCGCAGCC ATGCTGCCGC GCAGGAAGAG ATTCTGAAGC TGTCTGAAGT GTTCAAACAG
TTCCGTCTGG TACCGAAGCA ATTCGACTAT CTGGTCAACA GTATGCGCGT GATGATGGAT
CGCGTGCGTA CCCAGGAACG TCTGATCATG AAGCTCTGCG TCGAGCAGTG CAAAATGCCG
AAGAAGAACT TTATCACGCT GTTTACCGGT AATGAAACCA GCGAAACCTG GTTCAATGCC
GCTATCGCGA TGAACAAACC GTGGTCGGAA AAACTGCACG ATGTCGCGGA AGAAGTGCAA
CGCTGCCTGC AAAAACTGCG GCAGATTGAA GAAGAGACCG GTCTGACCAT CGAACAGGTG
AAAGACATCA ACCGTCGCAT GTCCATCGGG GAAGCGAAAG CCCGCCGTGC GAAGAAAGAG
ATGGTTGAAG CGAACTTGCG TCTGGTTATT TCTATCGCTA AGAAATACAC CAACCGTGGC
TTGCAATTCC TTGATCTGAT TCAGGAAGGC AACATCGGTC TGATGAAAGC GGTAGATAAG
TTCGAATACC GTCGCGGCTA CAAATTCTCC ACCTATGCAA CCTGGTGGAT CCGTCAGGCG
ATCACCCGTT CTATCGCCGA TCAGGCGCGC ACCATCCGTA TTCCGGTGCA TATGATTGAG
ACCATCAACA AGCTCAACCG TATTTCTCGC CAGATGCTGC AAGAGATGGG CCGCGAGCCA
ACGCCGGAAG AGCTGGCTGA ACGGATGCTG ATGCCGGAAG ATAAAATTCG TAAGGTGCTG
AAGATTGCCA AAGAGCCAAT CTCCATGGAA ACGCCGATCG GCGACGATGA AGATTCGCAT
CTGGGTGATT TCATCGAGGA TACCACCCTC GAGCTGCCGC TGGACTCTGC CACTACCGAG
AGCCTGCGTG CCGCCACTCA CGACGTTTTG GCTGGCCTGA CCGCTCGTGA AGCGAAAGTG
CTGCGTATGC GTTTCGGTAT CGATATGAAC ACCGACCACA CGCTGGAAGA AGTGGGTAAA
CAGTTCGATG TTACCCGCGA ACGTATCCGT CAGATCGAAG CGAAGGCGCT GCGTAAACTG
CGCCACCCGA GCCGTTCTGA AGTGCTGCGC AGCTTCCTCG ACGATTAA
 
Protein sequence
MEQNPQSQLK LLVTRGKEQG YLTYAEVNDH LPEDIVDSDQ IEDIIQMIND MGIQVMEEAP 
DADDLLLAEN TTSTDEDAEE AAAQVLSSVE SEIGRTTDPV RMYMREMGTV ELLTREGEID
IAKRIEDGIN QVQCSVAEYP EAITYLLEQY DRVEAEEARL SDLITGFVDP NAEEEMAPTA
THVGSELSQE DLDDDEDEDE EDGDDDAADD DNSIDPELAR EKFAELRAQY VVTRDTIKAK
GRSHAAAQEE ILKLSEVFKQ FRLVPKQFDY LVNSMRVMMD RVRTQERLIM KLCVEQCKMP
KKNFITLFTG NETSETWFNA AIAMNKPWSE KLHDVAEEVQ RCLQKLRQIE EETGLTIEQV
KDINRRMSIG EAKARRAKKE MVEANLRLVI SIAKKYTNRG LQFLDLIQEG NIGLMKAVDK
FEYRRGYKFS TYATWWIRQA ITRSIADQAR TIRIPVHMIE TINKLNRISR QMLQEMGREP
TPEELAERML MPEDKIRKVL KIAKEPISME TPIGDDEDSH LGDFIEDTTL ELPLDSATTE
SLRAATHDVL AGLTAREAKV LRMRFGIDMN TDHTLEEVGK QFDVTRERIR QIEAKALRKL
RHPSRSEVLR SFLDD