Gene HS_1235 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHS_1235 
SymbolrpoD 
ID4240746 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaemophilus somnus 129PT 
KingdomBacteria 
Replicon accessionNC_008309 
Strand
Start bp1414343 
End bp1416130 
Gene Length1788 bp 
Protein Length595 aa 
Translation table11 
GC content38% 
IMG OID638104808 
ProductRNA polymerase sigma factor RpoD 
Protein accessionYP_719447 
Protein GI113461378 
COG category[K] Transcription 
COG ID[COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32) 
TIGRFAM ID[TIGR02393] RNA polymerase sigma factor RpoD, C-terminal domain
[TIGR02937] RNA polymerase sigma factor, sigma-70 family 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAATGATC ATCTTCCTGA AGATCTTGTT GATGCTGATC AAATTGAAGA TGTGATTCAA 
ATGATCAATG ATATGGGTAT TCAAGTACTT GAAAGTGTGC CTGATGCTGA TGACTTGATG
CTCAATGAAA ATATTACGGA TGAGGATGCT GTTGAGGAAG CGACTCAAGT ATTATCAAGT
GTAGAGGCAG AGTTAGGAAG AACAACAGAT CCTGTGCGTA TGTATATGCG TGAAATGGGA
AGTGTTGAAC TTCTTACGCG TGAAGGCGAA ATCGATATTG CAAAACGCAT CGAAGAAGGG
ATTAATGAAG TACAAAGTGC CGTAGCCGCT TATCCTAAAG CACTCAATTA TCTTCTTGAA
CAGTATACTT TAGTTGAAGA AGGCGGTATG CGGTTGGCTG ATCTCATTAC CGGTTTTGTT
GATCCAAATG CGGTTGTTGA AGAAGAGATA ATTGAAACCG ACGACATACT TACAGAAGAT
GATGAAGAAC ATGTAGAAAG CAATGTTGAT GTAAACGATG AAGAAGAGGA AGAAGATAGC
GAAGATAATA GCAATGTAGA CTCTGATTCG GATAACACAA TTGCCCCAGA AGTTGCTCGT
GAGAAATTTG AAGCACTAAG AATACAACAC CAAAAAACCT TAGCCGCAAT AGAAAAATAT
GGTCGTTCCC GTAAACAAGC AAAAGATCAT ATTCAAGCAT TAGCCGATAT TTTTACTCAG
TTCCGTTTAG TGCCTAAACA GTTTGATGCG TTGGTCAATT ATATGCGTGA TATGATGAAA
TCTGTTCGTC AGCATGAACG ACAAATTCAA AAGCTGGTAG TTGATTTGGC TAAAATGCCA
AAAGAAAATT TCCAAAAATT ATTTATTGGC AATGAAAGTT CAGAGGGTTG GCTGGATAAA
TTACTGCTTG CAAGAAAACC TTGGTCTGAA AGACTAATAC AACATGAAAG TGCGGTTCGT
CAAAGCATAA ACCAATTATT AAAAATTGAA CAGGAAACCA ATTTAACTAT TTCTCAAATT
CGTGAAATTT GTGACAAAGT GGCACAAGGA GAATTAAAAG CTCGCCGTGC AAAAAAAGAA
ATGGTTGAAG CAAATTTACG TTTGGTTATT TCAATTGCGA AGAAATACAC TAACCGTGGA
TTGCAATTCC TTGATCTCAT TCAAGAAGGG AATATTGGTT TAATGAAAGC GGTAGATAAA
TTTGAATACC GGCGGGGTTA TAAATTTTCA ACCTATGCAA CTTGGTGGAT ACGTCAAGCG
ATTACACGCT CTATTGCAGA TCAAGCAAGA ACGATCCGCA TTCCAGTTCA TATGATTGAA
ACTATCAATA AATTAAACCG CATTTCTCGT CAAATGTTAC AAAAAATGGG GCGTGAAGCC
ACATCTGAAG AATTAGCTGA ACGAATGGGA ATGCCTGAGG ATAAAATTCG TAAAGTATTA
AAAATCGCAA AAGAGCCTAT CTCTATGGAA ACGCCTATCG GTGATGACGA TGATTCCCAT
TTAGGTGATT TTATTGAAGA CAATACGTTA GAGCTACCGT TAGACTCTGC GACAGCCCAA
AGTTTGCGTG CTGCAACGCA TGAAGTGTTG GAAGGACTAA CCCCAAGAGA AGCAAAAGTC
TTGCGTATGC GTTTTGGTAT TGATATGAAT ACAGATCATA CCTTGGAAGA AGTCGGGAAA
CAATTTGATG TAACTCGTGA ACGTATTCGT CAAATTGAAG CTAAAGCATT GCGTAAATTG
CGTCATCCAA GTCGTTCTGA AACCCTACGC AGTTTCTTAG ATGAGTAA
 
Protein sequence
MNDHLPEDLV DADQIEDVIQ MINDMGIQVL ESVPDADDLM LNENITDEDA VEEATQVLSS 
VEAELGRTTD PVRMYMREMG SVELLTREGE IDIAKRIEEG INEVQSAVAA YPKALNYLLE
QYTLVEEGGM RLADLITGFV DPNAVVEEEI IETDDILTED DEEHVESNVD VNDEEEEEDS
EDNSNVDSDS DNTIAPEVAR EKFEALRIQH QKTLAAIEKY GRSRKQAKDH IQALADIFTQ
FRLVPKQFDA LVNYMRDMMK SVRQHERQIQ KLVVDLAKMP KENFQKLFIG NESSEGWLDK
LLLARKPWSE RLIQHESAVR QSINQLLKIE QETNLTISQI REICDKVAQG ELKARRAKKE
MVEANLRLVI SIAKKYTNRG LQFLDLIQEG NIGLMKAVDK FEYRRGYKFS TYATWWIRQA
ITRSIADQAR TIRIPVHMIE TINKLNRISR QMLQKMGREA TSEELAERMG MPEDKIRKVL
KIAKEPISME TPIGDDDDSH LGDFIEDNTL ELPLDSATAQ SLRAATHEVL EGLTPREAKV
LRMRFGIDMN TDHTLEEVGK QFDVTRERIR QIEAKALRKL RHPSRSETLR SFLDE