Gene Hore_12400 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHore_12400 
Symbol 
ID7313561 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalothermothrix orenii H 168 
KingdomBacteria 
Replicon accessionNC_011899 
Strand
Start bp1333523 
End bp1334599 
Gene Length1077 bp 
Protein Length358 aa 
Translation table11 
GC content40% 
IMG OID643611680 
ProductRNA polymerase, sigma 70 subunit, RpoD family 
Protein accessionYP_002508985 
Protein GI220932077 
COG category[K] Transcription 
COG ID[COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32) 
TIGRFAM ID[TIGR02393] RNA polymerase sigma factor RpoD, C-terminal domain
[TIGR02937] RNA polymerase sigma factor, sigma-70 family 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value0.0607 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGTAAAA ATTTCAATCC AGACAAAGTT AAAGAAGTTA AGGAATTAAT TAAAAAAGGT 
AAAGAAGAAG GCTATTTGAC CTATGAAGAA ATTATGGATT CTCTGGAAGA GATAGAACTC
TCCTCTGAAG ATATTGAGAA GATATATGAA CTCTTTAACG AGATGAATAT AGATGTAGTT
GATGATGTGG ATGAAGTTGA TGATGATAAA GGAGATGATG ACCTGGAGTT ATCTATACCG
GAAGGGGTTG GAATTGATGA CCCAGTAAGG ATGTATTTAA AAGAAATTGG AAAAGTACCA
CTTCTTACTG CTGAAGAAGA GGTTGATCTG GCAAAAAGAA TTGAACAGGG CGACGAACAG
GCTAAAAGGG AATTGGTTGA AGCTAATCTA AGACTGGTTG TTAGTATTGC TAAAAAGTAT
GTGGGAAGAG GTATGTCTTT CCTTGATTTG ATTCAGGAAG GAAATATGGG TCTTATTAAG
GCTGTTGAAA AATTTGATTA TCGTAAAGGA TATAAATTTA GCACTTATGC TACCTGGTGG
ATTCGCCAGG CTATAACCCG TGCTATTGCT GACCAGGCCC GTACTATACG TATCCCGGTG
CATATGGTAG AAACAATTAA TAAATTGATC AGGGTATCAA GACAATTACT CCAGGAAAAG
GGGCGTGAGC CTACTCCTGA GGAGATTGGT GAAGAAATGG GAATGCCGGC CGAAAAAGTC
CGGGAAATTT TAAAGATTGC CCAGGAACCG GTCTCCCTGG AAACACCTAT TGGTGAAGAA
GAGGATAGTC ATCTTGGTGA TTTTATTGAG GATGAAGATG CCCCAGCACC TGCCTCAGCT
GCTTCATTTA CTCTTTTAAG GGAACAGCTC GATGATGTGC TGGATACACT AACAGATAGA
GAAAAAAGGG TTCTTGAACT ACGTTTTGGT CTGGAGGATG GCCGTCCCCG GACTCTAGAG
GAAGTTGGAA AAGAATTTGG GGTTACCAGA GAAAGAATCA GGCAGATTGA GGCCAAGGCT
TTAAGGAAAC TCCGGCATCC AAGCCGTAGT AAAAAACTCA AAGATTACCT TGAGTAA
 
Protein sequence
MGKNFNPDKV KEVKELIKKG KEEGYLTYEE IMDSLEEIEL SSEDIEKIYE LFNEMNIDVV 
DDVDEVDDDK GDDDLELSIP EGVGIDDPVR MYLKEIGKVP LLTAEEEVDL AKRIEQGDEQ
AKRELVEANL RLVVSIAKKY VGRGMSFLDL IQEGNMGLIK AVEKFDYRKG YKFSTYATWW
IRQAITRAIA DQARTIRIPV HMVETINKLI RVSRQLLQEK GREPTPEEIG EEMGMPAEKV
REILKIAQEP VSLETPIGEE EDSHLGDFIE DEDAPAPASA ASFTLLREQL DDVLDTLTDR
EKRVLELRFG LEDGRPRTLE EVGKEFGVTR ERIRQIEAKA LRKLRHPSRS KKLKDYLE