Gene HY04AAS1_1530 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHY04AAS1_1530 
Symbol 
ID6744361 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHydrogenobaculum sp. Y04AAS1 
KingdomBacteria 
Replicon accessionNC_011126 
Strand
Start bp1443052 
End bp1444788 
Gene Length1737 bp 
Protein Length578 aa 
Translation table11 
GC content35% 
IMG OID642751351 
ProductRNA polymerase, sigma 70 subunit, RpoD family 
Protein accessionYP_002122191 
Protein GI195953901 
COG category[K] Transcription 
COG ID[COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32) 
TIGRFAM ID[TIGR02393] RNA polymerase sigma factor RpoD, C-terminal domain
[TIGR02937] RNA polymerase sigma factor, sigma-70 family 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGAAAAG CCAAAGGCAA AGAGCATCTT TTAGAAATCG GCAAAGAAAA AGGCTACGTA 
ACATACGACG ACATAAACGA GTATGTCAGT GAGGACTTGG TAGAGTCCGA TGAACTAGAG
TCTTTATTCG ACATTTTGTC TGAAAAGGGT ATACAGATTT TAGAAGATGA AGCTGAAGCC
GCTATACACC ACGAAACCGA TAATGAGATT TTAGAAGAAG ACGACGAGAT GATAGTATCT
GCTAGCATAG TAGAGGCAGA TTTAAAGAAC AACGATTCCG TAAGATACTA TCTAAAAGAG
ATGGGCAAGA TAGCTCTTCT TGGAAGAGCA GATGAAATAG AACATGCAAA ATACATAGAA
ATGGGTAGAA AGCAGCTAAG AAGAGGACTA TTAAGGACAT CTTTTTTAGT GGATATGGTG
CTAAGATATT GGGCTAAAGT CTGCGATGGA AAGATGAGAT CTCAAGAGAT ACTTGATATG
CAAGAAGAAC ACACAGACTA CGACGAGTAC GAGGAAGCAG AACACGTAGA TTCTGTGTTT
ATAGAAAAGG GCATAGAGCT TGCTAAAAAA TATAAAACAG TAATAGAAAA AAGGACATTA
TTTTTAACAT ACAAAGATAA GAAAACAAAA GCAGAATACT TAAAGGCTCA CGCTGAAATG
AATAAAACGT TAAAAAGTAT AAACATCAAG TTTTCAAGAT ACGAGAAAAT AGCTGATGAG
TTTATAAAAC TATGCAAAGA CTACAAGAGC AAATTAAATT CTTTAACTAT ACAAAAAAGA
AGGTTTGAAG CCGTACATCC AAATGTGGAA GAACTTTTAT CTTACTACGA TAAAAACAAA
GAACTATTGA ATAAAATAGA AAAACACATG CCTTTTCATA CTTTTGAGAT AAGCAGGTCA
AACTACTTAG CCCTTCAAAG AGAAATAGAA GACATAGAAA GACGTTTGGG AGTGTTACCG
GAGGAGCTTA ACAAAATTAT AAACATTATA GAGCAAGGTA GAAAACGCGT AAAAGAATCA
AAAGACATGA TGGTAAAATC AAACCTAAGG CTTGTAGTAT CGATAGCTAA AAAATACATA
AACAGAGGCC TTCATTTCTT AGATCTTATA CAAGAAGGCA ATATGGGACT TATGAAGGCT
GTTGATAAAT ACGATTACAA AAAAGGTTTT AAATTTTCTA CATACGCCAC ATGGTGGATA
AAACAAGCTA TTACAAGAGC TATAGCTGAC CAAGCAAAAA CTATACGTAT ACCAGTGCAC
ATGATAGAAA CTATCAACAA AATTTCTAAA GTATCCAAAA AACTATTCCA AGAGTATGGT
AGAGAACCAT CCCCCGAAGA AATAGCAAAA GCTCTAAACA TGTCTCCAGA AAAAGTAAGG
AAAATATTGA AGTCCATTCA AGAGCCTATA TCTCTTGAAA CACCTATAGG AGATGACGAA
GATACCCATC TTAAAGATTT CATAGAAGAC TCAAGCATAT CAAACCCAGA GGAAGCCACA
GCCAGAAGAC TTCTAAGGGA GCAAATAGAA AAGATTATCA ATACACTCTC AGATAAAGAA
AGAGAAGTAA TTATGTATAG ATTTGGATTG GTAGACGGTA TAGAGCACAC GCTGGAACAA
GTAGGGACTA TGTTTAATCT TACCAGAGAG CGAATAAGAC AGATAGAATC AAAAGCTATA
AGAAAGATAA GACATCCAAG TAGAGCAAAG TATTTAAAAG ATTTTGAAAT AATATGA
 
Protein sequence
MGKAKGKEHL LEIGKEKGYV TYDDINEYVS EDLVESDELE SLFDILSEKG IQILEDEAEA 
AIHHETDNEI LEEDDEMIVS ASIVEADLKN NDSVRYYLKE MGKIALLGRA DEIEHAKYIE
MGRKQLRRGL LRTSFLVDMV LRYWAKVCDG KMRSQEILDM QEEHTDYDEY EEAEHVDSVF
IEKGIELAKK YKTVIEKRTL FLTYKDKKTK AEYLKAHAEM NKTLKSINIK FSRYEKIADE
FIKLCKDYKS KLNSLTIQKR RFEAVHPNVE ELLSYYDKNK ELLNKIEKHM PFHTFEISRS
NYLALQREIE DIERRLGVLP EELNKIINII EQGRKRVKES KDMMVKSNLR LVVSIAKKYI
NRGLHFLDLI QEGNMGLMKA VDKYDYKKGF KFSTYATWWI KQAITRAIAD QAKTIRIPVH
MIETINKISK VSKKLFQEYG REPSPEEIAK ALNMSPEKVR KILKSIQEPI SLETPIGDDE
DTHLKDFIED SSISNPEEAT ARRLLREQIE KIINTLSDKE REVIMYRFGL VDGIEHTLEQ
VGTMFNLTRE RIRQIESKAI RKIRHPSRAK YLKDFEII