Gene HY04AAS1_1081 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHY04AAS1_1081 
Symbol 
ID6743896 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHydrogenobaculum sp. Y04AAS1 
KingdomBacteria 
Replicon accessionNC_011126 
Strand
Start bp1005773 
End bp1007089 
Gene Length1317 bp 
Protein Length438 aa 
Translation table11 
GC content38% 
IMG OID642750889 
Productprotein of unknown function DUF224 cysteine-rich region domain protein 
Protein accessionYP_002121745 
Protein GI195953455 
COG category[C] Energy production and conversion 
COG ID[COG2048] Heterodisulfide reductase, subunit B 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value0.87184 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTTTAC TTAGATATGA CAAGACCAAA TTTCCAATTC AAAACAACCA TGCTCATTAC 
GATGAAATAT TTGAGCGCAT GGAAGAATTG GAAGCAAAGG GCGAGATCCT TATACATAGA
ATCACCGAAG AACATAAACC TGTAGAGGTT TATACAAGAA CAGGTCGTAT AAAGACTGTG
CCTACCAACA AACTATGGCA TCATAAATCT TGTGGACAAT GTGGTAATAT ACCCGGTTAT
CCAGCATCTA TATTCTGGTT TATGAACAAG TTTGGATATG ATTATCTAAA TGAACCACAC
CAAACTTCTT GTACCGCATG GAACTATCAC GGTTCTGGTA CGTCAAATCC AGTGGCTTTG
GCAGCTGTAT GGCTAAGAAA CATGCACCAA GCTTGGAAGA CTGGTTATTA TCCTTTAATT
CACTGTGGTA CTTCATTTGG TTCTTACAAG GAAACTAGAG AACAACTCAT AATGAATAAA
GAACTTAGAG ATGCTGTAAG ACCTATATTG AAGAAGTTGG GTAGATTAGG ACCAAATGGT
GAATTGGTGA TACCTCAAGA GGTAGTACAT TATTCAGAAT GGACACATGC AAACAGATAT
AAAATAAAAG AATTATACGA AAAAGAAGGT AAACCAAGAG GTATAGATGT GTCAAATGTA
AGAGTTGCTA TACACAACGC TTGCCACGTT TGGAAAATGA TAGCTGACGA TTACGTATAT
GACCCAGAAA TATATGGAGG CCAAAGACCA GCAGCATCTA CGGCTGTTAT AAAAGAATTG
GGAGCTATAG TTGCTGACTA TTACACATGG TATGATTGCT GTGGTTTTGG ATTTAGGCAT
ATATTGACAG AAAGAGAGTT TACTAGGTCT TTTGCTATAA ATAGAAAGTT GAAGGTAATA
TATGAGGATG CTAAGGCAGA CCTCATCGTA ACTCACGATA CTGGTTGTAC CACAACTTTC
GAAAAGAATC AATGGATAGG CAAAGCTCAT GATATGTACT ATCCGGTAGC TGTTATGTCA
GATGTTATGT TCTCAGCTTT AGCCTGCGGT GCACATCCGT ACAAGATAGT CCAGCTGTAC
TGGAACTGCT CAAGTTATGA ACCTCTTTTG GAAAAAATGG GTATAACCAA CTGGAAAGAG
CTAAAAAAAG AGTGGGAAGA CACCGTAAAA TATATAAATG AGCTTGATAA AGCAGGCAAA
CACGACGAAC TTCAAGAATT CTTTAAAACC TATGACTTGT ATGAACCATA CAGCAGAACA
TCCGACGGCA AACCAAGAGC AAGTGCAACG GCTGATAAGG TATTGTTTAG ATCTTAA
 
Protein sequence
MSLLRYDKTK FPIQNNHAHY DEIFERMEEL EAKGEILIHR ITEEHKPVEV YTRTGRIKTV 
PTNKLWHHKS CGQCGNIPGY PASIFWFMNK FGYDYLNEPH QTSCTAWNYH GSGTSNPVAL
AAVWLRNMHQ AWKTGYYPLI HCGTSFGSYK ETREQLIMNK ELRDAVRPIL KKLGRLGPNG
ELVIPQEVVH YSEWTHANRY KIKELYEKEG KPRGIDVSNV RVAIHNACHV WKMIADDYVY
DPEIYGGQRP AASTAVIKEL GAIVADYYTW YDCCGFGFRH ILTEREFTRS FAINRKLKVI
YEDAKADLIV THDTGCTTTF EKNQWIGKAH DMYYPVAVMS DVMFSALACG AHPYKIVQLY
WNCSSYEPLL EKMGITNWKE LKKEWEDTVK YINELDKAGK HDELQEFFKT YDLYEPYSRT
SDGKPRASAT ADKVLFRS