Gene HY04AAS1_0307 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHY04AAS1_0307 
Symbol 
ID6743101 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHydrogenobaculum sp. Y04AAS1 
KingdomBacteria 
Replicon accessionNC_011126 
Strand
Start bp265542 
End bp268358 
Gene Length2817 bp 
Protein Length938 aa 
Translation table11 
GC content38% 
IMG OID642750100 
Productmolybdopterin oxidoreductase 
Protein accessionYP_002120975 
Protein GI195952685 
COG category[C] Energy production and conversion
[R] General function prediction only 
COG ID[COG0243] Anaerobic dehydrogenases, typically selenocysteine-containing
[COG3383] Uncharacterized anaerobic dehydrogenase 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000000512628 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACATTA CAAGGCGTGG ATTTTTAAAA GGAATGGCTG CAACTGGTGC TGCTACACTG 
GTTGGTAAAA AGTTACTGAC GCCAGAACAG GCAAAGGCAA TCCAAACAAA AAACATCCAA
TACGCTTATA AACCAAACAT ATGTAACTTT TGCGCAAATG CCTGTGGTAT ACAAGTAAGG
GTCGCTTCTA TCTCCGGTAA ACCAAATAGA CTTATGAAGA TAGAGGGTAA TATACATCAT
CCTTACAATA GAGGTGTTGC CTGCGCTAGG GGTCAAAGTG GTATATCTTA CATATATGAT
AAAGACCGCA TTAAAAAACC CCTTATAAGA ATTGAAGGCT CTAAAAAAGG CGAGTGGAAA
TTTAAAGAAG CTTCTTGGCA AGAAGCTTTT AATTATATGA TGAACAAGTT AAAAAATATA
AAACCTTATG AAATGGCTCT TATGGGAGGA TGGCAAGGAT GTGCTTTTTA TGGTACTTAT
TTGCTTCCTT TTGTGGTAGC TGCTCAAATC CCAACGCTTT ATGGCTCTCC GATACAGCAT
TGTGTTGGTT CTGAGCATTT AGGTCTTCAT ACTGTGTTTG GCAATTATAA CACCCACGAT
GAGGTGGTGT GCGATTACGA TAGAGCAAGA TATATATTGG CAGTAAGAAG CAACGGTTCT
TTGGCCGGTA TTTCCACTGG AAGAGCTCAT AGGTTTGGAG CAGGCATTAA AAACGGTGCT
AAAGTGGTGG TATTAGACCC AAGAGCATCG GAGCTTGCTG CCAAAGCAGA TGAATGGATT
CCCATAAGAC CAGGCACCGA CAACGCCTTT GCTTTGGCTA TGCTTCATGT GATCTTAAGA
GAGCAAAAAG ATGGAAAAGT ACTTTACGAT GAAGAAACCC TTAGATTTTT TACAAACGCT
CCTTTTTTGG CTTATAAGGA TGAAAAAGGA AATCTGCAAC TATTGTCTGA TGTAGATAAA
GACGGGGCTG TGGAGGCTTG GTACGTTTAC GATGAACTCT CAAACTCTGT CCAGAAAGTA
TTTGGCTTTT ATAATACAAA TAAAATATCT AAAGACAACA AAGTGTTAAA ACCAGCTCTA
TTTACAAAAA ATCTCAACGT AAACGGCAAA AATGTAAAAA CTGTATTTGA GTATCTTATG
GATTATACGC AAAACTTTAC ACCAGAATGG GCTTCTAAGA TAACCGATGT ACCAGCATCC
ACTATAAAAA GGGTTGCCGT GGAGTTTGCC ACTATGAAAC CAGCCATTGT AGAGCCAGGG
ATTTACGACT CTAGATATGA AAACACTATC CAGCTAAGGA AAACCTTGGC TATCATACAG
GCTATAACAA GTGGATACGA TAAACCGGGT ACTTGGGTTA CAGGCGGAAG TTATAAGATG
CTTATAAAAG ACTTTTTTGA GTTTACCAAG AAAAATGGAA ATAAGATCAC CATTCCCGTA
AAAGGATATC CAGATGTAGA CATACCTGGC ATGCTTAGGG TATTGGACGT TGGCTTTAAA
TATTTTTTTA ATCCAAAAGC TTGGGCACAC GAATATCCAT CCGTACAATG GGCTTATTTG
CAAACCGAGC TTTCCAAAGG CAAAGAAGCA ACGGTATTTC CTTTTATTAC AGATAACGGT
TTTTACGAAT CTACCAAAAA AGAGGTGTTT TGGAAAGGCC AGCCTTATCA GCTAAAAGCA
GCTTTCGTAT ACGCCCTTAA CTTAGTAAGA GGTGACGTAG AAACTCAAAG ATGGAAAGAA
TTTTTAACAA ATCTTGATTT GGTGGTGGGT TTTGATACGA TGCCTTCAGA TACCATGCTC
TATGCAGATG TAATATTCCC AGATATTCCT TATATACTAA AAAAAGATGT TATTTTTGAT
TTAAACACAT CTCACGATTA TAGTTTTGGC ACAAGAGAAG CTGCTATGCC AAAAGACGGC
GATGAAATGC ACGCTTTAGA TTTTATGTAC ATGCTATCAA AAGCCATGAA TGTACCTTGG
CTTGATACGA TGGCAGAGCT TTATCAGTAC TGGAAGTGGG ATAAAAAAGA GCTCAATCAA
AAATGCGAAG AATCTTGGAA CAAATACGGC ACCATAGTAC CAGCTATAAG AGAGCTTCAG
CTTAAAAACA AAGTAGCTCC TGAGTTAAAA GAATATAAAG GTATATCAAA AACAGTACAA
GAGTTGGAAG AAGAGATATC TAAAAAAGGT GTGATAACGG CGATGTATAG AGAAGAGCTT
ATGGCTAAAT ACTCTGTGCC ATGGCATCAA CCAGTGCCTA CTCCAAGTGG ACGCATGGAA
ATCTATTCAA ATGTATTTGC TACACTTCAG AACATGTTTG GTTATAAGCC AAATTACGAT
CCTTTGATAG CATATATACC ACCAAAATGG AAAGGTGATA TAGCCCCAGA AGATGTGAAA
CTTGAAGAAA ACGAATTTTT TATAACACAT GGTAAAGTAC CAATACAATC TCATACTTGT
GCTGCTACGG TAGACAACCC TATACTAGTA AGCATAGGAA AGTGGAGAGA AGGTGTTTAT
TATGGTATAT GGATTAACGA TAAAAAGGCT AAGTCTCTTG GTATAAAAGA AGGCGATGAT
ATTTTGGTGA CAAACGTTAT GTATCCAAAT TTAAAAGTAA AAGGTAAAGC GCATCTTACA
AAACTAATAA GACCAGATAC GATATTTATA CCGGGGGCTT TTGGAGCTTC TTCTAAAAAA
CTAACATACG GCGATGGTTT GGGAACTCCG TTAAATGATC TGATACCCTA TAGACCAGAA
CCGGTTATAG GTGGATATAG AGCAAACGAA TTTACTGTAA AAGTTGTGAA GGCTTAA
 
Protein sequence
MNITRRGFLK GMAATGAATL VGKKLLTPEQ AKAIQTKNIQ YAYKPNICNF CANACGIQVR 
VASISGKPNR LMKIEGNIHH PYNRGVACAR GQSGISYIYD KDRIKKPLIR IEGSKKGEWK
FKEASWQEAF NYMMNKLKNI KPYEMALMGG WQGCAFYGTY LLPFVVAAQI PTLYGSPIQH
CVGSEHLGLH TVFGNYNTHD EVVCDYDRAR YILAVRSNGS LAGISTGRAH RFGAGIKNGA
KVVVLDPRAS ELAAKADEWI PIRPGTDNAF ALAMLHVILR EQKDGKVLYD EETLRFFTNA
PFLAYKDEKG NLQLLSDVDK DGAVEAWYVY DELSNSVQKV FGFYNTNKIS KDNKVLKPAL
FTKNLNVNGK NVKTVFEYLM DYTQNFTPEW ASKITDVPAS TIKRVAVEFA TMKPAIVEPG
IYDSRYENTI QLRKTLAIIQ AITSGYDKPG TWVTGGSYKM LIKDFFEFTK KNGNKITIPV
KGYPDVDIPG MLRVLDVGFK YFFNPKAWAH EYPSVQWAYL QTELSKGKEA TVFPFITDNG
FYESTKKEVF WKGQPYQLKA AFVYALNLVR GDVETQRWKE FLTNLDLVVG FDTMPSDTML
YADVIFPDIP YILKKDVIFD LNTSHDYSFG TREAAMPKDG DEMHALDFMY MLSKAMNVPW
LDTMAELYQY WKWDKKELNQ KCEESWNKYG TIVPAIRELQ LKNKVAPELK EYKGISKTVQ
ELEEEISKKG VITAMYREEL MAKYSVPWHQ PVPTPSGRME IYSNVFATLQ NMFGYKPNYD
PLIAYIPPKW KGDIAPEDVK LEENEFFITH GKVPIQSHTC AATVDNPILV SIGKWREGVY
YGIWINDKKA KSLGIKEGDD ILVTNVMYPN LKVKGKAHLT KLIRPDTIFI PGAFGASSKK
LTYGDGLGTP LNDLIPYRPE PVIGGYRANE FTVKVVKA