Gene HY04AAS1_1126 details


Gene Information

Locus tag: HY04AAS1_1126
Symbol:
ID: 6743942
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Hydrogenobaculum sp. Y04AAS1
Kingdom: Bacteria
Replicon accession: NC_011126
Strand:
Start bp: 1042686
End bp: 1043717
Gene Length: 1032 bp
Protein Length: 343 aa
Translation table: 11
GC content: 35%
IMG OID: 642750935
Product: Sel1 domain protein repeat-containing protein
Protein accession: YP_002121790
Protein GI: 195953500
COG category: [R] General function prediction only
COG ID: [COG0790] FOG: TPR repeat, SEL1 subfamily
TIGRFAM ID:
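
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1  # the stop codon encodes no residue


length = gene_length_bp(1042686, 1043717)
print(length)                     # 1032
print(protein_length_aa(length))  # 343
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields of the record.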


Plasmid Coverage information

Num covering plasmid clones: 40
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAATAATA TAATAGCTAA TATAAAGAAG TCTTACATTA TAGGAGCAGT GATCCTATTA 
GTTGTGATTG GTGGCTTTTT TGTTTATGAA CATGAATATG GTCCAGAAGC AAAATACCAA
AAAGGTCTTC ATTATTATAA AGATAAAAAC TACCAAAAAG CATTACCTTT ATTTAAAGAA
TCAGCAAAGC AGGGGTATGC ACCAGCAGAA GCTAAACTTG GGTATATGTA TTTACGTGGT
TTAGGAGTGT CAAGAGATGA CGATAAGGCT GCTTATTGGT TTAAAAAAGC TGCACACCAA
GGTAATGCTA GAGGAGAAGT TGGTCTTGGT TATATGTATT TGTTTGGCAA AGGCGGAGTA
TCAAAAGATT ATCAAAAAGC TTTATATTGG ATTAAGAAAG CAGTTAAACA AGGTGATGCT
CGAGGAGAAA ATAACCTTGG ATATATGTAT GAATATGGTT TAGGAGTACC ACAGGATTAT
AGCAAAGCTG TATATTGGTA TAAAAAAGCT GCTGAACAAG GACTTGCAGC AGCAGAAGAT
AGTCTTGGAT ATATGTATGA ATATGGTTTA GGAGTACCAC AGGATTATAG CAAAGCTGTA
TATTGGTATA AAAAAGCTGC TGAACAAGGA CTTGCAGCAG CAGAAGATAA TCTTGGATAT
ATGTATTTGT TTGGCAAAGG CGGAGTATCA AAAGATTATC AAAAAGCTTT ATATTGGATT
AAGAAAGCTG CACATCAAGG TGATGCTTTA GGAGAAGCTA CTCTTGGACA TATGTATGCA
GAAGGTTTAG GAGTACCACA GGATTATAGC AAAGCTTTAT ATTGGTTTAA AAAAGCTGCT
AAACAAGGAC TTGCACAAGC AGAAAATAAT CTTGGATATA TGTATGCAGA AGGTTTAGGA
GTACCACAGG ATTACAACGA AGCCGTATAT TGGTTACAGA AAGCTGCTGA ACAAGGACTT
GCACAAGCTA AAATCAACCT TGAATATATA AAAACAAAAC TTGCATTAAT GCACCTATTT
GGTGGTTATT GA
 
Protein sequence
MNNIIANIKK SYIIGAVILL VVIGGFFVYE HEYGPEAKYQ KGLHYYKDKN YQKALPLFKE 
SAKQGYAPAE AKLGYMYLRG LGVSRDDDKA AYWFKKAAHQ GNARGEVGLG YMYLFGKGGV
SKDYQKALYW IKKAVKQGDA RGENNLGYMY EYGLGVPQDY SKAVYWYKKA AEQGLAAAED
SLGYMYEYGL GVPQDYSKAV YWYKKAAEQG LAAAEDNLGY MYLFGKGGVS KDYQKALYWI
KKAAHQGDAL GEATLGHMYA EGLGVPQDYS KALYWFKKAA KQGLAQAENN LGYMYAEGLG
VPQDYNEAVY WLQKAAEQGL AQAKINLEYI KTKLALMHLF GGY
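
The gene and protein sequences are displayed in space-separated blocks of ten characters, so the whitespace has to be removed before any analysis. The sketch below shows one way to translate the gene sequence and compute its GC percentage in plain Python. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in permitted start codons); the helper names `clean`, `translate`, and `gc_percent` are hypothetical and chosen only for illustration.

```python
from itertools import product

# Codon table in NCBI order (first base varies slowest, bases ordered T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks used in the record display."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three blocks of the gene sequence shown above.
print(translate("ATGAATAATA TAATAGCTAA TATAAAGAAG"))  # MNNIIANIKK
```

Passing the full gene sequence to `translate` should reproduce the protein sequence listed in the record, and `gc_percent` can be compared against the GC content field.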