Gene HY04AAS1_1034 details


Gene Information

Locus tag: HY04AAS1_1034
Symbol:
ID: 6743849
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Hydrogenobaculum sp. Y04AAS1
Kingdom: Bacteria
Replicon accession: NC_011126
Strand:
Start bp: 968432
End bp: 970078
Gene Length: 1647 bp
Protein Length: 548 aa
Translation table: 11
GC content: 32%
IMG OID: 642750842
Product: Tetratricopeptide TPR_2 repeat protein
Protein accession: YP_002121698
Protein GI: 195953408
COG category: [R] General function prediction only
COG ID: [COG4783] Putative Zn-dependent protease, contains TPR repeats
TIGRFAM ID:
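
The start, end, and length fields in this record agree with each other, which a little arithmetic can confirm. The following is a minimal sketch, assuming inclusive coordinates and a single terminal stop codon that is not counted in the protein length. The function name `expected_lengths` is hypothetical and is not part of the source database.

```python
def expected_lengths(start_bp, end_bp):
    """Return (gene_length_bp, protein_length_aa) for an unspliced CDS.

    Assumption: coordinates are inclusive, and the final codon is a stop
    codon that does not contribute a residue to the protein.
    """
    gene_length = end_bp - start_bp + 1
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_length = gene_length // 3 - 1  # drop the stop codon
    return gene_length, protein_length


# Values taken from the Start bp and End bp fields of this record.
print(expected_lengths(968432, 970078))  # (1647, 548)
```

The result matches the Gene Length (1647 bp) and Protein Length (548 aa) fields.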


Plasmid Coverage information

Num covering plasmid clones: 34
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTTTAGAA AAATTTTGTT ATCACTTTTG TTTGCAAGCT CTTTAGGTTT GGCAAACCCG 
TATTTTGACT ATGCAATGTG TCATTATTAT ATAGATAACC CTCAAAAAGC TTATCCTTAT
TGCTCAAAAG CCCTTAAGGA ACTTCCAACC CCAAGTGTGT TTGAAGATGT TATAAGTATG
TATGTAAGAG CCCATTATTT AGAAGGTGCT ATTTACGTTG CAAAGCTTTA CAAAGATAGA
TATCCAAATC TAAAAGAACC TTATATAGAT CTATACACTC TTTATACTAT GGATAAAGAC
TATGAAAATG CAAAAAAGAC ATTGGAAGAA GCGCTAAGTA AATTTCCTCA GGACAATTTC
TTTGCTCTAA ATCTTATAAC CACGTATATA CATGATGGAG AAATTAAAAA AGCCGAAGAT
ATTATAAACA GGTTTTTGGC TTTTAAAAAC GAAAACGGAA AAGAGATTTT TTACTATATT
AGAGCACGTA TAGAACTTGC ACAGCAAAAC AAAGAAAAGG CAATAGAAGA TCTTAAAAAG
GCTATAGAGT TAAAACCAAA CTTTGATGAG GCCGTAGATA CTTTGGCAAG TATATACGAT
CAAGAAAGTA AATATCAAGA TGAGGAAAAA CTTTATGAAG ATATACTAAA AAAAGATTCT
TCTAATATAA GCGCCTTGGA GCGGTTGGGT AACCTGTTTT TTAAATTAGG CCTTAGCTAC
AAGGCTTCGG ATATATATAA AAAACTTGCA GAACTAAACA AAAACAATCT AAACTATCAG
TATCAGTATG CATTATCTTT GCTCCAAAGC ATGAGATACG ATAAGGCTTT ATCAGTTTTA
GCCCCTTTGT ATAAGAAGCA TCCAAACAAC AAACCCATTG CTTACCTTTA CGGACTGACG
TTGGAAGCTG CTCACAAACC GGTAAAAGCT CTTGAAGTAT ATAAAAAGCT TCTTCAAATA
GATAAGAAAA ACCCAAAGCT ATACGAAAGA ATAGCAAGTA TTTTGATAGA CGAAGGTAAA
TATAACGAGG CTATGCCTTA TATAGAAAAA GGCTTAAAAC TCAACCCCTT GAGCTCAAAG
CTTTACATAT TTAAAGCTAT AATAGCGGCT AGTCATCGTC ATTACATAAT GGCAAAAGTA
TATGCGGATC AATCTATAAA GCTAAATCAT CATGATTATA GAAGCTATTT TATAAGAGCT
ATGATTGAAG ATAAGCTTCA TCAGATAGAC AATGAAATAA AGGATTTAAA AAAAGTAGTA
GAGTTAAAGC CAAATGATGC AGATATGCTA AATTACTTAG GTTATACTAT GCTTATTTAC
AATAAAGATA TAAAAGAGGG AATGAAGTAT ATAGAAAAAG CAGTGAAACT TTCTCCTAAA
AACCCCTCTT ACCTTGATAG CTTAGCTTAT GGATACTTTC TTTTGCATGA TTATAAAAAG
GCTTTAAAAT ACGAAGAAGA GGCTTACAAG CTAAATTCAA AAGATTCCGT AATTATACAA
CATCTTGGCA TGATAAAACT TATGCTGGGA CAACCAAAAG AAGCCAAAAG TTTGTTACTA
AAAGCCCTAT ACATAGTAAA TAAAAGAGGA GAAGAAGAGC CTGATCAAAA GAAAAAGATT
TTAGAATTTT TAAAAGAGAT AAAATAA
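
The gene sequence is printed in blocks of 10 bases, so the spaces and line breaks have to be stripped before computing anything from it. As a hedged illustration, the sketch below computes a GC percentage from such a block-formatted string. The helper name `gc_percent` is an assumption made for this example, and the sample input is only the first line of the sequence, so its value need not equal the whole-gene GC content field.

```python
def gc_percent(grouped_sequence):
    """Percentage of G and C bases in a sequence; whitespace is ignored."""
    bases = "".join(grouped_sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First line of the gene sequence above, kept in its 10-base block layout.
first_line = "ATGTTTAGAA AAATTTTGTT ATCACTTTTG TTTGCAAGCT CTTTAGGTTT GGCAAACCCG"
print(round(gc_percent(first_line), 1))
```

Passing the complete sequence instead of `first_line` gives a figure that can be compared with the GC content field in the Gene Information section.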
 
Protein sequence
MFRKILLSLL FASSLGLANP YFDYAMCHYY IDNPQKAYPY CSKALKELPT PSVFEDVISM 
YVRAHYLEGA IYVAKLYKDR YPNLKEPYID LYTLYTMDKD YENAKKTLEE ALSKFPQDNF
FALNLITTYI HDGEIKKAED IINRFLAFKN ENGKEIFYYI RARIELAQQN KEKAIEDLKK
AIELKPNFDE AVDTLASIYD QESKYQDEEK LYEDILKKDS SNISALERLG NLFFKLGLSY
KASDIYKKLA ELNKNNLNYQ YQYALSLLQS MRYDKALSVL APLYKKHPNN KPIAYLYGLT
LEAAHKPVKA LEVYKKLLQI DKKNPKLYER IASILIDEGK YNEAMPYIEK GLKLNPLSSK
LYIFKAIIAA SHRHYIMAKV YADQSIKLNH HDYRSYFIRA MIEDKLHQID NEIKDLKKVV
ELKPNDADML NYLGYTMLIY NKDIKEGMKY IEKAVKLSPK NPSYLDSLAY GYFLLHDYKK
ALKYEEEAYK LNSKDSVIIQ HLGMIKLMLG QPKEAKSLLL KALYIVNKRG EEEPDQKKKI
LEFLKEIK
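
The protein sequence can be recovered from the gene sequence by translating it codon by codon. The sketch below is one minimal way to do this, assuming the codon-to-amino-acid assignments of translation table 11, which are the same as the standard genetic code (table 11 differs in which codons may serve as start codons). The names `CODON_TABLE` and `translate` are hypothetical. The sketch does not special-case alternative start codons, which is sufficient here because this gene begins with ATG.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(grouped_sequence):
    """Translate a DNA string (whitespace ignored) up to the first stop codon."""
    dna = "".join(grouped_sequence.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and the final line of the gene sequence above.
print(translate("ATGTTTAGAA AAATTTTGTT ATCACTTTTG"))  # MFRKILLSLL
print(translate("TTAGAATTTT TAAAAGAGAT AAAATAA"))     # LEFLKEIK
```

Both outputs agree with the first and last blocks of the protein sequence listed above.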