Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | HY04AAS1_1034 |
Symbol | |
ID | 6743849 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Hydrogenobaculum sp. Y04AAS1 |
Kingdom | Bacteria |
Replicon accession | NC_011126 |
Strand | - |
Start bp | 968432 |
End bp | 970078 |
Gene Length | 1647 bp |
Protein Length | 548 aa |
Translation table | 11 |
GC content | 32% |
IMG OID | 642750842 |
Product | Tetratricopeptide TPR_2 repeat protein |
Protein accession | YP_002121698 |
Protein GI | 195953408 |
COG category | [R] General function prediction only |
COG ID | [COG4783] Putative Zn-dependent protease, contains TPR repeats |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 34 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTTAGAA AAATTTTGTT ATCACTTTTG TTTGCAAGCT CTTTAGGTTT GGCAAACCCG TATTTTGACT ATGCAATGTG TCATTATTAT ATAGATAACC CTCAAAAAGC TTATCCTTAT TGCTCAAAAG CCCTTAAGGA ACTTCCAACC CCAAGTGTGT TTGAAGATGT TATAAGTATG TATGTAAGAG CCCATTATTT AGAAGGTGCT ATTTACGTTG CAAAGCTTTA CAAAGATAGA TATCCAAATC TAAAAGAACC TTATATAGAT CTATACACTC TTTATACTAT GGATAAAGAC TATGAAAATG CAAAAAAGAC ATTGGAAGAA GCGCTAAGTA AATTTCCTCA GGACAATTTC TTTGCTCTAA ATCTTATAAC CACGTATATA CATGATGGAG AAATTAAAAA AGCCGAAGAT ATTATAAACA GGTTTTTGGC TTTTAAAAAC GAAAACGGAA AAGAGATTTT TTACTATATT AGAGCACGTA TAGAACTTGC ACAGCAAAAC AAAGAAAAGG CAATAGAAGA TCTTAAAAAG GCTATAGAGT TAAAACCAAA CTTTGATGAG GCCGTAGATA CTTTGGCAAG TATATACGAT CAAGAAAGTA AATATCAAGA TGAGGAAAAA CTTTATGAAG ATATACTAAA AAAAGATTCT TCTAATATAA GCGCCTTGGA GCGGTTGGGT AACCTGTTTT TTAAATTAGG CCTTAGCTAC AAGGCTTCGG ATATATATAA AAAACTTGCA GAACTAAACA AAAACAATCT AAACTATCAG TATCAGTATG CATTATCTTT GCTCCAAAGC ATGAGATACG ATAAGGCTTT ATCAGTTTTA GCCCCTTTGT ATAAGAAGCA TCCAAACAAC AAACCCATTG CTTACCTTTA CGGACTGACG TTGGAAGCTG CTCACAAACC GGTAAAAGCT CTTGAAGTAT ATAAAAAGCT TCTTCAAATA GATAAGAAAA ACCCAAAGCT ATACGAAAGA ATAGCAAGTA TTTTGATAGA CGAAGGTAAA TATAACGAGG CTATGCCTTA TATAGAAAAA GGCTTAAAAC TCAACCCCTT GAGCTCAAAG CTTTACATAT TTAAAGCTAT AATAGCGGCT AGTCATCGTC ATTACATAAT GGCAAAAGTA TATGCGGATC AATCTATAAA GCTAAATCAT CATGATTATA GAAGCTATTT TATAAGAGCT ATGATTGAAG ATAAGCTTCA TCAGATAGAC AATGAAATAA AGGATTTAAA AAAAGTAGTA GAGTTAAAGC CAAATGATGC AGATATGCTA AATTACTTAG GTTATACTAT GCTTATTTAC AATAAAGATA TAAAAGAGGG AATGAAGTAT ATAGAAAAAG CAGTGAAACT TTCTCCTAAA AACCCCTCTT ACCTTGATAG CTTAGCTTAT GGATACTTTC TTTTGCATGA TTATAAAAAG GCTTTAAAAT ACGAAGAAGA GGCTTACAAG CTAAATTCAA AAGATTCCGT AATTATACAA CATCTTGGCA TGATAAAACT TATGCTGGGA CAACCAAAAG AAGCCAAAAG TTTGTTACTA AAAGCCCTAT ACATAGTAAA TAAAAGAGGA GAAGAAGAGC CTGATCAAAA GAAAAAGATT TTAGAATTTT TAAAAGAGAT AAAATAA
|
Protein sequence | MFRKILLSLL FASSLGLANP YFDYAMCHYY IDNPQKAYPY CSKALKELPT PSVFEDVISM YVRAHYLEGA IYVAKLYKDR YPNLKEPYID LYTLYTMDKD YENAKKTLEE ALSKFPQDNF FALNLITTYI HDGEIKKAED IINRFLAFKN ENGKEIFYYI RARIELAQQN KEKAIEDLKK AIELKPNFDE AVDTLASIYD QESKYQDEEK LYEDILKKDS SNISALERLG NLFFKLGLSY KASDIYKKLA ELNKNNLNYQ YQYALSLLQS MRYDKALSVL APLYKKHPNN KPIAYLYGLT LEAAHKPVKA LEVYKKLLQI DKKNPKLYER IASILIDEGK YNEAMPYIEK GLKLNPLSSK LYIFKAIIAA SHRHYIMAKV YADQSIKLNH HDYRSYFIRA MIEDKLHQID NEIKDLKKVV ELKPNDADML NYLGYTMLIY NKDIKEGMKY IEKAVKLSPK NPSYLDSLAY GYFLLHDYKK ALKYEEEAYK LNSKDSVIIQ HLGMIKLMLG QPKEAKSLLL KALYIVNKRG EEEPDQKKKI LEFLKEIK
|
| |