Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | HY04AAS1_0825 |
Symbol | |
ID | 6743631 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Hydrogenobaculum sp. Y04AAS1 |
Kingdom | Bacteria |
Replicon accession | NC_011126 |
Strand | - |
Start bp | 766807 |
End bp | 768600 |
Gene Length | 1794 bp |
Protein Length | 597 aa |
Translation table | 11 |
GC content | 35% |
IMG OID | 642750626 |
Product | hypothetical protein |
Protein accession | YP_002121490 |
Protein GI | 195953200 |
COG category | [N] Cell motility |
COG ID | [COG1344] Flagellin and related hook-associated proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.000284706 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCTGGAG CAATATCGGA TTTAAGTCTT TACAACTTTT ACAAATCTCA AGATAGTGTA GTGTTAAATA ATCTTCAAGA TACTATAGAA GAGGCTTCCA CTGGGTATAA GCTTTTAAAC ATAGGTCAAA ATCCAGGCGA TACTCAGCAG GTTATAAATT TGAAAAAAGA AATAGTGCTT TTATCAACAT ATTCACAAAA CGCTTTTTCA GCTAGCAACG TACTTACTAC CACCACTTCT GTGCTTGGCA ATCTTTATGA CTATCTTCAA ACAGTAAATA CCGATGTTGT AGCAGCTGCT AACGAAGCTA CGTACAATTC TACGCAACTT ATAAATATGG GCCAAAGTAT TTATTCTATA TTGAATCTTA CGTTGTCAAA AGCTAATGAA AAATTTGGAG ACAACTATTT GTTTGGTGGC TCTTCGTTGT CTATACAGCC TTTTACTGCT GATTTTTCTT ACCAAGCTTC TACTACAGAT TTTTATACGC AAATATCAAA CTCTTATCAA GTACCTACAT ATCTAAACGG TCAAAACGTT TTTGGGCTTA ATATCCAAAC TACCTCCACC TCATATAACT CTTATACACA AAGTTTTTCT GGACCCGGTG AACTTATAAT ACACTATGGT ACCAATGTGT ATCAGATAAA CTATAACAAT ACGCCCTATG AATGGGATTG GGACGCTGGA CTTTCTTCTA CAAATGCACC TCTTGGAGCA TCTGGTATGA TATCTCTTAG TATGGTGACA GCCTCTGCTA CAAATGTTTA CAACATTTCT TATACAAGCA CAGATACGTT ATCAACGCTG TTAAACAAAA TATCTACATC TACAGGTGGC GACTTTAAAG CGTCTATTCT ACACAATTTA GATAATACTT ACGGAATAGA AATATCACCA TCCACAAATA ATTTATCTGC TACTTATTCT CTTTATGATA GCAACTCTTT ATATAAATTA GACCAAACCC CTTCAAATGT ATTAGAGCTA TCAAATTATA TAAACAACGT TTTTAGCGGC ACTTTGCAAG CTTTTATAAG ACAAAATTCG AATTCGACAT TTTCATTGGA GATCGCTGGA AAAGATGTAG CAAAACCTTT AAATATAATA GATTTGAATC AATACGTATC TTCTTCTTTT AAACAAGAAA GTGTTTTTTC TGTGTTAAAA CAAACGGCTG ATAGACTTAG TTTGGGGCTT CCAACCATAG ATAACGAGCT TGGGGCAAAC ATGGTTATAT CATCTCAAAG TTTTAATAGC CTAACATCAC CTATTGGAGT AAACGGTACA TTAGAAATAA GGATTAGTTC AAGCACGATA CCGATAGATT ATACTGCAAA TATGAATCTA GCAGATGTAG CAAATCTTAT CAATAAAGCT TTAAACGGAG CTGCTTATGC AGATTTTGTT CAAAATCAAA ACGGCACTTA TAACTTAGAG ATAAGCTCAA TGTTAGCTTC TCAATCGCTT ACTGCCACAG ATGTAGTGAA TGGGGCTTTC TCCCAATTCA ATAACAACCA GATACCAAAC GGTAGCTATA TATTCAATGT TCAAAGAGCT TTAGACCAAA TATCTTATGC AAACGCTCAG GTAGGAAGCT ATATCCAAAA TATTCAAACC CAGGATAACG TACTGACAAA CACTACTACT GTGGCTACTA CAGAGCTTGC AAACTATCAG GACGCAAATG TACCCAATGT GTTAACAGAC TATTCTCAAT ATCAGCTAGC TTACGAATCT TTGATGAACT TGATAGCAAA TCAAAAGAAC TTAACGATAT TGAAGTATAT ATAG
|
Protein sequence | MSGAISDLSL YNFYKSQDSV VLNNLQDTIE EASTGYKLLN IGQNPGDTQQ VINLKKEIVL LSTYSQNAFS ASNVLTTTTS VLGNLYDYLQ TVNTDVVAAA NEATYNSTQL INMGQSIYSI LNLTLSKANE KFGDNYLFGG SSLSIQPFTA DFSYQASTTD FYTQISNSYQ VPTYLNGQNV FGLNIQTTST SYNSYTQSFS GPGELIIHYG TNVYQINYNN TPYEWDWDAG LSSTNAPLGA SGMISLSMVT ASATNVYNIS YTSTDTLSTL LNKISTSTGG DFKASILHNL DNTYGIEISP STNNLSATYS LYDSNSLYKL DQTPSNVLEL SNYINNVFSG TLQAFIRQNS NSTFSLEIAG KDVAKPLNII DLNQYVSSSF KQESVFSVLK QTADRLSLGL PTIDNELGAN MVISSQSFNS LTSPIGVNGT LEIRISSSTI PIDYTANMNL ADVANLINKA LNGAAYADFV QNQNGTYNLE ISSMLASQSL TATDVVNGAF SQFNNNQIPN GSYIFNVQRA LDQISYANAQ VGSYIQNIQT QDNVLTNTTT VATTELANYQ DANVPNVLTD YSQYQLAYES LMNLIANQKN LTILKYI
|
| |