Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | HY04AAS1_1214 |
Symbol | |
ID | 6744031 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Hydrogenobaculum sp. Y04AAS1 |
Kingdom | Bacteria |
Replicon accession | NC_011126 |
Strand | + |
Start bp | 1125451 |
End bp | 1128423 |
Gene Length | 2973 bp |
Protein Length | 990 aa |
Translation table | 11 |
GC content | 35% |
IMG OID | 642751023 |
Product | valyl-tRNA synthetase |
Protein accession | YP_002121877 |
Protein GI | 195953587 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0525] Valyl-tRNA synthetase |
TIGRFAM ID | [TIGR00422] valyl-tRNA synthetase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.152091 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGGATT ATATACCAAA AGATATTGAA CAAGAAGTTT TGGAAAACTG GATTAAGTCA AATATATATA CTGTTAAAAG CCCGAAAAAG GCTTTTAGTA TGGTGATACC ACCACCAAAC GTCACTGGTT CTCTTCACAT AGGACACGCT TTAAACATCA CCATCCAAGA CATAATGGCA AGGTTTAAAA GAATGCAAGG CTACGATGTA GTATGGGTGC CAGGATTTGA TCATGCTGGT ATAGCCACTC AGTTTGTGGT GGAAAGAGAG CTTTCAAAAG AAAATAAATC AAGACTTGAA ATAGGAAGAG AAGAATTTTT AAAAAGAGTA TGGCAATGGG TTTATAAATC AAGAGACAAC ATCAAAAACC AAGTAAAAAG ATTAGGGGCT TCTGTAGATT GGTCAAGAGA ACGCTTTACG ATGGACGAAG GCTTTTCAAG AGCCGTAAGA CATGCTTTTA AAAAACTCTA CGAAGAAGGC CTTATAAAAA AAGACACATA CATTATAAAC TGGTGTCCAA AAGATCTGAC GGCTCTTTCT GATTTGGAAG TAGAACATGA AGAAGAAAAA GGAAAACTTT ATTACATAAA ATACCCAGTA TTAGATAGCA CCGAATATAT AATTGTAGCC ACTACAAGAC CAGAAACGAT GCTTGGAGAC GTGGCGGTGG CTGTAAATCC AAACGACGAA AGATACAAAC ACCTTATAGG TAAAAAACTA AAACTTCCTT TGGTAGATTG GCAAAGAGAA GATATGTCTG GTAACCAAGT AAGCCCCGAA ATACCAGTTA TAGCAGATGA GTTTGTGGAT ATGGAGTTTG GTACAGGGGC CGTTAAGATC ACACCAGCCC ACGATCCAAA CGACTACGAA GCAGGTTTAA GACAAAACCT TCCCATAGTA AAAATAATGG ATGAAAAGGC AAAACTTACA AAAAACGCCG GTAGATTTGA AAGTTTAGAC AGATACGAAG CAAGGCAAAA AATAGTAGAA GAGCTTAAGA GTTTAGGGCT TTTAGAAAAA GAAGAAGAGC ACATCCATGC TGTTGGAAAA TGCTATAGAT GTAAAACCAC TATAGAACCC ATGGTATCCA CCCAGTGGTT TTTAAAAGTC TCTGATAAAA AAGATGAGTT TTTAGAGGTT GCTAAAAGCC AAAAAACGAG GTTTATACCA CCAAACTGGG AGAAAAACTA CAACGAGTGG ATGGAGAATA TAAAAGATTG GTGCATATCG CGCCAAATAT GGTGGGGACA TAGAATACCA GTATGGGAAT GTGAGGATTG CAAAAACGAA TCTATCTATA CCGATGAAGA TTTTAACTAC GTCCAAGATA AACTTATTTT TAATCTTTTG GCAGATGGTA AGATAAAAAA GGTTTTTACA CCAAAAGAAA TAGACGATGT TTTAAACGGT AAGAATTTTG TACACCCTCA TATGAGTAGC CTTGATTTTT ACAAACGTTT TGCTTACAAA AAATACTATT CTACGGGCAT AAACGAGTTT TCTATATGGC AATACGTGGC CACCAGAAGA GACTTATATA AATATTACAA AGATATAAAA AGTTTTGAGA TGATCTTAAA ATGTAAACAC TGTGGTTCTA CAAACATAAA ACAGATAGAA GATGTGCTTG ATACATGGTT TTCATCCGCT TTATGGCCCT TTGGAGTCTT TGCTTGGCCA GAGCCAAATC ACGATTTAGA TGCTTGCTAT CCAAATAGCC TTTTGGTAAC GGCTTTTGAC ATACTCTTTT TCTGGGTGGC TCGCATGATC ATGATGAGCA AGATGCTAAA CGACAAAGAA CCCTTCAAAG ATGTCTACAT ACATGCTCTT ATAAGAGATG AAAAAGGCCA AAAGATGTCA AAAACCAAAG GAAACGTCAT AGACCCCATA GATGTAATAG AAAAATACGG AGCGGATAGC TTAAGGTTTA CATTGGCTTC TTTAAGCTCT GCTGTAAGGG ATATAAAACT TTCTGAACAA AAGTTTGAAG CAAGTAAGTT TTTTGCCAAC AAGATATGGA ACGCCGCCAA ATTTGTAATA TCAAACACCC CTGAGAATTT TCTCCAAGAC ATAATATACG GGGATATGTA TGAAAAAGAA GATTATTGGA TAATCACCGC CCTAAACCTT ACCATAGGAC AAGTGACCGA ATACCTTGAA CATTATCAGT TTTCCCATGC TGCTCAAAGT ATCTACAACT TTTTCTGGAA TGAATTTTGC GATTGGTATA TAGAGTTTTC AAAAATCCGC ATATACGAAA AGCCCATAGA AATAAAAGAA GATATGACAG ACGAAAAAAA GTCACAGATA GAACAAATAA ACAACCAAAT CCAAAAGAAA AAACTTACCG CTTTGGCAGT GCTAAACACC GTCCTATCAA AAGCTCTAAG ACTTTTACAC CCTTTTATGC CATTTATTAC AAGCTACATA CACGACAAGA CCATCTTTAA CGACAAAGAT ATATCTTTAA AAGAGTTTCC AGCTTTTGAT AAAGAAGCTA TAGATATTAA AAGCTATGAA ACCATAGAAA GACTAAAAAG ATTGATTTCT TTTATAAGAA AGATAAAATC AGATTTTAAG ATAGAATCAA AGATAGATAT GTATTTTGAA AGCCAAAACT CAAAAGAGTT TTTAGAAGAA TTTAAACCCC ACATAACAAA CCTTTGTAAA TTAAGCTCCT TTGAAATACC CACGGATAAA ACCAACATGC TATCGATACC TTTTGAAGAT ATAATGCTAT ATATACCAGA AAAAGACTTC GACAGAGATG CTCTTTTAAA AGATTACGAG AAAACGCTAA AGGATATAGA AAAGCAACTT TCTATATACA ACTCAAAACT ACAAAACAAA AACTTTATAG AAAAAGCTCC GCCCGAAGAA GTGGAAAAAG CAAAAACCAC AAAAGAAAAA TTAGATAAAG AAAAAGAAAA CGTACAAAAG CTTATAGCTA TATTAAAATC TGGTAAAATA TGA
|
Protein sequence | MKDYIPKDIE QEVLENWIKS NIYTVKSPKK AFSMVIPPPN VTGSLHIGHA LNITIQDIMA RFKRMQGYDV VWVPGFDHAG IATQFVVERE LSKENKSRLE IGREEFLKRV WQWVYKSRDN IKNQVKRLGA SVDWSRERFT MDEGFSRAVR HAFKKLYEEG LIKKDTYIIN WCPKDLTALS DLEVEHEEEK GKLYYIKYPV LDSTEYIIVA TTRPETMLGD VAVAVNPNDE RYKHLIGKKL KLPLVDWQRE DMSGNQVSPE IPVIADEFVD MEFGTGAVKI TPAHDPNDYE AGLRQNLPIV KIMDEKAKLT KNAGRFESLD RYEARQKIVE ELKSLGLLEK EEEHIHAVGK CYRCKTTIEP MVSTQWFLKV SDKKDEFLEV AKSQKTRFIP PNWEKNYNEW MENIKDWCIS RQIWWGHRIP VWECEDCKNE SIYTDEDFNY VQDKLIFNLL ADGKIKKVFT PKEIDDVLNG KNFVHPHMSS LDFYKRFAYK KYYSTGINEF SIWQYVATRR DLYKYYKDIK SFEMILKCKH CGSTNIKQIE DVLDTWFSSA LWPFGVFAWP EPNHDLDACY PNSLLVTAFD ILFFWVARMI MMSKMLNDKE PFKDVYIHAL IRDEKGQKMS KTKGNVIDPI DVIEKYGADS LRFTLASLSS AVRDIKLSEQ KFEASKFFAN KIWNAAKFVI SNTPENFLQD IIYGDMYEKE DYWIITALNL TIGQVTEYLE HYQFSHAAQS IYNFFWNEFC DWYIEFSKIR IYEKPIEIKE DMTDEKKSQI EQINNQIQKK KLTALAVLNT VLSKALRLLH PFMPFITSYI HDKTIFNDKD ISLKEFPAFD KEAIDIKSYE TIERLKRLIS FIRKIKSDFK IESKIDMYFE SQNSKEFLEE FKPHITNLCK LSSFEIPTDK TNMLSIPFED IMLYIPEKDF DRDALLKDYE KTLKDIEKQL SIYNSKLQNK NFIEKAPPEE VEKAKTTKEK LDKEKENVQK LIAILKSGKI
|
| |