Gene HY04AAS1_1214 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHY04AAS1_1214 
Symbol 
ID6744031 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHydrogenobaculum sp. Y04AAS1 
KingdomBacteria 
Replicon accessionNC_011126 
Strand
Start bp1125451 
End bp1128423 
Gene Length2973 bp 
Protein Length990 aa 
Translation table11 
GC content35% 
IMG OID642751023 
Productvalyl-tRNA synthetase 
Protein accessionYP_002121877 
Protein GI195953587 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0525] Valyl-tRNA synthetase 
TIGRFAM ID[TIGR00422] valyl-tRNA synthetase 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.152091 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGGATT ATATACCAAA AGATATTGAA CAAGAAGTTT TGGAAAACTG GATTAAGTCA 
AATATATATA CTGTTAAAAG CCCGAAAAAG GCTTTTAGTA TGGTGATACC ACCACCAAAC
GTCACTGGTT CTCTTCACAT AGGACACGCT TTAAACATCA CCATCCAAGA CATAATGGCA
AGGTTTAAAA GAATGCAAGG CTACGATGTA GTATGGGTGC CAGGATTTGA TCATGCTGGT
ATAGCCACTC AGTTTGTGGT GGAAAGAGAG CTTTCAAAAG AAAATAAATC AAGACTTGAA
ATAGGAAGAG AAGAATTTTT AAAAAGAGTA TGGCAATGGG TTTATAAATC AAGAGACAAC
ATCAAAAACC AAGTAAAAAG ATTAGGGGCT TCTGTAGATT GGTCAAGAGA ACGCTTTACG
ATGGACGAAG GCTTTTCAAG AGCCGTAAGA CATGCTTTTA AAAAACTCTA CGAAGAAGGC
CTTATAAAAA AAGACACATA CATTATAAAC TGGTGTCCAA AAGATCTGAC GGCTCTTTCT
GATTTGGAAG TAGAACATGA AGAAGAAAAA GGAAAACTTT ATTACATAAA ATACCCAGTA
TTAGATAGCA CCGAATATAT AATTGTAGCC ACTACAAGAC CAGAAACGAT GCTTGGAGAC
GTGGCGGTGG CTGTAAATCC AAACGACGAA AGATACAAAC ACCTTATAGG TAAAAAACTA
AAACTTCCTT TGGTAGATTG GCAAAGAGAA GATATGTCTG GTAACCAAGT AAGCCCCGAA
ATACCAGTTA TAGCAGATGA GTTTGTGGAT ATGGAGTTTG GTACAGGGGC CGTTAAGATC
ACACCAGCCC ACGATCCAAA CGACTACGAA GCAGGTTTAA GACAAAACCT TCCCATAGTA
AAAATAATGG ATGAAAAGGC AAAACTTACA AAAAACGCCG GTAGATTTGA AAGTTTAGAC
AGATACGAAG CAAGGCAAAA AATAGTAGAA GAGCTTAAGA GTTTAGGGCT TTTAGAAAAA
GAAGAAGAGC ACATCCATGC TGTTGGAAAA TGCTATAGAT GTAAAACCAC TATAGAACCC
ATGGTATCCA CCCAGTGGTT TTTAAAAGTC TCTGATAAAA AAGATGAGTT TTTAGAGGTT
GCTAAAAGCC AAAAAACGAG GTTTATACCA CCAAACTGGG AGAAAAACTA CAACGAGTGG
ATGGAGAATA TAAAAGATTG GTGCATATCG CGCCAAATAT GGTGGGGACA TAGAATACCA
GTATGGGAAT GTGAGGATTG CAAAAACGAA TCTATCTATA CCGATGAAGA TTTTAACTAC
GTCCAAGATA AACTTATTTT TAATCTTTTG GCAGATGGTA AGATAAAAAA GGTTTTTACA
CCAAAAGAAA TAGACGATGT TTTAAACGGT AAGAATTTTG TACACCCTCA TATGAGTAGC
CTTGATTTTT ACAAACGTTT TGCTTACAAA AAATACTATT CTACGGGCAT AAACGAGTTT
TCTATATGGC AATACGTGGC CACCAGAAGA GACTTATATA AATATTACAA AGATATAAAA
AGTTTTGAGA TGATCTTAAA ATGTAAACAC TGTGGTTCTA CAAACATAAA ACAGATAGAA
GATGTGCTTG ATACATGGTT TTCATCCGCT TTATGGCCCT TTGGAGTCTT TGCTTGGCCA
GAGCCAAATC ACGATTTAGA TGCTTGCTAT CCAAATAGCC TTTTGGTAAC GGCTTTTGAC
ATACTCTTTT TCTGGGTGGC TCGCATGATC ATGATGAGCA AGATGCTAAA CGACAAAGAA
CCCTTCAAAG ATGTCTACAT ACATGCTCTT ATAAGAGATG AAAAAGGCCA AAAGATGTCA
AAAACCAAAG GAAACGTCAT AGACCCCATA GATGTAATAG AAAAATACGG AGCGGATAGC
TTAAGGTTTA CATTGGCTTC TTTAAGCTCT GCTGTAAGGG ATATAAAACT TTCTGAACAA
AAGTTTGAAG CAAGTAAGTT TTTTGCCAAC AAGATATGGA ACGCCGCCAA ATTTGTAATA
TCAAACACCC CTGAGAATTT TCTCCAAGAC ATAATATACG GGGATATGTA TGAAAAAGAA
GATTATTGGA TAATCACCGC CCTAAACCTT ACCATAGGAC AAGTGACCGA ATACCTTGAA
CATTATCAGT TTTCCCATGC TGCTCAAAGT ATCTACAACT TTTTCTGGAA TGAATTTTGC
GATTGGTATA TAGAGTTTTC AAAAATCCGC ATATACGAAA AGCCCATAGA AATAAAAGAA
GATATGACAG ACGAAAAAAA GTCACAGATA GAACAAATAA ACAACCAAAT CCAAAAGAAA
AAACTTACCG CTTTGGCAGT GCTAAACACC GTCCTATCAA AAGCTCTAAG ACTTTTACAC
CCTTTTATGC CATTTATTAC AAGCTACATA CACGACAAGA CCATCTTTAA CGACAAAGAT
ATATCTTTAA AAGAGTTTCC AGCTTTTGAT AAAGAAGCTA TAGATATTAA AAGCTATGAA
ACCATAGAAA GACTAAAAAG ATTGATTTCT TTTATAAGAA AGATAAAATC AGATTTTAAG
ATAGAATCAA AGATAGATAT GTATTTTGAA AGCCAAAACT CAAAAGAGTT TTTAGAAGAA
TTTAAACCCC ACATAACAAA CCTTTGTAAA TTAAGCTCCT TTGAAATACC CACGGATAAA
ACCAACATGC TATCGATACC TTTTGAAGAT ATAATGCTAT ATATACCAGA AAAAGACTTC
GACAGAGATG CTCTTTTAAA AGATTACGAG AAAACGCTAA AGGATATAGA AAAGCAACTT
TCTATATACA ACTCAAAACT ACAAAACAAA AACTTTATAG AAAAAGCTCC GCCCGAAGAA
GTGGAAAAAG CAAAAACCAC AAAAGAAAAA TTAGATAAAG AAAAAGAAAA CGTACAAAAG
CTTATAGCTA TATTAAAATC TGGTAAAATA TGA
 
Protein sequence
MKDYIPKDIE QEVLENWIKS NIYTVKSPKK AFSMVIPPPN VTGSLHIGHA LNITIQDIMA 
RFKRMQGYDV VWVPGFDHAG IATQFVVERE LSKENKSRLE IGREEFLKRV WQWVYKSRDN
IKNQVKRLGA SVDWSRERFT MDEGFSRAVR HAFKKLYEEG LIKKDTYIIN WCPKDLTALS
DLEVEHEEEK GKLYYIKYPV LDSTEYIIVA TTRPETMLGD VAVAVNPNDE RYKHLIGKKL
KLPLVDWQRE DMSGNQVSPE IPVIADEFVD MEFGTGAVKI TPAHDPNDYE AGLRQNLPIV
KIMDEKAKLT KNAGRFESLD RYEARQKIVE ELKSLGLLEK EEEHIHAVGK CYRCKTTIEP
MVSTQWFLKV SDKKDEFLEV AKSQKTRFIP PNWEKNYNEW MENIKDWCIS RQIWWGHRIP
VWECEDCKNE SIYTDEDFNY VQDKLIFNLL ADGKIKKVFT PKEIDDVLNG KNFVHPHMSS
LDFYKRFAYK KYYSTGINEF SIWQYVATRR DLYKYYKDIK SFEMILKCKH CGSTNIKQIE
DVLDTWFSSA LWPFGVFAWP EPNHDLDACY PNSLLVTAFD ILFFWVARMI MMSKMLNDKE
PFKDVYIHAL IRDEKGQKMS KTKGNVIDPI DVIEKYGADS LRFTLASLSS AVRDIKLSEQ
KFEASKFFAN KIWNAAKFVI SNTPENFLQD IIYGDMYEKE DYWIITALNL TIGQVTEYLE
HYQFSHAAQS IYNFFWNEFC DWYIEFSKIR IYEKPIEIKE DMTDEKKSQI EQINNQIQKK
KLTALAVLNT VLSKALRLLH PFMPFITSYI HDKTIFNDKD ISLKEFPAFD KEAIDIKSYE
TIERLKRLIS FIRKIKSDFK IESKIDMYFE SQNSKEFLEE FKPHITNLCK LSSFEIPTDK
TNMLSIPFED IMLYIPEKDF DRDALLKDYE KTLKDIEKQL SIYNSKLQNK NFIEKAPPEE
VEKAKTTKEK LDKEKENVQK LIAILKSGKI