Gene HS_1259 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHS_1259 
SymbolvalS 
ID4240770 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaemophilus somnus 129PT 
KingdomBacteria 
Replicon accessionNC_008309 
Strand
Start bp1444398 
End bp1447244 
Gene Length2847 bp 
Protein Length948 aa 
Translation table11 
GC content44% 
IMG OID638104832 
Productvalyl-tRNA synthetase 
Protein accessionYP_719471 
Protein GI113461402 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0525] Valyl-tRNA synthetase 
TIGRFAM ID[TIGR00422] valyl-tRNA synthetase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAAGACC GTTTCAACCC TTCCGCCGTA GAGCAAGCGC TCTACCAACA CTGGGAATCG 
CAAGGCTATT TTAAGCCAAG CGAAGATGTT AATGCACCAA GCTACTGTAT TGCGATTCCG
CCGCCGAATG TGACCGGTTC GCTACATATG GGGCACGCTT TCCAGCAAAT CTTGATGGAT
ACTTTAATCC GCTTCAACCG TATGGAGGGG AACAATACCC TTTGGCAAAC GGGAACAGAC
CATGCGGGGA TTGCGACTCA AATGGTGGTG GAGCGTAAAA TCGCTGCCGA AGAGGGCAAA
ACCCGCCACG ATTATGGTCG TGAGGCGTTT ATCAATAAAA TTTGGGATTG GAAAGCCTAT
TCGGGCGGTA CCATCAGCCA GCAAATGAGA CGACTTGGTA ACTCAATCGA CTGGGATCGT
GAGCGTTTTA CGATGGACGA GGGCTTGTCT AATGCGGTGA AAGAGGTCTT TGTGCGTTTG
CACGAAGAAG GCTTGATTTA CCGTGGCAAA CGCTTGGTGA ATTGGGATCC AAAACTGCAC
ACAGCCATTT CTGATTTGGA AGTGGAAAAC AAAGAGAGCA AAGGCTCGCT TTGGCATTTC
CGCTATCCGC TTGCTAATGG TGCGAAAACC GCTGAGGGCT TAGATTATTT AGTGGTGGCG
ACCACTCGTC CTGAAACAAT GTTGGGCGAT ACAGCGGTTG CGGTTCACCC CGAAGATGAG
CGTTATCAAT CGCTGATTGG CAAAACGGTC GTGCTACCGC TTGCCAACCG TGAAATTCCG
ATTATCGCTG ATGAATATGT GGATCGTGAA TTCGGTACAG GTGTGGTAAA AATTACCCCT
GCACATGACT TCAACGACTA CGAAGTGGGC AAACGCCACG CCTTGCCGAT GGTAAATGTG
ATGACGCTCA ATGCCGATAT TCGTGCGGAA GCAGAAATTA TCGGCACAGA CGGAAAAACT
CTCGAAAATT ACACCGCACT TATTCCACAG GACTATCAAG GCTTAGAGCG TTTTGTGGCT
CGTAAGAAAA TCGTGGCTGA TTTTGAGGCT CTCGGCTTAT TAGACCAAAT CAAACCGCAC
GACTTAAAAG TCCCTTACGG TGACCGTGGT GGCGTGCCGA TTGAGCCGAT GCTGACCGAT
CAATGGTATG TGAGCGTGAA ACCGCTTGCC GAAGTCGCCA CCAAAGCGGT GGAAAACGGC
GAAATCCAAT TCGTACCTAA ACAGTATGAA AATCTTTACT TCTCTTGGAT GCGTGATATT
CAAGATTGGT GTATTTCTCG TCAACTTTGG TGGGGACATC GTATTCCGGC TTGGTATGAT
GAACAAGGCA ATGTCTATGT TGCTCGAGAT GAAGCGGAAG TGCGGTCAAA ATACGGCTTA
ACTTCAGACG TAGCGTTAAA ACAGGATGAA GACGTATTAG ACACTTGGTT CTCGTCCGGA
TTATGGACAT TCTCAACACT TGGCTGGCCT GAGCAAACCA AAGAACTGAA AATATTTCAC
CCGACCGATG TATTAATCAC CGGTTTTGAC ATCATCTTCT TTTGGGTAGC AAGAATGATT
ATGTTCACGA TGCACTTCAT CAAAGATGAA AACGGCAAAC CGCAAGTACC GTTTAAAACC
GTTTATGTAA CAGGTTTAAT TCGTGATGAA CAAGGGCAAA AAATGTCCAA ATCCAAAGGG
AACGTGTTAG ACCCGATTGA TATGATTGAT GGTATCAGTC TTGAGGATTT GCTTGAAAAA
CGCACCGGCA ATATGATGCA ACCACAATTG GCGGAAAAAA TTGCTAAAGC AACTCGTAAA
GAATTTGAAC ATGGCATTTC TGCACATGGA ACGGATGCGT TACGCTTTAC CCTTGCGGCG
TTAGCGAGCA ATGGACGTGA TATCAATTGG GATATGAAAC GTTTGGAGGG CTACCGCAAT
TTCTGTAATA AATTATGGAA TGCTAGTCGT TATGTATTAA CCCACGAAAA ATTGGATCTA
AGTGAGGGCG AAGCGGAATA TTCGTTGGCG GATCGTTGGA TTGAGAGTCA ATTCAACCGC
ACTATAGACG CATTTCGCAC CGCACTTAAG CAATACCGTT TCGATTTAGT GGCAAATACG
ATTTACGATT TTACTTGGAA TCAGTTCTGC GATTGGTATT TAGAGCTAAC CAAGCCTGTA
TTTGCTCATG GTACGGATGC TCAAAAACGT GGTACAAGTC GCATGCTTCT CAATATATTA
GAGAAATTAT TGCGTTTAGC ACATCCAATT ATTCCATTTA TCACGGAAGA AATTTGGCAG
AAATTGAAGG GTGTAATGAA GTTATCAGGC GATACCATTA TGTTACAACC ATTTCCACGC
ATAGAGGAAA ACCGGCTTGA TCTAGAGGCG GAAAGCCAAA TGAATTGGTT AAAAGAAGTG
ATTGTAGCAG TGCGTAATAT TCGTGCGGAA TGTAATATTT CACCGAGTCA TGCGTTGGAA
CTGTTGCTGA GAAACATTTC AGAGGAAACA AAAATTTGTC TTGAAAATAA CCGCACTTTA
TTACAATCTA TGGCAAAATT GTCCACGATT ACTTTACTGG ACGCCAATGA AGAACCGCCA
CTTTCGGTCA CGAAACTGGT TGAAAATAAT GAAATCTTTA TTCCAATGGC AGGTTTTATT
AATAAAGAGC AAGAACTCAC AAGATTGACT AAAGAAATTG AAAAATTGAA AGGTGAAATT
GTTCGAATTG AAAACAAACT CAGCAATGAA GCCTTTATTA CCAAAGCACC GGAGCAAGTT
ATCGCCAAAG AGCGGGAAAA AATGCAGGGG TATTTGGATA GTATAGAAAA ACTTCAACAG
CAATATCAGG CGATTGAGAT GTTGTAA
 
Protein sequence
MEDRFNPSAV EQALYQHWES QGYFKPSEDV NAPSYCIAIP PPNVTGSLHM GHAFQQILMD 
TLIRFNRMEG NNTLWQTGTD HAGIATQMVV ERKIAAEEGK TRHDYGREAF INKIWDWKAY
SGGTISQQMR RLGNSIDWDR ERFTMDEGLS NAVKEVFVRL HEEGLIYRGK RLVNWDPKLH
TAISDLEVEN KESKGSLWHF RYPLANGAKT AEGLDYLVVA TTRPETMLGD TAVAVHPEDE
RYQSLIGKTV VLPLANREIP IIADEYVDRE FGTGVVKITP AHDFNDYEVG KRHALPMVNV
MTLNADIRAE AEIIGTDGKT LENYTALIPQ DYQGLERFVA RKKIVADFEA LGLLDQIKPH
DLKVPYGDRG GVPIEPMLTD QWYVSVKPLA EVATKAVENG EIQFVPKQYE NLYFSWMRDI
QDWCISRQLW WGHRIPAWYD EQGNVYVARD EAEVRSKYGL TSDVALKQDE DVLDTWFSSG
LWTFSTLGWP EQTKELKIFH PTDVLITGFD IIFFWVARMI MFTMHFIKDE NGKPQVPFKT
VYVTGLIRDE QGQKMSKSKG NVLDPIDMID GISLEDLLEK RTGNMMQPQL AEKIAKATRK
EFEHGISAHG TDALRFTLAA LASNGRDINW DMKRLEGYRN FCNKLWNASR YVLTHEKLDL
SEGEAEYSLA DRWIESQFNR TIDAFRTALK QYRFDLVANT IYDFTWNQFC DWYLELTKPV
FAHGTDAQKR GTSRMLLNIL EKLLRLAHPI IPFITEEIWQ KLKGVMKLSG DTIMLQPFPR
IEENRLDLEA ESQMNWLKEV IVAVRNIRAE CNISPSHALE LLLRNISEET KICLENNRTL
LQSMAKLSTI TLLDANEEPP LSVTKLVENN EIFIPMAGFI NKEQELTRLT KEIEKLKGEI
VRIENKLSNE AFITKAPEQV IAKEREKMQG YLDSIEKLQQ QYQAIEML