Gene HS_0121 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHS_0121 
Symbol 
ID4239629 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaemophilus somnus 129PT 
KingdomBacteria 
Replicon accessionNC_008309 
Strand
Start bp107556 
End bp109847 
Gene Length2292 bp 
Protein Length763 aa 
Translation table11 
GC content38% 
IMG OID638103650 
Producthypothetical protein 
Protein accessionYP_718325 
Protein GI113460267 
COG category[R] General function prediction only 
COG ID[COG4258] Predicted exporter 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.142697 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGCAAAT TATTGACCGT TTACCGCTTA ATTTTTGCAG GGGTTTTATG TCTTGTGGTG 
GCGATTTTTA TGTATCATTT ACAAACAGGT AAATGGTTAC AAACGGATCT ACATACACTT
TTGCCGGACA GTCAACACTA TACAAAAATT CAATTAGAGG CAGATAAACA TCAAGAACAG
CAATTTAATC AGCAAGTTAT TGCATTGGTT GGACATTCAC AATCAGAAGC TGCTTTTAAA
TTGGCGGAAA AAGTTGCAGA ACAATGGCAA AAAAGTGGGT TATTTCAAAC GTTATCTGTA
AAAAATCAAC CGAATTTAGC TGAACTTCAA CAGCAGATTG AGTTATTAAA ATTAGCCACA
CTGCCTATTT CAACACGAAA TCAGATTATT CAACAGCCCG AACGTTATTT TCAGCAATAT
GCAGAGCAAA TCATTAACCC GTTCGGTTAT CAAAATTTAC TGCCATTGGA ACAAGACTGG
CTTGGTTTTG GGCGTTTTGT ATTGTCGCAA TCTCAACAGC AAAGCCAAAT TCAGTGGCAT
GCAGAAACAG GTATGCTCTA TGCTGTTCAA CAAGGTAAGA CATGGGTTTT ATTAACAGGC
AAAATTGTTG ATTCAGATTT GATCAAACCT CAGCAAAATT TAACCGCACT TCTTAAGCAA
AATGCACAAT TTATTCAAGA ACAACAAGGT CAATGGTTAA GTACAGGTGC GGTGATTTTT
GCGGATTATT CACAACAACA AGCTAAATAT GAAAGCACGA TCATGGGCGG GTTAGGTATC
AGCCTAACCT TGCTTTTACT GTTGCTGGTT TTCCGTAGCT TACGCATATT ATGGTTATTT
TTACCGATTT CTGTAGGTAT GGTTGCAGGT ATTACTGCTA CCATTAGTTG CTTTGGGCAG
ATTCATATTT TAACGCTCGT GATTGGCACG AGTTTAGTGG GGGTTCTGAT TGATTTCCCA
TTGCATTGGC TTACATCTTC TCTATTTTTA AGCCGATGGC GTGCCAATAA AGCGATGGCA
AAACTTCGCC TTACTTTTTT TGTCAGCTTA TTGGTGACTT TGCTGGGTTA TGCTTTACTG
GGATTTACTG CTTTACCTAT CCTAAAACAG ACCGCACTTT TTTCCGGTAT GGCGTTGATT
TTTGCTGTCT TAACGACCTT CTTGTATTTG CCCCTATTTT TCCGACATTA CCAGTCAGGT
AAGTCGTTGT TTTTACGCCG AATTTTGCAA ATAAATTTCC ACGTTAAGAT TAATTCATTA
TTAAATAAGA TTTTATTTGT GGTTAGCACG GGTTTCATTG TAGTAGGGTT GCAGAAAAGC
TATTGGCAAG ATGATATTCG TCAATGGGTT GCTATGCCGA TGGAATTGAT TGAGCAGGCA
CAAAAAATTC GTCAAATTAC CGGCATTGAT CTAAGCAATC AATACTTATT AATTACTGCT
GAAAATAATG AGCAATTATT GCAAAAAGAC CGAATTCTAA CCGAGAAATT ACAACGGTTC
GCACAAGAAA ATAATCTGAT AAAATTTCAG TCATTAAGCC AATGGATTAT GTCAAAAAAG
CAACAAGCGG AGTTTATTCA ACAGCTAAAA AATATTCCTG CTGAAAGCTA TAGCGTTTTT
GATGAAATTG GTATTCCAAA GGACATGATT CGTCATTCGC TAAAAAAATT GGAAAAACAG
CCTCTTGTTA GCTTAGAACA GGCCTTAAAT ACAGAATTGG GAAAAGTTTG GAAAAATCTA
TATTTAGGTG AGCTTGATCG AGGAAAAGTA GCAAGTATTA TTAAAGTATC AGGATTGAAT
AACCCAAAAA TTCTTGAGCA AATTGTTAAT AATCGGGATA TTTACTGGCA GGATAAACCT
GCCCATCTCA ATCAGTTGTT TGAGCAAACC CGCAACCAAG CGGCTTGGTT AAAATTGCTC
TCATTTGGCT TGGCAGCTTT ATTGTTGTGG CGAATGTTTG GCATATCGCA AACGTTGAAA
ATGCTCAGTA TCCCGCTTAT TTCAGTTGTG TGTACAGTCG CAATTTTAGG TTGGTTAAAT
ATCACCATCA GTTTGTTTGC TATGTTTGGG TTATTATTGG TGTCGGTCAT TGGTATTGAT
TACATTGCCT ATATGCAAAC AGCGAAAGAG CCGTTATCAA TAAAACGTTT TACGATTAGC
CTTGCAGCAC TTACCACGCT TATTTCATTT GCTTTATTAG GATTAAGCTC CACTCCGGCG
GTAGCAAGTT TTGGCTTGAG TGTCAGTTTG GGGGGGCTGA TTAGCTTAGG AATGATTTTG
CGAATAAAGT GA
 
Protein sequence
MRKLLTVYRL IFAGVLCLVV AIFMYHLQTG KWLQTDLHTL LPDSQHYTKI QLEADKHQEQ 
QFNQQVIALV GHSQSEAAFK LAEKVAEQWQ KSGLFQTLSV KNQPNLAELQ QQIELLKLAT
LPISTRNQII QQPERYFQQY AEQIINPFGY QNLLPLEQDW LGFGRFVLSQ SQQQSQIQWH
AETGMLYAVQ QGKTWVLLTG KIVDSDLIKP QQNLTALLKQ NAQFIQEQQG QWLSTGAVIF
ADYSQQQAKY ESTIMGGLGI SLTLLLLLLV FRSLRILWLF LPISVGMVAG ITATISCFGQ
IHILTLVIGT SLVGVLIDFP LHWLTSSLFL SRWRANKAMA KLRLTFFVSL LVTLLGYALL
GFTALPILKQ TALFSGMALI FAVLTTFLYL PLFFRHYQSG KSLFLRRILQ INFHVKINSL
LNKILFVVST GFIVVGLQKS YWQDDIRQWV AMPMELIEQA QKIRQITGID LSNQYLLITA
ENNEQLLQKD RILTEKLQRF AQENNLIKFQ SLSQWIMSKK QQAEFIQQLK NIPAESYSVF
DEIGIPKDMI RHSLKKLEKQ PLVSLEQALN TELGKVWKNL YLGELDRGKV ASIIKVSGLN
NPKILEQIVN NRDIYWQDKP AHLNQLFEQT RNQAAWLKLL SFGLAALLLW RMFGISQTLK
MLSIPLISVV CTVAILGWLN ITISLFAMFG LLLVSVIGID YIAYMQTAKE PLSIKRFTIS
LAALTTLISF ALLGLSSTPA VASFGLSVSL GGLISLGMIL RIK