Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | HS_0121 |
Symbol | |
ID | 4239629 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haemophilus somnus 129PT |
Kingdom | Bacteria |
Replicon accession | NC_008309 |
Strand | + |
Start bp | 107556 |
End bp | 109847 |
Gene Length | 2292 bp |
Protein Length | 763 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 638103650 |
Product | hypothetical protein |
Protein accession | YP_718325 |
Protein GI | 113460267 |
COG category | [R] General function prediction only |
COG ID | [COG4258] Predicted exporter |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.142697 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGCAAAT TATTGACCGT TTACCGCTTA ATTTTTGCAG GGGTTTTATG TCTTGTGGTG GCGATTTTTA TGTATCATTT ACAAACAGGT AAATGGTTAC AAACGGATCT ACATACACTT TTGCCGGACA GTCAACACTA TACAAAAATT CAATTAGAGG CAGATAAACA TCAAGAACAG CAATTTAATC AGCAAGTTAT TGCATTGGTT GGACATTCAC AATCAGAAGC TGCTTTTAAA TTGGCGGAAA AAGTTGCAGA ACAATGGCAA AAAAGTGGGT TATTTCAAAC GTTATCTGTA AAAAATCAAC CGAATTTAGC TGAACTTCAA CAGCAGATTG AGTTATTAAA ATTAGCCACA CTGCCTATTT CAACACGAAA TCAGATTATT CAACAGCCCG AACGTTATTT TCAGCAATAT GCAGAGCAAA TCATTAACCC GTTCGGTTAT CAAAATTTAC TGCCATTGGA ACAAGACTGG CTTGGTTTTG GGCGTTTTGT ATTGTCGCAA TCTCAACAGC AAAGCCAAAT TCAGTGGCAT GCAGAAACAG GTATGCTCTA TGCTGTTCAA CAAGGTAAGA CATGGGTTTT ATTAACAGGC AAAATTGTTG ATTCAGATTT GATCAAACCT CAGCAAAATT TAACCGCACT TCTTAAGCAA AATGCACAAT TTATTCAAGA ACAACAAGGT CAATGGTTAA GTACAGGTGC GGTGATTTTT GCGGATTATT CACAACAACA AGCTAAATAT GAAAGCACGA TCATGGGCGG GTTAGGTATC AGCCTAACCT TGCTTTTACT GTTGCTGGTT TTCCGTAGCT TACGCATATT ATGGTTATTT TTACCGATTT CTGTAGGTAT GGTTGCAGGT ATTACTGCTA CCATTAGTTG CTTTGGGCAG ATTCATATTT TAACGCTCGT GATTGGCACG AGTTTAGTGG GGGTTCTGAT TGATTTCCCA TTGCATTGGC TTACATCTTC TCTATTTTTA AGCCGATGGC GTGCCAATAA AGCGATGGCA AAACTTCGCC TTACTTTTTT TGTCAGCTTA TTGGTGACTT TGCTGGGTTA TGCTTTACTG GGATTTACTG CTTTACCTAT CCTAAAACAG ACCGCACTTT TTTCCGGTAT GGCGTTGATT TTTGCTGTCT TAACGACCTT CTTGTATTTG CCCCTATTTT TCCGACATTA CCAGTCAGGT AAGTCGTTGT TTTTACGCCG AATTTTGCAA ATAAATTTCC ACGTTAAGAT TAATTCATTA TTAAATAAGA TTTTATTTGT GGTTAGCACG GGTTTCATTG TAGTAGGGTT GCAGAAAAGC TATTGGCAAG ATGATATTCG TCAATGGGTT GCTATGCCGA TGGAATTGAT TGAGCAGGCA CAAAAAATTC GTCAAATTAC CGGCATTGAT CTAAGCAATC AATACTTATT AATTACTGCT GAAAATAATG AGCAATTATT GCAAAAAGAC CGAATTCTAA CCGAGAAATT ACAACGGTTC GCACAAGAAA ATAATCTGAT AAAATTTCAG TCATTAAGCC AATGGATTAT GTCAAAAAAG CAACAAGCGG AGTTTATTCA ACAGCTAAAA AATATTCCTG CTGAAAGCTA TAGCGTTTTT GATGAAATTG GTATTCCAAA GGACATGATT CGTCATTCGC TAAAAAAATT GGAAAAACAG CCTCTTGTTA GCTTAGAACA GGCCTTAAAT ACAGAATTGG GAAAAGTTTG GAAAAATCTA TATTTAGGTG AGCTTGATCG AGGAAAAGTA GCAAGTATTA TTAAAGTATC AGGATTGAAT AACCCAAAAA TTCTTGAGCA AATTGTTAAT AATCGGGATA TTTACTGGCA GGATAAACCT GCCCATCTCA ATCAGTTGTT TGAGCAAACC CGCAACCAAG CGGCTTGGTT AAAATTGCTC TCATTTGGCT TGGCAGCTTT ATTGTTGTGG CGAATGTTTG GCATATCGCA AACGTTGAAA ATGCTCAGTA TCCCGCTTAT TTCAGTTGTG TGTACAGTCG CAATTTTAGG TTGGTTAAAT ATCACCATCA GTTTGTTTGC TATGTTTGGG TTATTATTGG TGTCGGTCAT TGGTATTGAT TACATTGCCT ATATGCAAAC AGCGAAAGAG CCGTTATCAA TAAAACGTTT TACGATTAGC CTTGCAGCAC TTACCACGCT TATTTCATTT GCTTTATTAG GATTAAGCTC CACTCCGGCG GTAGCAAGTT TTGGCTTGAG TGTCAGTTTG GGGGGGCTGA TTAGCTTAGG AATGATTTTG CGAATAAAGT GA
|
Protein sequence | MRKLLTVYRL IFAGVLCLVV AIFMYHLQTG KWLQTDLHTL LPDSQHYTKI QLEADKHQEQ QFNQQVIALV GHSQSEAAFK LAEKVAEQWQ KSGLFQTLSV KNQPNLAELQ QQIELLKLAT LPISTRNQII QQPERYFQQY AEQIINPFGY QNLLPLEQDW LGFGRFVLSQ SQQQSQIQWH AETGMLYAVQ QGKTWVLLTG KIVDSDLIKP QQNLTALLKQ NAQFIQEQQG QWLSTGAVIF ADYSQQQAKY ESTIMGGLGI SLTLLLLLLV FRSLRILWLF LPISVGMVAG ITATISCFGQ IHILTLVIGT SLVGVLIDFP LHWLTSSLFL SRWRANKAMA KLRLTFFVSL LVTLLGYALL GFTALPILKQ TALFSGMALI FAVLTTFLYL PLFFRHYQSG KSLFLRRILQ INFHVKINSL LNKILFVVST GFIVVGLQKS YWQDDIRQWV AMPMELIEQA QKIRQITGID LSNQYLLITA ENNEQLLQKD RILTEKLQRF AQENNLIKFQ SLSQWIMSKK QQAEFIQQLK NIPAESYSVF DEIGIPKDMI RHSLKKLEKQ PLVSLEQALN TELGKVWKNL YLGELDRGKV ASIIKVSGLN NPKILEQIVN NRDIYWQDKP AHLNQLFEQT RNQAAWLKLL SFGLAALLLW RMFGISQTLK MLSIPLISVV CTVAILGWLN ITISLFAMFG LLLVSVIGID YIAYMQTAKE PLSIKRFTIS LAALTTLISF ALLGLSSTPA VASFGLSVSL GGLISLGMIL RIK
|
| |