Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | HS_1764 |
Symbol | |
ID | 4241298 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haemophilus somnus 129PT |
Kingdom | Bacteria |
Replicon accession | NC_008309 |
Strand | + |
Start bp | 1985841 |
End bp | 1988657 |
Gene Length | 2817 bp |
Protein Length | 938 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 638105357 |
Product | hypothetical protein |
Protein accession | YP_719969 |
Protein GI | 113461900 |
COG category | [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG3451] Type IV secretory pathway, VirB4 components |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGCGTAT TTTCCGATGT ATTGGCTTCC TTTTTTCCTG ATAAAAAGGG GGCTTCTCGT ACAGAAAAAG ACGAGGACGT AGAAAAAAAA CTCGACAAAG AAGCCTATTC AGCTGAGCTT TCAGTGCATC ACACCGTGAA ACGTAGTGGA AAATTAACCC AATCGGCACA TCATCAAAGA TTATACAGTA AACACCCCTC TTTTGTGGAT TATCTGCCGT GGGGAGAATT TTTAGAGCAA GAAGGTGTAA TGTTATTAGA TGATGTACGC AGTGTCGGTG TAGTGTTTGA TGTCAAACCT ATCGGTACAG AAGGACGTGG CGAGGACTAT TTAAGTCGTG TACGCAGCCT TGTTAAAGAC GCACTTCAAG ACAGTTTTGA TGAATTAGAT AATTATCCCT ACGTTGTACA ATTTTATTGT CAAGATGAAC AAGATTTACG TCCTTATGTG GAAAGACTGA AAGCCTATAT CGCTCCTGAT ATTGCACAAA GTGCGTTCAC GCAAGAATGG TTGAAAAACA CGGCAAAACA TTTAAACGAT ATCAGCAAAC CTGGGGGCTT ATTTGTTGAT GACCGTATTA CCAACACTAT TTGGTCCGGA CGTATTCGCC GTACACGAAT GGTGATTTAT CGCTATGTAG GCAAAAATGA AGAGCAAAGT GCGGTAGAAA GTTTAAATAA TGCCTGCGAG CGTGTGGTTG GGGCATTAAC GACCGCAGGT CTTCATTTAA CGCGTCAAAC TGGCGAGCAA ATTCATCAAT GGCTTTTACG TTGGTTTAAC CCTAATCCGG CGATGTCTTT TGATCTCGAT AATGGCAGTT GCTATGACAA ACTGCATCAA CCAATAAGCG ATGATCTACC GTTTATTGGG CAAGATTTTT CTGAAAATCT CTTTTATAAC CAGCCTGAAA GTAAAAAGGA TTATTGGTAT TTTGATGGGT TGCCCCATGA AGTAGTGGTT GTCGATAACT TAAGAAATAC CCCCGCTATT GGGCATTTTA CCGGTGAAGT ACGACGTGGC AAGAATATTA ATGCCTTATT TGATTTATTG CCTGAATCGA CCATTATGGT GACAACCATT GTGGTATATC CTCAAGACAA ATTAGAGGCA CATCTTGAAA AAATCAGCTC TCGTGCGGTA GGTGAAAATT ATGAATCTAT TTATCTGAAA CAAGATGTGA AAACCGTGAT GGGATACCTT AAAGATGGCC ACAAACTCTA TCAAGCCGGT TTGGCGTTTT ATATTCACGG TAAAGATGAA AAAGAACTCA AACGACGGTC TCGTGAACTT AGAACCATTT TACTGAGTAA TAATCTTGTT CCCGTCAAAG AAAATGCCGA AATTGCCCCG TTGAACAGTT ATCTACGTTG GTTACCGATG AATTTCAATC CGCAGTTGGA TATGAAAACA CGTTACTATA CCAAGTATTA TTTTGTGCAA CACCTTGCCA ATATGTTGCC TGTCTTTGGG CGTGAGACAG GGACGGGACA CCCAGGCATT ACCTATTTTA ATCGTGGTGG TTCACCCTTG GATTTTGATC CCATTAACCC AAAAGACAGA GCCAAAAATG CCCATAAATT GCTGTTAGGA CCAACAGGTT CGGGAAAATC AGCCACACTG AATGCACAAA TGGCACAACT GATGGCTGTA CATCGGCCAC GTTTATTCGT GGTTGAGGCA GGCAACTCCT TTGGGTTATT TGCCGATTAC GCTCAACGGT ATGGTTTAAC GGTAAACCGT ATTTCCCTTG AGCCGAGCAG TGATATTGCC TTACCGCTTT TTTCCGAAGC CCATAAACTT CTCGAAATGA ACGTAGATGT TGAGGCGGTC ATTGATGACG AAGAGGACGA CGAAGAAGGC AGTGAGCAAC GTGATTTATT GGCTGAAATG GAATTGACCA CACGCTTGAT GATCACAGGG GGCGAGCTAA AAGAAGATGA TAAAATGAGC CGTGCCGACC GTGCCCAAAT CCGTCGTGCC ATTATGATGG CAGCAGAAAA AACCTATGCT GAGGGACGAC AAACCTTAGC TGGTGATGTT CGAAACGCCT TAGAAACCTT AAGCCAACAC CTCGATACAC GAGAAGAACG ACGTGCAAGG TTGGCTGAAA TGGCGGAAGC GATGGATATG TTTTGTACCG GAATCAATGG GAAGTTTTTT AACCGTGAAG GGGAAATTTG GCCGGAGGCA GATATTACGC TTGTAGATTT AGCCATGTTT GCTCGTGAAG GCTATGAAGC AGAACTTTCC ATTGCCTATA TTTCCTTAAT CAACCATATC AACAGTCTTG GAGAAAAATA CCAACACAGT TTTCGCCCTA TTGTCAATAT TACTGATGAG TCTCACATTA TTACTGTCAA TCCTCTACTT GCCAAATTCT TGGTGAAAGG GTCAAAAATG TGGCGAAAAC TTGGAATTTG GTTGTGGCTT GCTACGCAAA ATATGGAGGA TTTTCCGAAA GAGGCGGCGA AGTTGCTCAG TATGATTGAA TGGTGGGAAT TACTTAATCT GACCAATGAT GAAATCGAAG AAGTGAACCG TTTTCGTCGA TTGTCTGAAG CACAAAAATT GATGCTACTT TCTGCGAAAA AAGCTGATAA GAAATATACC GAAGGTGTGG TGTTGGCGAC CAATATGGAA GCCTTGTTCC GTGTTGTGCC ACCGAGTATT TACTTGGCAC TGGGTATGAC AGAAAAACAC GAAAAAGCAC AGCGTAGAGC TATTATGGCT GAGCATCAGT GCAGTGAACT TGATGCTGCC TTGATAATGG CAAAACAGAT TGATAAAGCG AGAGGAATTA TTTATGAGCC AGCCTAA
|
Protein sequence | MGVFSDVLAS FFPDKKGASR TEKDEDVEKK LDKEAYSAEL SVHHTVKRSG KLTQSAHHQR LYSKHPSFVD YLPWGEFLEQ EGVMLLDDVR SVGVVFDVKP IGTEGRGEDY LSRVRSLVKD ALQDSFDELD NYPYVVQFYC QDEQDLRPYV ERLKAYIAPD IAQSAFTQEW LKNTAKHLND ISKPGGLFVD DRITNTIWSG RIRRTRMVIY RYVGKNEEQS AVESLNNACE RVVGALTTAG LHLTRQTGEQ IHQWLLRWFN PNPAMSFDLD NGSCYDKLHQ PISDDLPFIG QDFSENLFYN QPESKKDYWY FDGLPHEVVV VDNLRNTPAI GHFTGEVRRG KNINALFDLL PESTIMVTTI VVYPQDKLEA HLEKISSRAV GENYESIYLK QDVKTVMGYL KDGHKLYQAG LAFYIHGKDE KELKRRSREL RTILLSNNLV PVKENAEIAP LNSYLRWLPM NFNPQLDMKT RYYTKYYFVQ HLANMLPVFG RETGTGHPGI TYFNRGGSPL DFDPINPKDR AKNAHKLLLG PTGSGKSATL NAQMAQLMAV HRPRLFVVEA GNSFGLFADY AQRYGLTVNR ISLEPSSDIA LPLFSEAHKL LEMNVDVEAV IDDEEDDEEG SEQRDLLAEM ELTTRLMITG GELKEDDKMS RADRAQIRRA IMMAAEKTYA EGRQTLAGDV RNALETLSQH LDTREERRAR LAEMAEAMDM FCTGINGKFF NREGEIWPEA DITLVDLAMF AREGYEAELS IAYISLINHI NSLGEKYQHS FRPIVNITDE SHIITVNPLL AKFLVKGSKM WRKLGIWLWL ATQNMEDFPK EAAKLLSMIE WWELLNLTND EIEEVNRFRR LSEAQKLMLL SAKKADKKYT EGVVLATNME ALFRVVPPSI YLALGMTEKH EKAQRRAIMA EHQCSELDAA LIMAKQIDKA RGIIYEPA
|
| |