Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | HS_0205 |
Symbol | imp |
ID | 4239720 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haemophilus somnus 129PT |
Kingdom | Bacteria |
Replicon accession | NC_008309 |
Strand | + |
Start bp | 191691 |
End bp | 194036 |
Gene Length | 2346 bp |
Protein Length | 781 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | 638103741 |
Product | organic solvent tolerance protein |
Protein accession | YP_718412 |
Protein GI | 113460351 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1452] Organic solvent tolerance protein OstA |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.420377 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAAAA ATTATTACAC TGTACTTTCT CTGTCAATTT TGACCGCACT TTATAGCACG TCCAGTCAAG CGAACTTACA ACAGCAATGT TTGATCGGTG TTCCTCATTT TCAAGGTGAA ATCGTTCAAG GCGATCCCAA TGAATTGCCG GTTTATATTG AAGCGGATCA CGCTAAAATG AATCAATCGA CACATGCTCA ATACGAAGGA AATGTTAATG TTAAACAGGG CAACCGTCAT TTAACAGCGG GAATGATTGA AATTGAGCAA CACGGAAAAG ATAATGCGAA ACGTTATGCG TATGCTAAAA ATGGGTTTGA CTACAAAGAT AATTTAATTC AGCTCAATGG TGATAATGCC AAAATTCACC TTGATAGCAA AGATGCCAAT ATCCAAGATG CAGATTATCA ATTGGTTGGA CGACAAGGGC GGGGAACTGC TGATGAAGTT GAACTTCGTG AACATTATCG AGTGATGAAA AATGCAACCT TCACATCTTG TTTGCCTAAT AGCGACGCTT GGTCAATTGA GGCTAAGGAA ATGCGTCAAC ATATTCAAGA AGAATATGCG GAAATGTGGC ATGCTCGTTT TAAAGTATCC GGTATCCCTA TTTTCTACAC GCCTTACCTA CAATTACCTA TCGGTGATCG CCGTCGATCC GGATTACTTA TTCCCAAAGC AGGTATCTCA ACTCGGCATG GTTATTGGTA TGCACAACCG TTTTATTGGA ATATAGCACC AAACTTTGAT GCGACATTTA CCCCTAAATA TATGTCTCAT CGAGGTTGGC AATTAAATGC AGAAACTCGC TATCTGACTC GTATCGGTGA GGGAAAATTT GTCGTTGAAT ACTTAAAAAC AGATCGTCAT TCTGACTCTT TAAATACGGC TCGTTCACGT CATCTCTTTT ATTGGGGACA TAATTCTCAT TTTCTAAAAG ATTGGCGTTT AAATGTAAAT TATACAAAAG TAAGCGATAA ACATTATTTC AATGATTTTG AGTCTGAATA TGGAAACAGT ACAGACGGAT ATGTAGATCA ACAGGCGAGC ATTTCTTACT ATCAACCGAA TTACAACCTT TCTATTTCAG CGAAACAATT CCAAATTTTC GATAAAGTAG ATATTGGACC TTATCGTGCG TTGCCACAAA TTGATTTTAA TTATTATCGC AACGAAATTG CTAATGGCTT AGTTGACTTT AGTTTATTTT CACAAGTAGT TCGTTTTGAT AATGACAGTG CGTTAATGCC AACTGCTTGG CGTTTCCATA TAGAACCGAG TTTGACTTTT CCACTTTCCA ATCGTTACGG CAGTTTAAAT ATTGAAACTA AACTTTACGC AACACGCTAT CTACAAAAAC GAGGTAAAGG AGAAAATGCA GAAGAGATTA AAAAAACGGT TAATCGTGTT TTACCACAAA TCAAGCTGGA TTTTCAAACG GTCTTAGCAA ATAGACAAAG TTTCATTGAG GGTTATACCC AAACTCTTGA GCCAAAATTT CAATATTTGT ACCGCCCTTA TAAAGATCAG TCGGATATTG GTCTAAAACA ACAAAATAAT GATTACTTAG GTTTCGGTTA CGACTCAACT TTATTACAAC AGGATTATTT TTCTTTGTTT CGAGATCGCC GTTATAGCGG TTTAGATCGC ATAGTTTCAG CAAATCAAAT TACTCTTGGT GGAACGACCA GATTTTATGA TAAAAATGCA AATGAACGCT TTAACTTATC TATTGGACAA ATTTATTACC TTAAAGACTC TCGCACAGAT AATAATCCAC AAAATATGGC TCAAGGCAGA TCTTCTTCCT GGTCTTTAGA AAGTAACTGG CGTATCAATA GCAAATGGAA TTGGCGTGGA AGTTATCAAT ATGACACACA TTTAAACCAA ACATCTTTGG CAAATACCGT CTTAGAATAC AATTCGGAGA AAAATAACTT AATCCAACTC AGTTATCGAT ATGTTAACCA GTCTTATATC GATCAAAATT TAATTGGTAA AAATACTTAT GGACAAAGTA TAAAACAACT TGGTATGACA ACAGCTTGGG AGCTAACTGA TCATTGGACA CTGGTTGGTC GCTATTATCA AGATCTCGCA TTGAAAAAGC CGGTTGAACA ATATTTGGGA ATACAATATA ACTCTTGTTG CTGGTCTATA GGTGTTGGAG CAAGACGTTA TGTAACCAAT AGAGCAAATC AACGCAATGA TGAAGTGCTT TATGATAATA GCTTAAGCCT CACTTTTGAG TTACGTGGAT TATCTCCTTC GGATCATAAA AATAATATAG ATGAAATGCT GAAAAAAGGA AAACTGCCTT ATATTAAAGC CTTTAGCCTA TACTAA
|
Protein sequence | MKKNYYTVLS LSILTALYST SSQANLQQQC LIGVPHFQGE IVQGDPNELP VYIEADHAKM NQSTHAQYEG NVNVKQGNRH LTAGMIEIEQ HGKDNAKRYA YAKNGFDYKD NLIQLNGDNA KIHLDSKDAN IQDADYQLVG RQGRGTADEV ELREHYRVMK NATFTSCLPN SDAWSIEAKE MRQHIQEEYA EMWHARFKVS GIPIFYTPYL QLPIGDRRRS GLLIPKAGIS TRHGYWYAQP FYWNIAPNFD ATFTPKYMSH RGWQLNAETR YLTRIGEGKF VVEYLKTDRH SDSLNTARSR HLFYWGHNSH FLKDWRLNVN YTKVSDKHYF NDFESEYGNS TDGYVDQQAS ISYYQPNYNL SISAKQFQIF DKVDIGPYRA LPQIDFNYYR NEIANGLVDF SLFSQVVRFD NDSALMPTAW RFHIEPSLTF PLSNRYGSLN IETKLYATRY LQKRGKGENA EEIKKTVNRV LPQIKLDFQT VLANRQSFIE GYTQTLEPKF QYLYRPYKDQ SDIGLKQQNN DYLGFGYDST LLQQDYFSLF RDRRYSGLDR IVSANQITLG GTTRFYDKNA NERFNLSIGQ IYYLKDSRTD NNPQNMAQGR SSSWSLESNW RINSKWNWRG SYQYDTHLNQ TSLANTVLEY NSEKNNLIQL SYRYVNQSYI DQNLIGKNTY GQSIKQLGMT TAWELTDHWT LVGRYYQDLA LKKPVEQYLG IQYNSCCWSI GVGARRYVTN RANQRNDEVL YDNSLSLTFE LRGLSPSDHK NNIDEMLKKG KLPYIKAFSL Y
|
| |