Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | HS_1044 |
Symbol | |
ID | 4240542 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haemophilus somnus 129PT |
Kingdom | Bacteria |
Replicon accession | NC_008309 |
Strand | + |
Start bp | 1149951 |
End bp | 1153046 |
Gene Length | 3096 bp |
Protein Length | 1031 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | 638104605 |
Product | hypothetical protein |
Protein accession | YP_719256 |
Protein GI | 113461187 |
COG category | [C] Energy production and conversion |
COG ID | [COG0247] Fe-S oxidoreductase [COG0277] FAD/FMN-containing dehydrogenases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTCCCTA GATTAAAAAA CATTCCTCAA CTTGATCCTT TAGTTTCTTC CTATTTGGAA AAATTAAAAA CGCAACATTT TGAAGGTGAT ATAGCGACAA ATTATGCAGA ACGTTTAAGT CTAGCAACCG ATAACAGTGT TTATCAACAA CTTCCACAAG CTATTTTATT TCCTAAAACA ACAAATGATG TGGTATGTCT TACAAAGCTA GCTCAACAAA AAAATTTCCA ATCTCTTACC TTTACGCCAA GAGGTGGCGG TACAGGAACC AATGGTCAAG CAATTAACCA TAATATCATC GTTGATCTTT CTCGTTATAT GACAAACATC TTGGAACTCA ATGTGGAGCA ACGGTGGGTT CGTGTACAAG CCGGTGTTGT TAAAGATCAG CTCAATCAAT TTTTAAAACC CTACGGGTTG TTTTTTTCAC CGGAACTTTC TACCAGTAAT CGAGCTACTA TTGGCGGAAT GATTAATACT GATGCTTCCG GACAAGGCTC TCTAAAATAT GGTAAAACTT CAGATCATGT ACTAGCACTA AAATCTGTCT TAATGAATGG AGAAATTTTA GAAACAAGTG CGGTCAAATC TGATGAATTT TTACAAAATA TTCAGCATTT ATCATCAACA GGACAAAAAC TTCATCAAGA AATCTTTCAA CGCTGTCAAC AAAAACGCTC ACAAATTCTC ACTGATTTGC CTCAATTAAA CCGTTTTTTA ACCGGCTATG ATTTAAAAAA TGTATTTACC GAAGATCAAA GCGAATTTAA TCTTAGTCGT ATTTTAACCG GATCTGAAGG TTCATTAGCT TTTATTTGCG AAGCGGTTTT AGACTTAACA CCTATTCCTC AATACCGCAC TTTAATCAAC ATCAAATATA GCTCTTTCGA TGCAGCATTA CGCAATGCAC CTTTCATGCT GGCGGTTGAA GCATTATCCG TTGAAACCAT TGATAGTAAA GTACTGAATC TAGCCAAACA AGATATTATT TGGCATTCTG TACATGAATT ACTTACAGAA GAAAAAGAAA ACCCAATTTT AGGTTTAAAT ATCGTTGAAT ATGCAGGAAA CGACCTCACC TTAATTCAAA AACAAGTATC TCACTTGTGT CAACTGTTAG ACGATAAAAT AGCTCAACAA CAAGACAATA TTATCGGTTA TCAAGTTTGC TCAGATCTTC CTTCCATCGA ACGAATTTAT GCTATGCGAA AAAAAGCGGT GGGCTTATTA GGTAATAGCA AAGGCAATAA AAAACCTATT CCTTTTGTGG AAGACAGTTG CGTACCGCCT GAAAATCTAG CCAACTATAT TAGTGAGTTT CGTCAATTAT TAGATAAACA TCACCTAGAT TACGGTATGT TTGGACATGT CGATGCAGGT GTTCTACATG TACGCCCCGC TTTAGATTTA TGTAATAAAG AACAAGTTTT GCTATTCAAA ACTATTTCTG ATCAAGTAGC AGATTTAACA AAAAAATATG GTGGCTTAAT TTGGGGAGAA CATGGAAAAG GAATGCGTTC ACAATACGGT GAAAAATTCT TTACTCCGGA ATTATGGCAA GAATTACGCT ACATTAAATT TTTATTCGAT CCGAATAATC GTTTAAATCC GGGTAAAATT TGTACCGCAC TTTATTCTGA ACAAGAACTC TACTCCATTT TGTCGCCAAT GCGGGCAGAT CAAGATCGTC AAATTCCCAT TCAAATGCGA GAGGAATTTT CCGGTGCGAT GAATTGTAAT GGTAACGGAT TATGCTTTAA CTTTGATGTT CACAGTGCCA TGTGTCCGTC TATGAAAGTC AGCAAAAACC GGCTATTTTC GCCCAAAGGG CGTGCAGCAA TAATTCGTGA ATGGTTGCGT TTGATGGCAA ATGAAAATAT CTCGCCTGAA CAGCTGAATT TTCGTAAAGT CGAAGTGAAA TTAACAGATC TTGTCAAAAA AATTCGCAAT ACGGTTGCCC AAAAACAAGG GGAATACGAT TTTTCTCACG AAGTTAAAGA GGCTATGAAT ACTTGTTTAG CCTGTAAGGC TTGTGCTACG CAATGTCCAA TTAAAATTGA TGTGCCCAGC TTTCGTGCCA AGTTTTTCTA TTTTTACCAT AACCGCTATT TACGCCCATT GAAAGATTAT GTGGTTGCAA ATGTAGAAAT GATGGCACCT TTAATGGCAA AAGCACCGAA ATTTTTCAAC TTTTTTACAA CGGCTAAACT CACTCAATCT TTGGCTGAAA ATCTGCTAGG GATGACTGAT TTACCCTCAT TGTCTGTTCC CTCTTTGCAA CAACAGTTAG TTGAAATAAA TTATCAAGGT TATTCATTGG AGCAACTGGA AAATCTAAGT GCGGTAGAAA AACAAAATAT TTTATTGATT GTTCAAGATC CGTTTACCTC TTTTTACGAT GCAAAAGTCG TCGCTGATTT TGTAGCACTC TGTCAAAAAT TAGGTTACAA AGCAATTGTT CTGCCTTTTA AACCTAATGG TAAAGCAATG CACATAAAGG GATTCCTAGC ACGTTTTGCC AAAACTGCAA AAAATCAAGC GGACTTCCTC AATAAGATAA GTAAACTCGG TTTATCTCTG GTCGGTGTTG ATCCTGCCAT TGTTCTTTCT TATCGTGATG AATATAAAGA AATTCTCGGT GATGAAAGAG GAGATTTTAA TGTTATCACT GCTCATGAGT GGTTGAAACA AGAATTAAGT TCAGGGAAAC TTGAGCATAA ACTTACACAA ATTATGCAAA AAAATAACCG CACTTTTAAT AAAGAAAGTC AACAAAAATG GTATTTATTT CCACATTGTA CGGAAAGTAC TACACTACCG AACAGTGCAA AAGAATGGCA GCAAATTTTT TCAGCTTTTG GACAAGAATT ACAAACAAAA AATGTTGGTT GTTGCGGTAT GGCGGGAACG TTTGGGCATG AAATTCAACA CCTAGAGATG TCAAAAGAAA TTTACCATTT ATCCTGGGCT AAAAAATTAC AAGGAAAAAA TCCTGATTAT TGTTTAGCTA CGGGGTATTC TTGTCGCAGT CAAGTTAAAC GTATGCTCCA TTGGCAACCT AAGCATCCTA TTCAAGCCCT ATTATCAATT ATTTAA
|
Protein sequence | MLPRLKNIPQ LDPLVSSYLE KLKTQHFEGD IATNYAERLS LATDNSVYQQ LPQAILFPKT TNDVVCLTKL AQQKNFQSLT FTPRGGGTGT NGQAINHNII VDLSRYMTNI LELNVEQRWV RVQAGVVKDQ LNQFLKPYGL FFSPELSTSN RATIGGMINT DASGQGSLKY GKTSDHVLAL KSVLMNGEIL ETSAVKSDEF LQNIQHLSST GQKLHQEIFQ RCQQKRSQIL TDLPQLNRFL TGYDLKNVFT EDQSEFNLSR ILTGSEGSLA FICEAVLDLT PIPQYRTLIN IKYSSFDAAL RNAPFMLAVE ALSVETIDSK VLNLAKQDII WHSVHELLTE EKENPILGLN IVEYAGNDLT LIQKQVSHLC QLLDDKIAQQ QDNIIGYQVC SDLPSIERIY AMRKKAVGLL GNSKGNKKPI PFVEDSCVPP ENLANYISEF RQLLDKHHLD YGMFGHVDAG VLHVRPALDL CNKEQVLLFK TISDQVADLT KKYGGLIWGE HGKGMRSQYG EKFFTPELWQ ELRYIKFLFD PNNRLNPGKI CTALYSEQEL YSILSPMRAD QDRQIPIQMR EEFSGAMNCN GNGLCFNFDV HSAMCPSMKV SKNRLFSPKG RAAIIREWLR LMANENISPE QLNFRKVEVK LTDLVKKIRN TVAQKQGEYD FSHEVKEAMN TCLACKACAT QCPIKIDVPS FRAKFFYFYH NRYLRPLKDY VVANVEMMAP LMAKAPKFFN FFTTAKLTQS LAENLLGMTD LPSLSVPSLQ QQLVEINYQG YSLEQLENLS AVEKQNILLI VQDPFTSFYD AKVVADFVAL CQKLGYKAIV LPFKPNGKAM HIKGFLARFA KTAKNQADFL NKISKLGLSL VGVDPAIVLS YRDEYKEILG DERGDFNVIT AHEWLKQELS SGKLEHKLTQ IMQKNNRTFN KESQQKWYLF PHCTESTTLP NSAKEWQQIF SAFGQELQTK NVGCCGMAGT FGHEIQHLEM SKEIYHLSWA KKLQGKNPDY CLATGYSCRS QVKRMLHWQP KHPIQALLSI I
|
| |