Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | HS_1774 |
Symbol | |
ID | 4241308 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haemophilus somnus 129PT |
Kingdom | Bacteria |
Replicon accession | NC_008309 |
Strand | - |
Start bp | 1993321 |
End bp | 1995321 |
Gene Length | 2001 bp |
Protein Length | 666 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 638105367 |
Product | hypothetical protein |
Protein accession | YP_719979 |
Protein GI | 113461910 |
COG category | [R] General function prediction only |
COG ID | [COG3772] Phage-related lysozyme (muraminidase) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0983055 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAACGAC TCAATAAAGT GAGTATTGCT ATACTTTTCG CACTCACTAC TTCCCCCATT TATGCAGAGA CTTCTCTGTT AGATAACCTT CAATTCAACA AATCAGACTC TGTGTTATCC GATCAAATTT ACTATCAAAT CGGTGGGGGC TCAGGCTATA TGACACCACC GACACGGGCA AAATCCCTCA AAGCCATTGA ATTTGGGATT GGTTGGAAAG CCAATTTAAT GTGCGGTAAT TTTGACATTA AAACCACCGT AAAAAATCAA TTAAATGGTA TCACGGAGGG ATTTAAAGAT TTAATGAGCA ATGTGATAGA ATCGGCAAAA GGTGCGGTAG CCAGTCTTCC TGCTATGGTT ATTCAACGTG CTAACCCGCA ACTTTATGAC ATTTTGACTA ATGGAGTGTA TCAGGGGCGG CTTGATTTTA ATCGTCTAAA AACCAGTTGT GAGGCTATGT CGAAACAACT CGCTGACTAT ACGCTAAATG GGCGTTGGGC AAAATCGGCG GATTTGGAAA ACTATAAAGA CATTGTGGCC ACTGAACCCG ATGCACAAAA AGCACAGAAA AAATCGGAAG ATAACAAAGG AAAAAATGGT AAGGAATGGA TTGGCGGTCA AAAACGTGGC GGTGAAAAGC AAGAACAAAT CAAACTGATT AAAGATGTTT CAGAAGCAGG TTACAACCTC TTACATAACC GCAACGCCTT GGACAAAAAA CCACTTCTAG GGCCATCTTG TACCGGGGCA ACCTGCCAAA TATTCGATTC GCCACAAAAA CTCTCCGACT TTTTAACCCG AACTATTGGA GAACAATCTA TTTCTACCTG TATTGAAGAC TGTGGTCCGA AAACCAGCTC AAAAGCGGGG GTCGGACTCG CTCCTGCGAT TGAAGAAGAA AATCTGCGAA CCGTAGAGAA ACTTGAACAA GTGCTGAATA TGGACACGCC AAGTAAAGAA ATACTGGCTG AACTGAGTTC GAATACCATT CCTGTTACAC GGGGACTGAT TGAAGCTTTA CGTGAAGATC CTGACGTGGA ACTGCTTAGC CAACGTTTGG GTGCCGAAAT CGCTACAGCT AAAGTCATGG AAAAAATGCT ATTGGCTCGC CGAGCAATAC TTGCCGGAAT GCGAGAACCC TATGTGGCTG CCGATAAAAA TGCACAGGAG GAGCTGGAAA AAGCTTTAAA TAAAATTGAC TTGGAAATCT CTCAAGTCAA ACTTGAAATG GATATGCAAA AAATGCTCAC CAGTAATACA GCATCTGTCG TGATTCAAAA TAAACTCAAC CGTGAAGCCA ATGTGGGGAA CCATGGAGAA AGCTATGACA ACGTGGATAA ACGAGTCAAT GACTTAGCTT ACGGCTCACC AAATGCAGAA GAGCATATGG AAGCGAGCGA TATTTCCTTA CCTAGCCGAA ACATCGCCCT AGATATTCCA ACAGTGAGTA ATACTGCTCC TTATAGACCG TCTTATAATA CAGGCACTCA AGTGGGAAGG TATTCGGGCA GCTATAATCC CATTGCACCA ATTAATGGTT CTTCACTCGA CCAAGCTACG GGATTATTAA GAAAATTTGA AGGATTTATT AGCAAAGCGG ATTGGGATGT AAACGCCCAT CGTGTAGGCT ACGGTTCAGA TACCATTACT AAAGCTGACG GCACGGTGAT CAAAGTACAA CCCGGAATGA CCGTAACCAG AGAAGATGCT GAGCGTGATT TAGCTCGAAG AACCCAACTC TACACGAACC AAATCAAACG AGAGATTTCA GAACAAACGT GGAATGGTTT ATCTGATCGT GCTCAAGCTG TATTAACCTC TTATATTTAT AACTACGGCA CGTTAAATAA GACCAAAAGT GTCATTAGTG CTGCACAAGC CTCAGCACAA TCAGGAGATA TGACCGCATT GGCCAATGCA ATTAGAAGAC GTCAAGTGGA CAATAAAGGC GTCAATGCAA GACGCCGAAA TCAAGAAGCC GATTACATTC TAGGCAAATA G
|
Protein sequence | MKRLNKVSIA ILFALTTSPI YAETSLLDNL QFNKSDSVLS DQIYYQIGGG SGYMTPPTRA KSLKAIEFGI GWKANLMCGN FDIKTTVKNQ LNGITEGFKD LMSNVIESAK GAVASLPAMV IQRANPQLYD ILTNGVYQGR LDFNRLKTSC EAMSKQLADY TLNGRWAKSA DLENYKDIVA TEPDAQKAQK KSEDNKGKNG KEWIGGQKRG GEKQEQIKLI KDVSEAGYNL LHNRNALDKK PLLGPSCTGA TCQIFDSPQK LSDFLTRTIG EQSISTCIED CGPKTSSKAG VGLAPAIEEE NLRTVEKLEQ VLNMDTPSKE ILAELSSNTI PVTRGLIEAL REDPDVELLS QRLGAEIATA KVMEKMLLAR RAILAGMREP YVAADKNAQE ELEKALNKID LEISQVKLEM DMQKMLTSNT ASVVIQNKLN REANVGNHGE SYDNVDKRVN DLAYGSPNAE EHMEASDISL PSRNIALDIP TVSNTAPYRP SYNTGTQVGR YSGSYNPIAP INGSSLDQAT GLLRKFEGFI SKADWDVNAH RVGYGSDTIT KADGTVIKVQ PGMTVTREDA ERDLARRTQL YTNQIKREIS EQTWNGLSDR AQAVLTSYIY NYGTLNKTKS VISAAQASAQ SGDMTALANA IRRRQVDNKG VNARRRNQEA DYILGK
|
| |