Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | HS_1036 |
Symbol | |
ID | 4240534 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haemophilus somnus 129PT |
Kingdom | Bacteria |
Replicon accession | NC_008309 |
Strand | - |
Start bp | 1142526 |
End bp | 1143902 |
Gene Length | 1377 bp |
Protein Length | 458 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 638104597 |
Product | metalloprotease |
Protein accession | YP_719248 |
Protein GI | 113461179 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0739] Membrane proteins related to metalloendopeptidases |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.0193125 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTGCAGC ACATTAAATT AGCTAGAGAT AGACGCAAAA AGAAATTCTA CATAAAAGTT GCCGTTTTTT TTGTAGCTAT TTTATTTATT CTAATGGGTA TAGGGTTGTT CTTGAGAAAC GATGTTTCTT CTCAGAAGCC TACATCTGTT GATATGCAAA TTGCAAATGA AAAAAATAAC AGTGAATTGT CTTATGATGA TTTAGACGGT TTAGATGATG AAACGGATCA AGTTAATTTG GATAATGAAA TATCATCATT GCCTGATAAT GCCAAAGATG CGTTAAATAG TTTATTAGAT GCAGCGGATC AGGCAATGCG GATTAAAGAT CAATTTTCTC ATACTGTTGT ACGTGGCGAT AAATTGAAAG ATGTATTGGA ACATTCAGGG TTGGATGAGG AAATAAGTCG TCAAATGATA GCGAATTATC CGGAATTAAA AAATCTGAAA GCTGGCCAAC AAATTTATTG GATATTGGAT AATGATGGAA ATTTGGAATA TTTAAATTGG CTTGTTTCCG AACGTGAAGA ACGTATTTAT GAGCGAGTTA ATGAATCTCA GTTTAAACGT CAAATTTTAG AAAAGAAAAG TGTTTGGACA GAAGAGGTTC TAAAAGGACA GATAGAAGGA TCATTTTATG CCAGTTTGAA AGCCTTGGGC TTGAGCAGTA AACAAATTGC TCAATTAACT ACCGCACTTC AGTGGCAGGT AAGTCTTAAT AAGTTAAAAA AAGGGGATAA GTTTGCCGTT TTAGTTTCGA GAGAGTATTT AGATAATAAA TTGACTGGGC AAGGCAAGGT CGAAGCCATA CATATTATGT CTGGTGGAAA AAGTTATTAC GCAATCCAAG CAAATAATGG ACGCTATTAT AGTCGTCAGG GAGAAACTTT AGGGAAAGGT TTCGCACGTT ATCCTTTATT ACGTCAAGCT AGGGTGTCTT CTCCTTTTAA TCTTGCTCGC CGTCATCCCG TTACAGGAAA AATACGACCG CACAAAGGGG TTGATTTTGC TGTACCTGTA GGCACGACTA TCATCGCTCC GGCAGATGGA GTAGTAGAGA AAGTCGCTTA TCAGGCTAAC GGTGCCGGAC GTTATATGAT GATTAGACAT GGCAAAGAAT ATCAGACAGT CTATATGCAT TTAAGTCGCT CATTAGTCAA ACCGGGGCAA TCGGTAAAAA GAGGACAACG TATAGCATTA TCGGGAAATA CCGGACGTTC AACCGGTGCT CATTTACATT ACGAATTTCA TATTAACGGT AGACCTGTTA ACCCGTTAAC AGTGAAATTA CCGGGAACAA GTAATCAAAT GGCAAGTCAT GAAAGAAAAG AATTCTTAGT TAAAGCAAAA AAAATGGAAA ACCTGCTTAA ATTTTAG
|
Protein sequence | MVQHIKLARD RRKKKFYIKV AVFFVAILFI LMGIGLFLRN DVSSQKPTSV DMQIANEKNN SELSYDDLDG LDDETDQVNL DNEISSLPDN AKDALNSLLD AADQAMRIKD QFSHTVVRGD KLKDVLEHSG LDEEISRQMI ANYPELKNLK AGQQIYWILD NDGNLEYLNW LVSEREERIY ERVNESQFKR QILEKKSVWT EEVLKGQIEG SFYASLKALG LSSKQIAQLT TALQWQVSLN KLKKGDKFAV LVSREYLDNK LTGQGKVEAI HIMSGGKSYY AIQANNGRYY SRQGETLGKG FARYPLLRQA RVSSPFNLAR RHPVTGKIRP HKGVDFAVPV GTTIIAPADG VVEKVAYQAN GAGRYMMIRH GKEYQTVYMH LSRSLVKPGQ SVKRGQRIAL SGNTGRSTGA HLHYEFHING RPVNPLTVKL PGTSNQMASH ERKEFLVKAK KMENLLKF
|
| |