Gene HS_1036 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHS_1036 
Symbol 
ID4240534 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaemophilus somnus 129PT 
KingdomBacteria 
Replicon accessionNC_008309 
Strand
Start bp1142526 
End bp1143902 
Gene Length1377 bp 
Protein Length458 aa 
Translation table11 
GC content37% 
IMG OID638104597 
Productmetalloprotease 
Protein accessionYP_719248 
Protein GI113461179 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0739] Membrane proteins related to metalloendopeptidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0193125 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTGCAGC ACATTAAATT AGCTAGAGAT AGACGCAAAA AGAAATTCTA CATAAAAGTT 
GCCGTTTTTT TTGTAGCTAT TTTATTTATT CTAATGGGTA TAGGGTTGTT CTTGAGAAAC
GATGTTTCTT CTCAGAAGCC TACATCTGTT GATATGCAAA TTGCAAATGA AAAAAATAAC
AGTGAATTGT CTTATGATGA TTTAGACGGT TTAGATGATG AAACGGATCA AGTTAATTTG
GATAATGAAA TATCATCATT GCCTGATAAT GCCAAAGATG CGTTAAATAG TTTATTAGAT
GCAGCGGATC AGGCAATGCG GATTAAAGAT CAATTTTCTC ATACTGTTGT ACGTGGCGAT
AAATTGAAAG ATGTATTGGA ACATTCAGGG TTGGATGAGG AAATAAGTCG TCAAATGATA
GCGAATTATC CGGAATTAAA AAATCTGAAA GCTGGCCAAC AAATTTATTG GATATTGGAT
AATGATGGAA ATTTGGAATA TTTAAATTGG CTTGTTTCCG AACGTGAAGA ACGTATTTAT
GAGCGAGTTA ATGAATCTCA GTTTAAACGT CAAATTTTAG AAAAGAAAAG TGTTTGGACA
GAAGAGGTTC TAAAAGGACA GATAGAAGGA TCATTTTATG CCAGTTTGAA AGCCTTGGGC
TTGAGCAGTA AACAAATTGC TCAATTAACT ACCGCACTTC AGTGGCAGGT AAGTCTTAAT
AAGTTAAAAA AAGGGGATAA GTTTGCCGTT TTAGTTTCGA GAGAGTATTT AGATAATAAA
TTGACTGGGC AAGGCAAGGT CGAAGCCATA CATATTATGT CTGGTGGAAA AAGTTATTAC
GCAATCCAAG CAAATAATGG ACGCTATTAT AGTCGTCAGG GAGAAACTTT AGGGAAAGGT
TTCGCACGTT ATCCTTTATT ACGTCAAGCT AGGGTGTCTT CTCCTTTTAA TCTTGCTCGC
CGTCATCCCG TTACAGGAAA AATACGACCG CACAAAGGGG TTGATTTTGC TGTACCTGTA
GGCACGACTA TCATCGCTCC GGCAGATGGA GTAGTAGAGA AAGTCGCTTA TCAGGCTAAC
GGTGCCGGAC GTTATATGAT GATTAGACAT GGCAAAGAAT ATCAGACAGT CTATATGCAT
TTAAGTCGCT CATTAGTCAA ACCGGGGCAA TCGGTAAAAA GAGGACAACG TATAGCATTA
TCGGGAAATA CCGGACGTTC AACCGGTGCT CATTTACATT ACGAATTTCA TATTAACGGT
AGACCTGTTA ACCCGTTAAC AGTGAAATTA CCGGGAACAA GTAATCAAAT GGCAAGTCAT
GAAAGAAAAG AATTCTTAGT TAAAGCAAAA AAAATGGAAA ACCTGCTTAA ATTTTAG
 
Protein sequence
MVQHIKLARD RRKKKFYIKV AVFFVAILFI LMGIGLFLRN DVSSQKPTSV DMQIANEKNN 
SELSYDDLDG LDDETDQVNL DNEISSLPDN AKDALNSLLD AADQAMRIKD QFSHTVVRGD
KLKDVLEHSG LDEEISRQMI ANYPELKNLK AGQQIYWILD NDGNLEYLNW LVSEREERIY
ERVNESQFKR QILEKKSVWT EEVLKGQIEG SFYASLKALG LSSKQIAQLT TALQWQVSLN
KLKKGDKFAV LVSREYLDNK LTGQGKVEAI HIMSGGKSYY AIQANNGRYY SRQGETLGKG
FARYPLLRQA RVSSPFNLAR RHPVTGKIRP HKGVDFAVPV GTTIIAPADG VVEKVAYQAN
GAGRYMMIRH GKEYQTVYMH LSRSLVKPGQ SVKRGQRIAL SGNTGRSTGA HLHYEFHING
RPVNPLTVKL PGTSNQMASH ERKEFLVKAK KMENLLKF