Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | HS_0892 |
Symbol | pqqL |
ID | 4240384 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haemophilus somnus 129PT |
Kingdom | Bacteria |
Replicon accession | NC_008309 |
Strand | - |
Start bp | 978062 |
End bp | 980845 |
Gene Length | 2784 bp |
Protein Length | 927 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 638104447 |
Product | zinc protease |
Protein accession | YP_719102 |
Protein GI | 113461035 |
COG category | [R] General function prediction only |
COG ID | [COG0612] Predicted Zn-dependent peptidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.283708 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTAAAAA AAATATTGCT AATTTTAACC GCTCTTTTGG GGATTGGTGC TTGCCAATCC TTATTTCATG AGAAAGATAC GTTACCTTTT CATTCGCAAG CCTACCAAGG ACAACTGGAA AATGGCTTGC GTTATGTCAT TTTACCTAAT CATTTTCCGC AAAATCGTGT GTATATGCGT TTGGTAGTTA ATGCCGGTTC AATGCACGAA GAGGATGATC AAAAAGGGGT TGCCCATATT GTTGAACATA TGGCATTTAA TGGCTCGCAA CAATATCCGC AAAATCAAAT TATTAATGCA CTTGAGAAAT TAGGAATGAA GTTTGCACGT GATATTAATG CGTTTACCGA TTTTGAAAAT ACGGTTTATA CATTAAATAT CGCTAAAAAT GATCTGCAAT CTCTTAGCTT GGCATTCAAT GTTATTGATC AATGGCTAAA CCATCTTACC ATTTTACCAG CGGATTTAGA GGCGGAACGT GGAATTGTGT TGGAAGAATG GCGTTCACGC TTAAGCCCTA TGTTGCGTTT AGGGGATAAA AAAAGCCAAA TAGAAATGGC GGGGTCACGT TATGTAGAAC GAGATCCGAT TGGTGATGTC AACGTCATTA AACATGTTTC GGCACAGCGT GTAAAAGATT TTTATCGAAA ATGGTATCGT CCGGATAATG TCAGTCTTAT TGTGGTAGGC GATGTGAATC CAGTAAAAAT TAAGTCTTTA ATTCAACAGA AATTAGGGAG TAGCCATCCG CTACAACACC AACCTTTACC CGAGATTGAT TTTGATATTC CTTTACCGCC TAAATGGCGT TTAGCGACCG TGTCTGAGAA AGAAATGCGT GAACCGGGAT TGGACTTGAG TTTTTTTAAA CCGGCGGAAA ATATCAATAC GGTAGCACAA TACCGAGAGA ATTTATGGCA ACAAATTGTG GTACGTTTGC TTAATTTACG TTTGCAACAA TGGGAAACAT ATTTACAGCA GTCTTCAAGT GCGGTCGTAA AATCTGCTAA TTTTTATCAT GATTATTTAG GTAAGCAAAC GTTACAATCC ACATTTTCTT TACAACTTGA GAATGAGGAT TATCAAAAAG CGACAGAGCA ACTCTTTAAC TTTATTGCGG AAATTGCTCA AAACGGGTTT ACGCAGACGG AATTGGATGA TGAACTTCAG CGTTTGCGTA AAATAAATAC TAAACAACGA CATTTGGAAG TGAGTAGTAT CAAAATTGCT GATGAATTAG TTGTTACCAT GGCAAATGAT CAAGTGCTAC TCTCTCCGCA AGATCAATAT GAATTAAATC ATCAATTGTT CAATCAACTG ACTTTGGCAG ATATTAACCG CACTTTTCAA CAAATGTTGC AGTTAAAGTC CAAGTTGGTG CTGATTACTC AGCCAAAACC GCACAAATTA TCGTTTGATA GTCAATGGTT GGCACAAAAA TGGCAACAGG CTTTACAGCA ATCGCAATCT TCTTGGCAAC AACAAGATAA TAGGGTAGTT CAGCTTCCGC ATTTAGATTT AGTTGCGGGC AATGCCACTA AGCAAAAAGT ATGGCGTGGA CAAAAAATTA CTGAATATTT GTTAAGTAAC GGCAGTAAAT TGATTTATAT GTATAGTGAT AAAAGTCCGC AACAAGTTTA TTTTAAAGCC TTGACAGCAG GAGGATTACG TTCTGTGCCA AGATCACATT ATCATGCGTT AAGAGCGGCG ATTTCTGTGG TCGATGATAC TGGAATTGGC ATTGTACCTC AGGCTGATAT TGATCAGTAT TTCAGTCACG CTCCGATAGC TTTTACGACG GTTATTGATG AGGCTGATCA GGGTTTTACA GCGGCAGGTA AAACAGAAAG TTTAGCGGAT ATTCTGCGAT TATTTAGGTT AAAACTACAA ACCAGCCCTG TTTCAGATAA AGTCTTAGCC GAATATCGCC AAACTTTACA GGATGAAGTG AATGAAAAAA GTCCGGAAAA GACGTTTATG CAGAAAGTAG AGCAATTGCG TTTTCCACAA CAAGAAACGC TATATGGTGC AAATCGTTTC AGTAATCTGC ATTTAACGGC AGATAAACTT TCCGCAATTT ATCAGCAATA TATTACGGAT AAAACAGATT TCACTTATTT TATTGTTGGT GATATTAGCG AAAGTGCGGT ACAAAATTTG GCTGAAAAAT ATTTAGCCAA TGTACCGGTT AAAACACAAA ATAGAGTATT ACAGCCGATA AAGGCACATG TACCCGAGCA GCGTTTAGTT GTTAAAGGCT TGCATGAGCC AAGAGCAGAA GTTGAAATGT ATTTCGCAGC TGATCATCAA TGGCAAGTGG AAAACAAATA TTTACTGGAT ATTTTAGCGG ATATCATACA AGAGAAATTA CGGTTGAGTT TAAGAGAACA GGCATCGGGT ATCTATGCAG TGCATGCGTG GTTTGAACAA GAACATTTTT CACCTCAAAT TGAGGGGAAA ATAGAGTTTA GTTGTGATCC TATGCGTGTA CAGGAATTAA CGCAGATGAC TCATCATGTA TTAGATCAAA TACTTAAACA AGGCATTGAG CCTGAATTGT TGGCTAAAAA AGTGAGTGAA AAACAATCTC AGTTAAAACA GGCAAAGGAA TCATTGTTAG CCATTTTATC TCAATTGGAA CAAAGTTATT CTTTGAGTGA TGGTCCTTTG CTTATGTTTG CTGAGCAACA GTTATTACAG CAAGTAACTC AACAAAATAT AGAGGCTTTA GCCCATAAAA TTTTGCCATC TCAATTTAGA TTTGAAGCAA TATTGACACA GTAA
|
Protein sequence | MLKKILLILT ALLGIGACQS LFHEKDTLPF HSQAYQGQLE NGLRYVILPN HFPQNRVYMR LVVNAGSMHE EDDQKGVAHI VEHMAFNGSQ QYPQNQIINA LEKLGMKFAR DINAFTDFEN TVYTLNIAKN DLQSLSLAFN VIDQWLNHLT ILPADLEAER GIVLEEWRSR LSPMLRLGDK KSQIEMAGSR YVERDPIGDV NVIKHVSAQR VKDFYRKWYR PDNVSLIVVG DVNPVKIKSL IQQKLGSSHP LQHQPLPEID FDIPLPPKWR LATVSEKEMR EPGLDLSFFK PAENINTVAQ YRENLWQQIV VRLLNLRLQQ WETYLQQSSS AVVKSANFYH DYLGKQTLQS TFSLQLENED YQKATEQLFN FIAEIAQNGF TQTELDDELQ RLRKINTKQR HLEVSSIKIA DELVVTMAND QVLLSPQDQY ELNHQLFNQL TLADINRTFQ QMLQLKSKLV LITQPKPHKL SFDSQWLAQK WQQALQQSQS SWQQQDNRVV QLPHLDLVAG NATKQKVWRG QKITEYLLSN GSKLIYMYSD KSPQQVYFKA LTAGGLRSVP RSHYHALRAA ISVVDDTGIG IVPQADIDQY FSHAPIAFTT VIDEADQGFT AAGKTESLAD ILRLFRLKLQ TSPVSDKVLA EYRQTLQDEV NEKSPEKTFM QKVEQLRFPQ QETLYGANRF SNLHLTADKL SAIYQQYITD KTDFTYFIVG DISESAVQNL AEKYLANVPV KTQNRVLQPI KAHVPEQRLV VKGLHEPRAE VEMYFAADHQ WQVENKYLLD ILADIIQEKL RLSLREQASG IYAVHAWFEQ EHFSPQIEGK IEFSCDPMRV QELTQMTHHV LDQILKQGIE PELLAKKVSE KQSQLKQAKE SLLAILSQLE QSYSLSDGPL LMFAEQQLLQ QVTQQNIEAL AHKILPSQFR FEAILTQ
|
| |