Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | HS_0791 |
Symbol | prlC |
ID | 4240282 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haemophilus somnus 129PT |
Kingdom | Bacteria |
Replicon accession | NC_008309 |
Strand | - |
Start bp | 858994 |
End bp | 861033 |
Gene Length | 2040 bp |
Protein Length | 679 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 638104345 |
Product | oligopeptidase A |
Protein accession | YP_719001 |
Protein GI | 113460934 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0339] Zn-dependent oligopeptidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCTAATC CACTATTAAC CCATTCAGAA CTTCCACAAT TTTCAAAAAT CAAGCCTGAA CATGTTCAAC CGGCGATTGA GCAATTAATC CGAGAAAATC GTGAAACAGT AGAACAACTG TTAAAACAAC CGCACTTTAC ATGGGAGAAT TTTATTCAAC CCCTAAACGA TAAAGAGGAA AAACTTGGTC GTGCTTGGTC GCCTGTTTCT CATTTAAATG CGGTGAAAAA CAGTCCTGAA TTGCGTGCCG CTTATCAAGC TTGTTTACCG CTGATTTCAG AATACAGTAC TTGGTTAGGG CAACATAAAG GGTTATATAA TGCCTATGTG CAACTAAAAA ATAGTCCGGA ATTTACCCAA TATACGGTTG CACAAAAGAA AGCGATAGAA AATGCGTTGC GTGATTTTGA ATTATCGGGT ATTGCCTTAT CAGAAGATAA ACAGCAAAGA TACGGTGAAA TCGTTGCTCG TTTATCTGAA CTAAACGCTA AATTTAGTAA TAATGTCCTT GATGCAACAA TGGGATGGGA TAAAGTCGTC GAAAATGAAC AAGATCTAAT TGGCTTGCCT GAAAGTGCAT TACAAGCAGC AAAACAATCC GCACAATCTA AAGGGCTTGA TGGTTATCGT TTTACCTTGG AATTCCCTAG TTATTTACCC GTCATGACCT ATTGTGAAAA CCGAGAACTA CGAGAAGAAA TGTATCGAGC TTTTGCTACT CGTGCTTCAG AACTGGGACC AAATGCAGGA AAATGGGATA ATACAGATGT TATGCAAGAA ATTTTGACTT TGCGTGTAGA ACTTGCTCAT TTGTTAGGCT TTAATACTTA TACAGAGCTT TCCCTTGCGA CCAAAATGGC GGACACTCCA CAACAAGTTA TTGATTTCTT GGAAAGTTTA GCAAAACGTT CAAAAAATCA AGGCGAGAAA GAATTGACCG AATTACAAGA ATTTTGCCAA AAAAATTACA ATATCACCGC ACTTGAACCT TGGGATATTA GCTTTTATAG CGAAAAACAA AAACAGTATT TATACTCAAT TAATGATGAG GAACTTCGCC CTTATTTTCC AGAAGATCAA GTTATTTCAG GTCTATTTGA ATTAATTAAA CGAATTTTCA ATATTCGAGC AGTAGAGCGT TTTGACGTAG ACACTTGGCA CAAAGATGTA CGCTTCTTTG ATTTAATTGA TGAAAAAAAT GAAGTGAGAG GAAGTTTCTA CTTAGATCTT TATGCTCGAG AAAATAAACG TGGCGGTGCT TGGATGGATG ATTGTATTGG ACGAAAACGT AAAGCTGACG GTTCAATTCA AAAACCTGTT GCATACCTAA CTTGTAATTT TAATGCTCCG ATTGGCGACA AACCTGCTTT ATTCACCCAT GACGAAGTCA CTACCCTATT CCATGAATTT GGACATGGTA TTCATCATAT GCTCACAAAA ATTGATGTCG CTGATGTTGC CGGAATTAAT GGTGTACCTT GGGATGCAGT AGAATTGCCA AGTCAATTTT TAGAAAATTG GTGTTGGGAA GAAGAGGCGT TAGCATTCAT CTCAGGACAT TATGAAACCG GCAAGCCACT ACCAAAAGAA AAATTAGATC AATTACTAAA AGCGAAAAAC TTTCAAGCGG CAATGTTTGT TTTACGTCAA TTAGAATTCG GTTTATTTGA CTTCCGTTTA CACCATTATT TTGAAGCTAA TAAACCAAAT CAAATCTTAG CGACCTTAAA ACAGGTTAAA TCTGACGTTG CGGTGATCAA AGGTGTGGAT TGGGCAAGAA CACCTCACAG CTTTAGCCAT ATTTTTGCAG GCGGATATTC TGCCGGATAC TACAGCTATC TTTGGGCTGA AGTTTTATCT GCCGATGCTT ATTCTCGTTT TGAAGAAGAA GGAATTTTCA ATGCTGAAAC AGGTCGTTCA TTTTTAGAGG AAATTTTAAC TCGTGGCGGC TCGGAAGATC CGATGACTTT ATTTAAACGC TTTAGAGGTC GTGAACCTCA ATTAGATGCA CTACTGCGAC ATAAAGGTAT TGCTAATTAG
|
Protein sequence | MSNPLLTHSE LPQFSKIKPE HVQPAIEQLI RENRETVEQL LKQPHFTWEN FIQPLNDKEE KLGRAWSPVS HLNAVKNSPE LRAAYQACLP LISEYSTWLG QHKGLYNAYV QLKNSPEFTQ YTVAQKKAIE NALRDFELSG IALSEDKQQR YGEIVARLSE LNAKFSNNVL DATMGWDKVV ENEQDLIGLP ESALQAAKQS AQSKGLDGYR FTLEFPSYLP VMTYCENREL REEMYRAFAT RASELGPNAG KWDNTDVMQE ILTLRVELAH LLGFNTYTEL SLATKMADTP QQVIDFLESL AKRSKNQGEK ELTELQEFCQ KNYNITALEP WDISFYSEKQ KQYLYSINDE ELRPYFPEDQ VISGLFELIK RIFNIRAVER FDVDTWHKDV RFFDLIDEKN EVRGSFYLDL YARENKRGGA WMDDCIGRKR KADGSIQKPV AYLTCNFNAP IGDKPALFTH DEVTTLFHEF GHGIHHMLTK IDVADVAGIN GVPWDAVELP SQFLENWCWE EEALAFISGH YETGKPLPKE KLDQLLKAKN FQAAMFVLRQ LEFGLFDFRL HHYFEANKPN QILATLKQVK SDVAVIKGVD WARTPHSFSH IFAGGYSAGY YSYLWAEVLS ADAYSRFEEE GIFNAETGRS FLEEILTRGG SEDPMTLFKR FRGREPQLDA LLRHKGIAN
|
| |