Gene HS_0791 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHS_0791 
SymbolprlC 
ID4240282 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaemophilus somnus 129PT 
KingdomBacteria 
Replicon accessionNC_008309 
Strand
Start bp858994 
End bp861033 
Gene Length2040 bp 
Protein Length679 aa 
Translation table11 
GC content38% 
IMG OID638104345 
Productoligopeptidase A 
Protein accessionYP_719001 
Protein GI113460934 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0339] Zn-dependent oligopeptidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCTAATC CACTATTAAC CCATTCAGAA CTTCCACAAT TTTCAAAAAT CAAGCCTGAA 
CATGTTCAAC CGGCGATTGA GCAATTAATC CGAGAAAATC GTGAAACAGT AGAACAACTG
TTAAAACAAC CGCACTTTAC ATGGGAGAAT TTTATTCAAC CCCTAAACGA TAAAGAGGAA
AAACTTGGTC GTGCTTGGTC GCCTGTTTCT CATTTAAATG CGGTGAAAAA CAGTCCTGAA
TTGCGTGCCG CTTATCAAGC TTGTTTACCG CTGATTTCAG AATACAGTAC TTGGTTAGGG
CAACATAAAG GGTTATATAA TGCCTATGTG CAACTAAAAA ATAGTCCGGA ATTTACCCAA
TATACGGTTG CACAAAAGAA AGCGATAGAA AATGCGTTGC GTGATTTTGA ATTATCGGGT
ATTGCCTTAT CAGAAGATAA ACAGCAAAGA TACGGTGAAA TCGTTGCTCG TTTATCTGAA
CTAAACGCTA AATTTAGTAA TAATGTCCTT GATGCAACAA TGGGATGGGA TAAAGTCGTC
GAAAATGAAC AAGATCTAAT TGGCTTGCCT GAAAGTGCAT TACAAGCAGC AAAACAATCC
GCACAATCTA AAGGGCTTGA TGGTTATCGT TTTACCTTGG AATTCCCTAG TTATTTACCC
GTCATGACCT ATTGTGAAAA CCGAGAACTA CGAGAAGAAA TGTATCGAGC TTTTGCTACT
CGTGCTTCAG AACTGGGACC AAATGCAGGA AAATGGGATA ATACAGATGT TATGCAAGAA
ATTTTGACTT TGCGTGTAGA ACTTGCTCAT TTGTTAGGCT TTAATACTTA TACAGAGCTT
TCCCTTGCGA CCAAAATGGC GGACACTCCA CAACAAGTTA TTGATTTCTT GGAAAGTTTA
GCAAAACGTT CAAAAAATCA AGGCGAGAAA GAATTGACCG AATTACAAGA ATTTTGCCAA
AAAAATTACA ATATCACCGC ACTTGAACCT TGGGATATTA GCTTTTATAG CGAAAAACAA
AAACAGTATT TATACTCAAT TAATGATGAG GAACTTCGCC CTTATTTTCC AGAAGATCAA
GTTATTTCAG GTCTATTTGA ATTAATTAAA CGAATTTTCA ATATTCGAGC AGTAGAGCGT
TTTGACGTAG ACACTTGGCA CAAAGATGTA CGCTTCTTTG ATTTAATTGA TGAAAAAAAT
GAAGTGAGAG GAAGTTTCTA CTTAGATCTT TATGCTCGAG AAAATAAACG TGGCGGTGCT
TGGATGGATG ATTGTATTGG ACGAAAACGT AAAGCTGACG GTTCAATTCA AAAACCTGTT
GCATACCTAA CTTGTAATTT TAATGCTCCG ATTGGCGACA AACCTGCTTT ATTCACCCAT
GACGAAGTCA CTACCCTATT CCATGAATTT GGACATGGTA TTCATCATAT GCTCACAAAA
ATTGATGTCG CTGATGTTGC CGGAATTAAT GGTGTACCTT GGGATGCAGT AGAATTGCCA
AGTCAATTTT TAGAAAATTG GTGTTGGGAA GAAGAGGCGT TAGCATTCAT CTCAGGACAT
TATGAAACCG GCAAGCCACT ACCAAAAGAA AAATTAGATC AATTACTAAA AGCGAAAAAC
TTTCAAGCGG CAATGTTTGT TTTACGTCAA TTAGAATTCG GTTTATTTGA CTTCCGTTTA
CACCATTATT TTGAAGCTAA TAAACCAAAT CAAATCTTAG CGACCTTAAA ACAGGTTAAA
TCTGACGTTG CGGTGATCAA AGGTGTGGAT TGGGCAAGAA CACCTCACAG CTTTAGCCAT
ATTTTTGCAG GCGGATATTC TGCCGGATAC TACAGCTATC TTTGGGCTGA AGTTTTATCT
GCCGATGCTT ATTCTCGTTT TGAAGAAGAA GGAATTTTCA ATGCTGAAAC AGGTCGTTCA
TTTTTAGAGG AAATTTTAAC TCGTGGCGGC TCGGAAGATC CGATGACTTT ATTTAAACGC
TTTAGAGGTC GTGAACCTCA ATTAGATGCA CTACTGCGAC ATAAAGGTAT TGCTAATTAG
 
Protein sequence
MSNPLLTHSE LPQFSKIKPE HVQPAIEQLI RENRETVEQL LKQPHFTWEN FIQPLNDKEE 
KLGRAWSPVS HLNAVKNSPE LRAAYQACLP LISEYSTWLG QHKGLYNAYV QLKNSPEFTQ
YTVAQKKAIE NALRDFELSG IALSEDKQQR YGEIVARLSE LNAKFSNNVL DATMGWDKVV
ENEQDLIGLP ESALQAAKQS AQSKGLDGYR FTLEFPSYLP VMTYCENREL REEMYRAFAT
RASELGPNAG KWDNTDVMQE ILTLRVELAH LLGFNTYTEL SLATKMADTP QQVIDFLESL
AKRSKNQGEK ELTELQEFCQ KNYNITALEP WDISFYSEKQ KQYLYSINDE ELRPYFPEDQ
VISGLFELIK RIFNIRAVER FDVDTWHKDV RFFDLIDEKN EVRGSFYLDL YARENKRGGA
WMDDCIGRKR KADGSIQKPV AYLTCNFNAP IGDKPALFTH DEVTTLFHEF GHGIHHMLTK
IDVADVAGIN GVPWDAVELP SQFLENWCWE EEALAFISGH YETGKPLPKE KLDQLLKAKN
FQAAMFVLRQ LEFGLFDFRL HHYFEANKPN QILATLKQVK SDVAVIKGVD WARTPHSFSH
IFAGGYSAGY YSYLWAEVLS ADAYSRFEEE GIFNAETGRS FLEEILTRGG SEDPMTLFKR
FRGREPQLDA LLRHKGIAN