Gene Information |
Locus tag | HS_0516 |
Symbol | pepA |
ID | 4239998 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haemophilus somnus 129PT |
Kingdom | Bacteria |
Replicon accession | NC_008309 |
Strand | - |
Start bp | 558033 |
End bp | 559520 |
Gene Length | 1488 bp |
Protein Length | 495 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 638104064 |
Product | leucyl aminopeptidase |
Protein accession | YP_718727 |
Protein GI | 113460661 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0260] Leucyl aminopeptidase |
TIGRFAM ID | |
|
|
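The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the stated gene length includes the stop codon. The function names are hypothetical and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record for HS_0516 (pepA).
length_bp = gene_length(558033, 559520)
print(length_bp)                  # 1488
print(protein_length(length_bp))  # 495
```

Both results agree with the Gene Length (1488 bp) and Protein Length (495 aa) fields in the record.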
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.204663 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAATATA GTGCAAAAAA TACCGCACTT TCTCAAATTG ACAGCAACAT TATTTTAGCC GTATTTGAGG ACGGAGAACT TTCTCCGACA GCAATGCAAT TTGACCAACT CAGTCAAGGC TACCTAACTC GCTTAATTCA AGTTGGTGAG GTTAGCGGAA AACAGGGACA AGTTCTTATT TTACGGGATA TACCGAATTG CCAAGCACAA CGTATTTTTA TCGTAGGGTG CGGTAAAAAA GATAAAATAA CAGAGCGTCA ATATAAACAG ATTATTCAAA AAACAATTCA AACGATTCTT GAGACTCAAG CAAGTGAAGT TGTGAGCTTT CTAAATGAAA TTGAACTAAA AAATCGTGAT ATTCATTGGA ATATTCGTTT TGCAATTGAA ACTATTGAAG CAAGTTTTTA TCAATTTGAT GCTTTCAAAA CTAAAAAAGG TGACGAAAAT TCAGTATTAA ATGAATTTAT TTTTGATGTT CAACCTGAAT TACAACAAGA TGCACTACTG GCAATTACTT ACGCACAAGC TATTGCATTG GGAGTCAAAC ATGCAAAAGA TATTGCCAAT TGCCCGCCTA ATATTTGCAA CCCAACTTAT CTTGCCGAAC AAGCACAATC CCTCGCTAAA CACTCAAACT TGATTAACGT GCAAGTTTTG GGTGAAAAAG AAATGGCAGA ATTGAACATG TTCTCGTATT TAGCTGTTTC GCAAGGAAGT GCTAACGAAG CAAAAATGTC GGTGATTGAA TATCGTAATC ACCCCGATAA AAATGCCAAA CCTATTGTTT TAGTTGGAAA AGGTTTAACC TTTGATGCCG GCGGTATTTC ATTAAAACCC GCTGATAGTA TGGACGAAAT GAAATATGAT ATGTGCGGTG CAGCTTCTGT ATTCGGTGTT ATGTACGCTC TGGCAACATT ACAATTACCC TTAAATGTAA TTGGTGTATT GGCTGGTTGT GAAAATTTGC CGGACGGAAA TTCATATCGA CCGGGAGATA TTTTAACCAC TATGTCAGGA TTAACCGTCG AAGTTTTAAA TACTGATGCG GAAGGACGTT TAGTTTTATG TGACGCACTC ACTTATGTTG AGCGATTTAA CCCTGAGTTG GTCATTGATG TAGCAACACT AACAGGTGCT TGTGTAGTGG CATTAGGTCA ACATAACAGT GGCTTAATCG CAACAGATGA AAAACTTGCT GAAAAATTAT TAAATGCGGC AGAAGAAACG ACAGATAAAG CTTGGCGTTT ACCTTTAAGC GAAGAGTATC AGGAACAGTT AAAATCTAAT TTTGCTGATT TAGCTAATAT TGGTGGACGT TGGGGTGGAG CCATTACCGC TGGTGCATTT TTAGCTAACT TTACGAAAAA TTATCCTTGG GCTCATTTAG ATATTGCAGG AACTGCCTGG TTACAAGGTA CAAACAAAGG TGCAACGGGA CGACCGGTAA GTTTACTGAC ACAATTCTTA ATTAATCAAT CCAAATAA
|
Protein sequence | MKYSAKNTAL SQIDSNIILA VFEDGELSPT AMQFDQLSQG YLTRLIQVGE VSGKQGQVLI LRDIPNCQAQ RIFIVGCGKK DKITERQYKQ IIQKTIQTIL ETQASEVVSF LNEIELKNRD IHWNIRFAIE TIEASFYQFD AFKTKKGDEN SVLNEFIFDV QPELQQDALL AITYAQAIAL GVKHAKDIAN CPPNICNPTY LAEQAQSLAK HSNLINVQVL GEKEMAELNM FSYLAVSQGS ANEAKMSVIE YRNHPDKNAK PIVLVGKGLT FDAGGISLKP ADSMDEMKYD MCGAASVFGV MYALATLQLP LNVIGVLAGC ENLPDGNSYR PGDILTTMSG LTVEVLNTDA EGRLVLCDAL TYVERFNPEL VIDVATLTGA CVVALGQHNS GLIATDEKLA EKLLNAAEET TDKAWRLPLS EEYQEQLKSN FADLANIGGR WGGAITAGAF LANFTKNYPW AHLDIAGTAW LQGTNKGATG RPVSLLTQFL INQSK
|
| |
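As a sanity check on the two sequences above, the sketch below translates the first codons of the gene sequence and compares the result with the start of the listed protein sequence. It assumes the gene sequence is shown in coding orientation (it begins with ATG although the strand is "-") and that translation table 11 assigns the same amino acids to codons as the standard code. The helper names are hypothetical.

```python
from itertools import product

# Codon table built in the conventional TCAG order.
# Assumption: table 11 shares these codon-to-amino-acid assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDDDEEEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces used to group the sequence into blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First three blocks of the gene sequence and first block of the protein sequence.
gene_start = "ATGAAATATA GTGCAAAAAA TACCGCACTT"
protein_start = "MKYSAKNTAL"
print(translate(gene_start) == protein_start)  # True
```

Passing the full gene sequence to `translate` and `gc_percent` would allow the same comparison against the complete protein sequence and the GC content field.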