Gene HS_0516 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHS_0516 
SymbolpepA 
ID4239998 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaemophilus somnus 129PT 
KingdomBacteria 
Replicon accessionNC_008309 
Strand
Start bp558033 
End bp559520 
Gene Length1488 bp 
Protein Length495 aa 
Translation table11 
GC content38% 
IMG OID638104064 
Productleucyl aminopeptidase 
Protein accessionYP_718727 
Protein GI113460661 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0260] Leucyl aminopeptidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.204663 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAATATA GTGCAAAAAA TACCGCACTT TCTCAAATTG ACAGCAACAT TATTTTAGCC 
GTATTTGAGG ACGGAGAACT TTCTCCGACA GCAATGCAAT TTGACCAACT CAGTCAAGGC
TACCTAACTC GCTTAATTCA AGTTGGTGAG GTTAGCGGAA AACAGGGACA AGTTCTTATT
TTACGGGATA TACCGAATTG CCAAGCACAA CGTATTTTTA TCGTAGGGTG CGGTAAAAAA
GATAAAATAA CAGAGCGTCA ATATAAACAG ATTATTCAAA AAACAATTCA AACGATTCTT
GAGACTCAAG CAAGTGAAGT TGTGAGCTTT CTAAATGAAA TTGAACTAAA AAATCGTGAT
ATTCATTGGA ATATTCGTTT TGCAATTGAA ACTATTGAAG CAAGTTTTTA TCAATTTGAT
GCTTTCAAAA CTAAAAAAGG TGACGAAAAT TCAGTATTAA ATGAATTTAT TTTTGATGTT
CAACCTGAAT TACAACAAGA TGCACTACTG GCAATTACTT ACGCACAAGC TATTGCATTG
GGAGTCAAAC ATGCAAAAGA TATTGCCAAT TGCCCGCCTA ATATTTGCAA CCCAACTTAT
CTTGCCGAAC AAGCACAATC CCTCGCTAAA CACTCAAACT TGATTAACGT GCAAGTTTTG
GGTGAAAAAG AAATGGCAGA ATTGAACATG TTCTCGTATT TAGCTGTTTC GCAAGGAAGT
GCTAACGAAG CAAAAATGTC GGTGATTGAA TATCGTAATC ACCCCGATAA AAATGCCAAA
CCTATTGTTT TAGTTGGAAA AGGTTTAACC TTTGATGCCG GCGGTATTTC ATTAAAACCC
GCTGATAGTA TGGACGAAAT GAAATATGAT ATGTGCGGTG CAGCTTCTGT ATTCGGTGTT
ATGTACGCTC TGGCAACATT ACAATTACCC TTAAATGTAA TTGGTGTATT GGCTGGTTGT
GAAAATTTGC CGGACGGAAA TTCATATCGA CCGGGAGATA TTTTAACCAC TATGTCAGGA
TTAACCGTCG AAGTTTTAAA TACTGATGCG GAAGGACGTT TAGTTTTATG TGACGCACTC
ACTTATGTTG AGCGATTTAA CCCTGAGTTG GTCATTGATG TAGCAACACT AACAGGTGCT
TGTGTAGTGG CATTAGGTCA ACATAACAGT GGCTTAATCG CAACAGATGA AAAACTTGCT
GAAAAATTAT TAAATGCGGC AGAAGAAACG ACAGATAAAG CTTGGCGTTT ACCTTTAAGC
GAAGAGTATC AGGAACAGTT AAAATCTAAT TTTGCTGATT TAGCTAATAT TGGTGGACGT
TGGGGTGGAG CCATTACCGC TGGTGCATTT TTAGCTAACT TTACGAAAAA TTATCCTTGG
GCTCATTTAG ATATTGCAGG AACTGCCTGG TTACAAGGTA CAAACAAAGG TGCAACGGGA
CGACCGGTAA GTTTACTGAC ACAATTCTTA ATTAATCAAT CCAAATAA
 
Protein sequence
MKYSAKNTAL SQIDSNIILA VFEDGELSPT AMQFDQLSQG YLTRLIQVGE VSGKQGQVLI 
LRDIPNCQAQ RIFIVGCGKK DKITERQYKQ IIQKTIQTIL ETQASEVVSF LNEIELKNRD
IHWNIRFAIE TIEASFYQFD AFKTKKGDEN SVLNEFIFDV QPELQQDALL AITYAQAIAL
GVKHAKDIAN CPPNICNPTY LAEQAQSLAK HSNLINVQVL GEKEMAELNM FSYLAVSQGS
ANEAKMSVIE YRNHPDKNAK PIVLVGKGLT FDAGGISLKP ADSMDEMKYD MCGAASVFGV
MYALATLQLP LNVIGVLAGC ENLPDGNSYR PGDILTTMSG LTVEVLNTDA EGRLVLCDAL
TYVERFNPEL VIDVATLTGA CVVALGQHNS GLIATDEKLA EKLLNAAEET TDKAWRLPLS
EEYQEQLKSN FADLANIGGR WGGAITAGAF LANFTKNYPW AHLDIAGTAW LQGTNKGATG
RPVSLLTQFL INQSK