Gene HS_0388 details

Gene Information

Locus tag: HS_0388
Symbol: pepB
ID: 4239864
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haemophilus somnus 129PT
Kingdom: Bacteria
Replicon accession: NC_008309
Strand:
Start bp: 412919
End bp: 414217
Gene Length: 1299 bp
Protein Length: 432 aa
Translation table: 11
GC content: 39%
IMG OID: 638103931
Product: aminopeptidase B
Protein accession: YP_718598
Protein GI: 113460534
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0260] Leucyl aminopeptidase
TIGRFAM ID:
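
The coordinates and lengths in this record can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The variable names are hypothetical and not part of the database record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 412919
end_bp = 414217

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and encodes no residue.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length)       # 1299
print(gene_length % 3)   # 0
print(protein_length)    # 432
```

Under those assumptions the result agrees with the listed Gene Length (1299 bp) and Protein Length (432 aa).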


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCAAATTA AAATAGAAAT TTCACCAGCA AAAGAACAAT GGGGAAAAAA AGCTTTAATC 
AGTTTTACTC ACGATCAAGC CATCATTCAT CATCCGACAC GTAACCAAGA CATTTCTTTA
ATTCAAAAAG CCGCTCGAAA ATTACGCAAC CAAGGAATAA AAGAAGTTGA ACTTATTGGA
TCAGAATGGG ATTTAGAAAA TTGTTGGGCA TTCTATCAAG GCTTTTATAC CGCTAAACAA
GATTATACTA TTGAATTTCC TCATTTAGAG GACGAGCCAC AAGCTGAATT GCTGGCAAGA
ATTCAATGTG GCGATTTTGT GCGTGAAATT ATTAATCTCC CCTCTTCAAT TATCACCCCT
ATTGAACTTG CAAAGCGTGC GGCACAATTT ATCACCGAAC AAGCAAGTCT TTATGCTGAC
GAAAGTGCGG TATCTTTTCA GATTATTTCC GGACAAGAAC TTGCTGAAAA AAACTATCAA
GGGATTTGGC AAGTGGGTAA AGGCTCAGAA AATTTACCAG CTATGTTACA GCTGGATTTT
AATCCTACCG GAAATGAAAA TGCACCGGTA CTGGCTTGTT TAGTCGGTAA AGGAATTACT
TTTGACAGTG GTGGTTACAG CATTAAACCA AGTGATAGCA TGAGTACAAT GCGAACAGAT
ATGGGCGGTG CTGCACTATT AACGGGTGCA TTAGGTATGG CAATAGCCGG AGGGCTTAAC
AAACGAGTTA AACTATTTTT ATGTTGTGCT GAAAATATGG TGAGCCACAA CGCTTTAAAA
TTAGGTGATA TTATTCACTA CCGCAATGGT ATCAGTGCGG AAATTCTTAA TACTGATGCC
GAAGGGCGTT TGGTACTTGC CGATGGCTTA ATTGATGCGG ATTTAGCGAA GCCTAAATTT
ATTCTTGATT GTGCAACCTT AACCGGTGCA GCAAAAGTAG CGGTTGGTAA TGATTATCAT
GCGATACTTT CAATGGATAA CGAACTCACT CAACAGTTTT TCGATTGTGC AAAGAGTACA
AAAGAACCGT TTTGGCGATT GCCTTTTGAC GAATTACATC GTCACCAAAT TAGTTCATCC
TTTGCAGATA TTGCCAATAT TGGGACTGTT CCAATGGGTG CCGGTGCAAG TACGGCAATG
GCGTTCTTAT CTTATTTTGT AGAAAATTAT CAAGAAAATT GGTTACATAT TGATTGTTCG
GCAACTTATC GTAAATCAGC AAGTGACTTA TGGGCAACAG GTGCAACGGG AATCGGTGTA
CAAACTTTGG CGAATTTTTT ATTAAATAAA GCTGAATAA
 
Protein sequence
MQIKIEISPA KEQWGKKALI SFTHDQAIIH HPTRNQDISL IQKAARKLRN QGIKEVELIG 
SEWDLENCWA FYQGFYTAKQ DYTIEFPHLE DEPQAELLAR IQCGDFVREI INLPSSIITP
IELAKRAAQF ITEQASLYAD ESAVSFQIIS GQELAEKNYQ GIWQVGKGSE NLPAMLQLDF
NPTGNENAPV LACLVGKGIT FDSGGYSIKP SDSMSTMRTD MGGAALLTGA LGMAIAGGLN
KRVKLFLCCA ENMVSHNALK LGDIIHYRNG ISAEILNTDA EGRLVLADGL IDADLAKPKF
ILDCATLTGA AKVAVGNDYH AILSMDNELT QQFFDCAKST KEPFWRLPFD ELHRHQISSS
FADIANIGTV PMGAGASTAM AFLSYFVENY QENWLHIDCS ATYRKSASDL WATGATGIGV
QTLANFLLNK AE
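
The relationship between the gene sequence and the protein sequence above can be sketched in Python. This is a minimal illustration, not the database's own pipeline: the helper names (`clean`, `gc_percent`, `translate`) are hypothetical, and it assumes that the amino acid assignments of translation table 11 match the standard genetic code (the two tables differ in permitted start codons, which does not matter here because the gene starts with ATG). The 10-base blocks are joined by stripping whitespace before processing.

```python
from itertools import product

# Codon table in the conventional TCAG ordering.
# Assumption: table 11 uses the same codon-to-amino-acid assignments
# as the standard code; only the set of start codons differs.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and newlines used to lay out the sequence."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence in this record.
first_blocks = "ATGCAAATTA AAATAGAAAT TTCACCAGCA"
print(translate(first_blocks))  # MQIKIEISPA

# Last nine bases of the gene sequence, including the stop codon.
print(translate("GCTGAATAA"))   # AE
```

Passing the full gene sequence to `translate` and `gc_percent` gives values that can be compared with the protein sequence and the GC content listed in the record.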