Gene HS_1243 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHS_1243 
Symbol 
ID4240754 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaemophilus somnus 129PT 
KingdomBacteria 
Replicon accessionNC_008309 
Strand
Start bp1424054 
End bp1425319 
Gene Length1266 bp 
Protein Length421 aa 
Translation table11 
GC content39% 
IMG OID638104816 
ProductM20/M25/M40 family peptidase 
Protein accessionYP_719455 
Protein GI113461386 
COG category[R] General function prediction only 
COG ID[COG1473] Metal-dependent amidase/aminoacylase/carboxypeptidase 
TIGRFAM ID[TIGR01891] amidohydrolase 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGAATT TAGCTAGATT AATTGAATGG CGAAGAGAAT TTCACCGTTT TCCTGAAACT 
GGTTGGACAG AATTTTGGAC AACCAGTCGC ATCGCTGATT ATTTAGAAGA AATGGGGGCT
GAGATTTTAC TCGGCAATCA AATTATTCAT TATGATTTTG TGCGTGGTCG TGATACGAAA
TTAGTTGAGA AAGGCATTGA AAGTGCTATT TCTTATGGTG CAAAAACCAA ATGGCTAGAG
AGAATGAATG GCTATACAGG TTGTGTTGCT GTATTTGATT CTGGTAAACA GGGAAAAACC
TTGGCATTAC GCTTTGATAT AGATTGTGTT AATGTGACTG AAACCAATGA TCCTAATCAT
TTACCCAATA AATTAAAATT TGCCTCATTA AATCTAGGGT TTATGCATGC TTGTGGTCAT
GATGGACACA TCACAATTGG TTTAGGTATT GCGTCATGGC TTTCACAAAA TAAGGATAAA
TTTAACGGTA AAGTCAAAAT TGTCTTTCAG CCTGCTGAAG AGGGAGTGAG AGGCGCAGCA
GCTATTGCTG CAAGTGGAGT GATTGATGAC GCGGATTATT TTGCCTCTTC CCATATTAGC
TTTTGTGCCA ACACAGGTAC TGTCTTAGCA AATCCTCGTA ATTTCCTTTC CACAACCAAA
ATTGATATTC GTTATAAAGG ACGACCTGCA CATGCTGGTG CTGCTCCACA TTTAGGTCGA
AATGCATTAT TAGCAGCGGC TAATGCCGTA ACACAATTCC ATAGCATTGC TCGTCATGGT
GAGGGGATGT CTCGAATTAA TGTAGGTGTA TTAAAAGCAG GTGAAGGGCG TAATGTGATT
CCGAGCAGTG CGGAAATTCA ACTTGAAGTA CGTGGTGAAA ATAAAGAGAT TAATCAATAT
ATGGTGGATC AAGTTATGCA AATTGCCAAA GGCATTGCGA TTAGTTTCGA TGTAAGTTAT
GAAACTGAAA TTATGGGTGA AGCAGTGGAT ATGAATAATG ATCTTGAGTT AGTTAAATTG
CTCGAGGAAA TTGCCATTCA ACAACCTGGA ATCAATGAGT CTACTGCTGA TTATGCGTTT
AATGCCAGCG AGGATGCAAC GATTTTAGGG CGTCGTGTGC AAGATCATGG TGGTAAAGCG
ATTTATTTTA TTATTGGTGC TGATCGTACA GCTGGGCATC ATGAGGCAAA CTTCGATTTT
GATGAAAATC AATTGCTGAC TGGGGTAAAT ATTTATATTG GTTTAGTACA GCGGTTATTG
GGGTAG
 
Protein sequence
MTNLARLIEW RREFHRFPET GWTEFWTTSR IADYLEEMGA EILLGNQIIH YDFVRGRDTK 
LVEKGIESAI SYGAKTKWLE RMNGYTGCVA VFDSGKQGKT LALRFDIDCV NVTETNDPNH
LPNKLKFASL NLGFMHACGH DGHITIGLGI ASWLSQNKDK FNGKVKIVFQ PAEEGVRGAA
AIAASGVIDD ADYFASSHIS FCANTGTVLA NPRNFLSTTK IDIRYKGRPA HAGAAPHLGR
NALLAAANAV TQFHSIARHG EGMSRINVGV LKAGEGRNVI PSSAEIQLEV RGENKEINQY
MVDQVMQIAK GIAISFDVSY ETEIMGEAVD MNNDLELVKL LEEIAIQQPG INESTADYAF
NASEDATILG RRVQDHGGKA IYFIIGADRT AGHHEANFDF DENQLLTGVN IYIGLVQRLL
G