Gene HS_1287 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHS_1287 
SymboloapA 
ID4240798 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaemophilus somnus 129PT 
KingdomBacteria 
Replicon accessionNC_008309 
Strand
Start bp1475568 
End bp1476776 
Gene Length1209 bp 
Protein Length402 aa 
Translation table11 
GC content37% 
IMG OID638104860 
Productopacity associated protein A 
Protein accessionYP_719499 
Protein GI113461430 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG3061] Cell envelope opacity-associated protein A 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value1.83549e-07 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTGCTTCT TTAAACATAA CTTTCTGACT CATCAAGGGA CTATCATGAC AGCTGAACAA 
AATAAAATCG ACACTTCTGG ACAAGCTTCT ATACAAAAAG AATTGGAGCT TGATCTGGAT
CCTATGGAGC CGATACTTCC GAAAAAAAAT CTTGCATCTA AACAAACTTT ATTGGAAAAA
GCCAAACGTA TTTTTAAGAG AGCGGATAAG CAAAATAGAG TTGATAAAGC AGAAGTGAGT
ACACATAAAG AAGTAATATC CGCTAGTGAT GATGAACAAC CTCCAATAAT CACGGAAACA
CAAGAGCAAC AAATAAGCCA AGCAATAGAC CCGACTATAT CTGAAGTGCC TGTTAAGACA
CAAAAAGAAC TGAAAAAACC GGAAAATTGG ACTATGTTAG GTGTGTTACC GCCTAAATAT
CGCCGTATTT TTATTGCATT ATTAGTTGTC GTCATTATTT TATTAATTAT TTCTTGGTTA
AAACCGGATG ATAATATAGA GCAATATTTT GAACAATCAA CAGGCAATAG TATTCCGACA
CAATTTCAGC CTTTAGATCA TTCTCAACCG GTTGAACCGA GCATTTTAGA GCAACTGAAG
AATCCTCAAC CAAAAAGTGA AAATATCACA CAGAATGATC AGGAAAAAAA TATGCCTGTA
CAAGCGTTGC AGGTAGAAAA CGAACCCCAA AATATAGCCA TACCGATAAA TTCAACACCT
AGTAAGGCGG ACAGTCCAGT AGTGGAAACA AGCAAGCGAC CAAGTATTAA AGTTGAGGAA
GCAACATCTC AACCTATAGC ACCAATGGAA GAAGTGAAAA TTGTTGAGAA AAATGAGCCA
AATAAAAAGG CAGAAAAACA GGTGAAAGCA GAGCAGAAAG GCGTGCCTGT TGTTGATGCG
AAACCGGCTA ATGTGAATAA AGTAACAAAT TCACAGAAAA CCGCTTTACA AAAAAATGCA
CAGACTAAAA CCCTTGTTAT TCCGCAAGGA ACATCACTGA TGCAAGTATT CCGCAACAAC
AATTTGAATA TTGCCGATGT CAATGCAATG ACGAAAGCAA GTGGGGCCGG TAATACATTA
AGTAGTTTCA AAGCGGGTGA TAAAGTACAA GTATCCTTAA ATAAGCAAGG GCGTGTCAAT
GAATTGCGTC TATCCAATGG TGCGAGGTTT ATCCGCCAAG CCGATGGAAG TTATCAATTT
AAAAAATAA
 
Protein sequence
MCFFKHNFLT HQGTIMTAEQ NKIDTSGQAS IQKELELDLD PMEPILPKKN LASKQTLLEK 
AKRIFKRADK QNRVDKAEVS THKEVISASD DEQPPIITET QEQQISQAID PTISEVPVKT
QKELKKPENW TMLGVLPPKY RRIFIALLVV VIILLIISWL KPDDNIEQYF EQSTGNSIPT
QFQPLDHSQP VEPSILEQLK NPQPKSENIT QNDQEKNMPV QALQVENEPQ NIAIPINSTP
SKADSPVVET SKRPSIKVEE ATSQPIAPME EVKIVEKNEP NKKAEKQVKA EQKGVPVVDA
KPANVNKVTN SQKTALQKNA QTKTLVIPQG TSLMQVFRNN NLNIADVNAM TKASGAGNTL
SSFKAGDKVQ VSLNKQGRVN ELRLSNGARF IRQADGSYQF KK