Gene HS_1543 details


Gene Information

Locus tag: HS_1543
Symbol:
ID: 4241064
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haemophilus somnus 129PT
Kingdom: Bacteria
Replicon accession: NC_008309
Strand:
Start bp: 1740598
End bp: 1741956
Gene Length: 1359 bp
Protein Length: 452 aa
Translation table: 11
GC content: 40%
IMG OID: 638105123
Product: outer membrane protein
Protein accession: YP_719748
Protein GI: 113461679
COG category: [U] Intracellular trafficking, secretion, and vesicular transport; [W] Extracellular structures
COG ID: [COG5295] Autotransporter adhesin
TIGRFAM ID:
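
The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes 1-based inclusive coordinates and a gene length that includes the stop codon. Both assumptions fit the listed values but are not stated in the record, and the function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes the stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon is the stop codon and encodes no residue.
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length_bp(1740598, 1741956)
    print(length, protein_length_aa(length))
```

With the Start bp and End bp values from the record, this gives 1359 bp and 452 aa, matching the Gene Length and Protein Length fields.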


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.316973
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAAAAG TACAATTTTT TAAATATTCA TCATTGGCAT TAGCATTGGG TTTAGGGGTA 
AGTGCTTCTG CTTTGGCAGC CCCAACAAGT ACAAGTACGA CTACTGGACC AGAGGCGCCT
CCTACAGGCC CTGCTCCTAC GGCGAAAGAC CCTCTAGCAG AAACAGCGTT AGCCTATGAT
TTGGAGAACG AAGTTGCGTA TCTTCGTATG AAGGCGGGTG AGTGGATGCA ATTGGGGCTT
GATCCTGAAA AAGAAGTCAT CAAAGGCTGG AATGAGGTAA AATCTCTCCC TCGTATCGAT
GGAAATGGAA AGGATAAACA GACAAAAGAT CAAATAGCAA TGTTGATAAG AACGGTTGAT
AATACAAAAG AGCTTGGTCG GATCGTTAGT ACAAACATTG AAGATATTAA GAACCTTAAA
AAAGAGCTTT ACGGTTTTGT AGAAGATGTG AACGAGAGTG AAGCACGCAA TATCTCAAGA
ATAGATGAGA ATGAGAAAGA TATTAAGAAC CTTAAAAAAG AGCTTTACGA TTTTGTAGAA
GATGTGAACG AGAGTGAAGC ACGCAATATC TCAAGAATAG ATGAAAATGA GAAGGACATT
AATACTCTTA AAGAGCTAAT GGATGAGGAT TTAAATTCAG TCTTAACCCA AATTGAAGAT
GTAAAACTCA CATTTCAAGA TGTCAATGAT AACGTTAATT TGGCATTTGA AGAGATTAAT
GGAAATGCCC AAAAGTTTGA CACTGCTATT GAAGGACTTA CTTCAGGTTT GAGCGATTTA
CAAGCTAAAG TCGATGCAAA TAAACAAGAA ACTGAAGACG ATATTGCGGA CAATGCCAAG
GCTATTCATA GCAACACAAA AGGTATTGCT AAAAATACCA AGGATATTCG TGACTTGGAC
ACCAAAACCA AGCAAATGTT GGAAAATGAC AAAAACTTGA TGACCGGTTT AGAATCTTTA
GCAACAGAAA CAAGCAAAGG CTTTGAAAGA TTTGATGTCA AAACACAACA ATTAGATCAA
GCCGTCGCAA ATGTCGTCGG TCGAGTAGAC ATAACTGAGC AAGCTATTCG CCAAAACACT
GCAGGCTTAG TCAATGTGAA TAAACGTGTC GATACACTCG ACAAAAACAC CAAAGCCGGT
ATCGCTTCTG CAGTCGCTTT AGGTATGTTG CCACAATCCA CTGCTCCGGG TAAATCATTA
GTGAGCTTAG GTGTCGGTCA TCACCGTGGG CAAAGTGCTA CTGCTATTGG AGTATCTTCT
ATGAGCAGTA ACGGTAAATG GGTTGTTAAA GGCGGTATGA GCTATGATAC ACAGCGTCAT
GCTACTTTCG GCGGTTCTGT CGGTTTTTTC TTTAACTAA
 
Protein sequence
MKKVQFFKYS SLALALGLGV SASALAAPTS TSTTTGPEAP PTGPAPTAKD PLAETALAYD 
LENEVAYLRM KAGEWMQLGL DPEKEVIKGW NEVKSLPRID GNGKDKQTKD QIAMLIRTVD
NTKELGRIVS TNIEDIKNLK KELYGFVEDV NESEARNISR IDENEKDIKN LKKELYDFVE
DVNESEARNI SRIDENEKDI NTLKELMDED LNSVLTQIED VKLTFQDVND NVNLAFEEIN
GNAQKFDTAI EGLTSGLSDL QAKVDANKQE TEDDIADNAK AIHSNTKGIA KNTKDIRDLD
TKTKQMLEND KNLMTGLESL ATETSKGFER FDVKTQQLDQ AVANVVGRVD ITEQAIRQNT
AGLVNVNKRV DTLDKNTKAG IASAVALGML PQSTAPGKSL VSLGVGHHRG QSATAIGVSS
MSSNGKWVVK GGMSYDTQRH ATFGGSVGFF FN
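
The gene and protein sequences above are related through translation table 11. A minimal sketch of that relationship, and of the GC content calculation, is given below. It assumes the sequence text is pasted in as a string with its 10-character blocks and line breaks intact. The helper names are illustrative. The codon-to-residue assignments used are those of the standard genetic code, which table 11 shares. The alternative start codons that table 11 permits are not handled, which does not matter here because this gene starts with ATG.

```python
from itertools import product

# Codon table built in TCAG order; residue assignments are those of the
# standard code, which translation table 11 shares ('*' marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq_text: str) -> str:
    """Remove the spaces and line breaks used to format the sequence blocks."""
    return "".join(seq_text.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a cleaned DNA string."""
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate a cleaned CDS from its first base, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    # First three blocks of the gene sequence listed above.
    fragment = clean("ATGAAAAAAG TACAATTTTT TAAATATTCA")
    print(translate(fragment))  # MKKVQFFKYS
```

Running `translate` on the full cleaned gene sequence and comparing the result with the cleaned protein sequence is one way to check that the two listings agree.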