Gene HS_1247 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHS_1247 
Symbol 
ID4240758 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaemophilus somnus 129PT 
KingdomBacteria 
Replicon accessionNC_008309 
Strand
Start bp1428263 
End bp1430725 
Gene Length2463 bp 
Protein Length820 aa 
Translation table11 
GC content42% 
IMG OID638104820 
Productautotransporter protein YapE 
Protein accessionYP_719459 
Protein GI113461390 
COG category[M] Cell wall/membrane/envelope biogenesis
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG3468] Type V secretory pathway, adhesin AidA 
TIGRFAM ID[TIGR01414] outer membrane autotransporter barrel domain 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCAATAG CGGTACTTCT ACAGTTACCG TGCAAAATGC CAACATTAAA GAAAGGCGGT 
CAAATTACCG GTCTTTATGT CGGGACAAAC ACAGATGCAG ATACTAGCTA TACCTCAACC
GGTTTAACTA AAATCCTTGC ACAAGACATT AGCACGAGCA AAACTGCCAG CGACTACATT
ATGGGGGTTT ATGTATTTGG TCCAAAATCC AAAGTCATCT TAAATGATTC AGAGATTAAA
GTGGTTTCTA AAGGGCAAAA TTCCTTTACG CTAAAAATTG GCAACTTTGA AAATAACGGT
AAAAGCTATA AAGGCGAAAT TATCTCGACC GGTAAAATGC AATTAGACAG CACCGAAGCA
ACTAATGCAC CGACTATTTT ATTAGTCGCT GACGATTCTA AATTAGATGC CTCGGCAGAT
ACTGCCAGTG CGGAAATTAA ATCGGCAAAC AGTGCCGTTG TATTTGGAGT TACAGATTTG
GTGTATAAAA ATGCACTTGG AGGCATTGGA GGTAATTTAA GTCGCAATAA ATCCGCTAAA
GATCAATCTG TTAAATTAAA TAATGCAGTG ATTTCAACGA CATCAGAAAA TGCCAGTTTG
ATTAAAGCAG CCAGTGCATT AAATGTAGAT TCTTTAGCCA ATAGCAAAGC AGGACTTGGT
TGGAGTAACG GCACTTTCAC CACCAAAGGC GACTTTACCC TTTCTGGCGA AAAATCTCTG
GCAACAGCAG CCAAAAACGG TTGGTTGTTT GAAGTTGATG ACGGCTCGGA ATTGACCGCA
CTTATCAATA AAAAAGCGAA AGTTGTCGGT CTTTCCAGCA AAAATACCTC AGGTACGCTG
AATATCACTC TTGATGATGC GACTTGGGAG CTGCAAGCAA AAGAAAATGG TGTACCTACA
AGTACATTGA ATAAATTAAC CCTAAACTCT CATGCTATCT TAGATGCAAG TAAGCCAACA
GATACTGCCA GCACTAAAGC ACAATATGAC ATCCAACTCA CCTCAGACGC TACTAAAGAA
GACGGCACAT TAAATAACGG CGGTATCATT ACCCTAGCCA ACAACAGCTT CAACGATATT
TTAACTATCA AAGGAAATTA CGAAGGCAAA AATGGTGTTT TGAAAGTAAA TACTGAATGG
AATTCACCGG GCGATGATAA CGGAGCAAAT GCCGCCAGTG ACTTATTGGT TATCAAAGGA
AATGCGTCCG GTAACACAAC AGTAAAAGCC ATTAAAGCTG ACGGTACTGA AGATGTGATT
GACGGTAACA TTGGTAGTAT TGCCGAAGAT TTAAACAAAA ATAGTGCGGT TCTGATTAGG
GTTCATGGAA CAGATAACGG TAATGATGTA GCCGACACAG CCGAAGGGGG TTACAAATAC
CGTAGCACCT TTACCGGTGA AGCTAGAACC ACAGGGGCAG GGGTGTTAAA ACTCGCTTCC
CGTAAAAACA ATAACGGTCA TACCGAGTAC TTTTGGACAT TAACATCGAT TAACACAAAC
AATATCAACC TTGATCCAGT TGTTCCAGCG TATGTGCTTG CACCCAAAGC TGGTTTGGAA
TTGGGTTATA CCACATTGTC AACCCTTCAC GAACGCCGTG GCGAAAACCA AACTTCAAAG
GCTCAAAATC AAACATGGGG ACGAATTTTC GGCAAACATT CAGAGCTGAA CGGCAAAACC
CGTTTAGGCA CACAACACAA TATCTATGGT TTTCAATTTG GGCATGATTT TGCGATTCAA
CATACAGAAG AGGGCGATCT TCGCTTAACT GGTGGTTATG TGAGCTATGG CATAATGAAT
TCTACTTACA GTGACCGTCT TGATGATCAA CCCCAAACTG GTAAAGGCAA ACAAAAAGGC
TGGAACTTAG GTTTAACGCA TACTCGTTAT GCCCCGAGCG GAGCATATGT TGATTTAGTG
GGTCAAATCG GTTTTTTAAA TAACCAATTC AATGCCCGTA ATGGTGTAGA AGTAAAACAA
AAAGCTACCG CTCTTGCATT GTCAGCGGAA ATCGGACTCC CTTATGCCCT GCGTGAATAC
CCAACCAAAG ATGTGTGGTT AATCGAGCCG CAAGCCCAGT TGGTGTATCA AATGTTAAAA
CTTAACAGCT TTAAAGATGA TGTCAAATAC ATTCAAGGCG GTTACCATCA CGGTTTGCGT
GGTCGTTTAG GTGTGCGTGC GGTTTATAAC GTTCAGTCGG TGGAAGGTAA ATACCGCCCG
AACAGCGTTT ATATAACTGC CAACGTACTG CATGACTTCA TGAATGGAAA AGGTGTCACC
ATCGGTCAAG ATAAAGTAAA AGAAACCTTG GCTAAAACTT GGGCAGAAGT CGGTGTAGGC
GGACAGTTAC CAGTAGGCAA ACAAAGCCTT GTGTACGCTG ATGTCCGTTA CGAACACAGC
CTAAGCGGTA CAAAGCATGA AGGATATCGT GGCACAGTAG GCTTTAAATA TACTTGGAAA
TAA
 
Protein sequence
MPIAVLLQLP CKMPTLKKGG QITGLYVGTN TDADTSYTST GLTKILAQDI STSKTASDYI 
MGVYVFGPKS KVILNDSEIK VVSKGQNSFT LKIGNFENNG KSYKGEIIST GKMQLDSTEA
TNAPTILLVA DDSKLDASAD TASAEIKSAN SAVVFGVTDL VYKNALGGIG GNLSRNKSAK
DQSVKLNNAV ISTTSENASL IKAASALNVD SLANSKAGLG WSNGTFTTKG DFTLSGEKSL
ATAAKNGWLF EVDDGSELTA LINKKAKVVG LSSKNTSGTL NITLDDATWE LQAKENGVPT
STLNKLTLNS HAILDASKPT DTASTKAQYD IQLTSDATKE DGTLNNGGII TLANNSFNDI
LTIKGNYEGK NGVLKVNTEW NSPGDDNGAN AASDLLVIKG NASGNTTVKA IKADGTEDVI
DGNIGSIAED LNKNSAVLIR VHGTDNGNDV ADTAEGGYKY RSTFTGEART TGAGVLKLAS
RKNNNGHTEY FWTLTSINTN NINLDPVVPA YVLAPKAGLE LGYTTLSTLH ERRGENQTSK
AQNQTWGRIF GKHSELNGKT RLGTQHNIYG FQFGHDFAIQ HTEEGDLRLT GGYVSYGIMN
STYSDRLDDQ PQTGKGKQKG WNLGLTHTRY APSGAYVDLV GQIGFLNNQF NARNGVEVKQ
KATALALSAE IGLPYALREY PTKDVWLIEP QAQLVYQMLK LNSFKDDVKY IQGGYHHGLR
GRLGVRAVYN VQSVEGKYRP NSVYITANVL HDFMNGKGVT IGQDKVKETL AKTWAEVGVG
GQLPVGKQSL VYADVRYEHS LSGTKHEGYR GTVGFKYTWK