Gene Information |
Locus tag | HS_1247 |
Symbol | |
ID | 4240758 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haemophilus somnus 129PT |
Kingdom | Bacteria |
Replicon accession | NC_008309 |
Strand | + |
Start bp | 1428263 |
End bp | 1430725 |
Gene Length | 2463 bp |
Protein Length | 820 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 638104820 |
Product | autotransporter protein YapE |
Protein accession | YP_719459 |
Protein GI | 113461390 |
COG category | [M] Cell wall/membrane/envelope biogenesis [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG3468] Type V secretory pathway, adhesin AidA |
TIGRFAM ID | [TIGR01414] outer membrane autotransporter barrel domain |
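
The coordinate and length fields above are mutually consistent, which can be checked with a few lines of arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the final codon of the CDS is a stop codon. The function name `cds_lengths` is hypothetical and is not part of any database tool.

```python
from typing import Tuple


def cds_lengths(start_bp: int, end_bp: int) -> Tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumptions (not stated in the record): coordinates are 1-based and
    inclusive, and the last codon is a stop codon that is not translated.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1  # drop the stop codon
    return gene_len, protein_len


# Values taken from the Start bp and End bp fields of this record.
gene_len, protein_len = cds_lengths(1428263, 1430725)
print(gene_len, protein_len)  # 2463 820
```

The result agrees with the Gene Length (2463 bp) and Protein Length (820 aa) fields.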
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCAATAG CGGTACTTCT ACAGTTACCG TGCAAAATGC CAACATTAAA GAAAGGCGGT CAAATTACCG GTCTTTATGT CGGGACAAAC ACAGATGCAG ATACTAGCTA TACCTCAACC GGTTTAACTA AAATCCTTGC ACAAGACATT AGCACGAGCA AAACTGCCAG CGACTACATT ATGGGGGTTT ATGTATTTGG TCCAAAATCC AAAGTCATCT TAAATGATTC AGAGATTAAA GTGGTTTCTA AAGGGCAAAA TTCCTTTACG CTAAAAATTG GCAACTTTGA AAATAACGGT AAAAGCTATA AAGGCGAAAT TATCTCGACC GGTAAAATGC AATTAGACAG CACCGAAGCA ACTAATGCAC CGACTATTTT ATTAGTCGCT GACGATTCTA AATTAGATGC CTCGGCAGAT ACTGCCAGTG CGGAAATTAA ATCGGCAAAC AGTGCCGTTG TATTTGGAGT TACAGATTTG GTGTATAAAA ATGCACTTGG AGGCATTGGA GGTAATTTAA GTCGCAATAA ATCCGCTAAA GATCAATCTG TTAAATTAAA TAATGCAGTG ATTTCAACGA CATCAGAAAA TGCCAGTTTG ATTAAAGCAG CCAGTGCATT AAATGTAGAT TCTTTAGCCA ATAGCAAAGC AGGACTTGGT TGGAGTAACG GCACTTTCAC CACCAAAGGC GACTTTACCC TTTCTGGCGA AAAATCTCTG GCAACAGCAG CCAAAAACGG TTGGTTGTTT GAAGTTGATG ACGGCTCGGA ATTGACCGCA CTTATCAATA AAAAAGCGAA AGTTGTCGGT CTTTCCAGCA AAAATACCTC AGGTACGCTG AATATCACTC TTGATGATGC GACTTGGGAG CTGCAAGCAA AAGAAAATGG TGTACCTACA AGTACATTGA ATAAATTAAC CCTAAACTCT CATGCTATCT TAGATGCAAG TAAGCCAACA GATACTGCCA GCACTAAAGC ACAATATGAC ATCCAACTCA CCTCAGACGC TACTAAAGAA GACGGCACAT TAAATAACGG CGGTATCATT ACCCTAGCCA ACAACAGCTT CAACGATATT TTAACTATCA AAGGAAATTA CGAAGGCAAA AATGGTGTTT TGAAAGTAAA TACTGAATGG AATTCACCGG GCGATGATAA CGGAGCAAAT GCCGCCAGTG ACTTATTGGT TATCAAAGGA AATGCGTCCG GTAACACAAC AGTAAAAGCC ATTAAAGCTG ACGGTACTGA AGATGTGATT GACGGTAACA TTGGTAGTAT TGCCGAAGAT TTAAACAAAA ATAGTGCGGT TCTGATTAGG GTTCATGGAA CAGATAACGG TAATGATGTA GCCGACACAG CCGAAGGGGG TTACAAATAC CGTAGCACCT TTACCGGTGA AGCTAGAACC ACAGGGGCAG GGGTGTTAAA ACTCGCTTCC CGTAAAAACA ATAACGGTCA TACCGAGTAC TTTTGGACAT TAACATCGAT TAACACAAAC AATATCAACC TTGATCCAGT TGTTCCAGCG TATGTGCTTG CACCCAAAGC TGGTTTGGAA TTGGGTTATA CCACATTGTC AACCCTTCAC GAACGCCGTG GCGAAAACCA AACTTCAAAG GCTCAAAATC AAACATGGGG ACGAATTTTC GGCAAACATT CAGAGCTGAA CGGCAAAACC CGTTTAGGCA CACAACACAA TATCTATGGT TTTCAATTTG GGCATGATTT TGCGATTCAA CATACAGAAG AGGGCGATCT TCGCTTAACT GGTGGTTATG TGAGCTATGG CATAATGAAT 
TCTACTTACA GTGACCGTCT TGATGATCAA CCCCAAACTG GTAAAGGCAA ACAAAAAGGC TGGAACTTAG GTTTAACGCA TACTCGTTAT GCCCCGAGCG GAGCATATGT TGATTTAGTG GGTCAAATCG GTTTTTTAAA TAACCAATTC AATGCCCGTA ATGGTGTAGA AGTAAAACAA AAAGCTACCG CTCTTGCATT GTCAGCGGAA ATCGGACTCC CTTATGCCCT GCGTGAATAC CCAACCAAAG ATGTGTGGTT AATCGAGCCG CAAGCCCAGT TGGTGTATCA AATGTTAAAA CTTAACAGCT TTAAAGATGA TGTCAAATAC ATTCAAGGCG GTTACCATCA CGGTTTGCGT GGTCGTTTAG GTGTGCGTGC GGTTTATAAC GTTCAGTCGG TGGAAGGTAA ATACCGCCCG AACAGCGTTT ATATAACTGC CAACGTACTG CATGACTTCA TGAATGGAAA AGGTGTCACC ATCGGTCAAG ATAAAGTAAA AGAAACCTTG GCTAAAACTT GGGCAGAAGT CGGTGTAGGC GGACAGTTAC CAGTAGGCAA ACAAAGCCTT GTGTACGCTG ATGTCCGTTA CGAACACAGC CTAAGCGGTA CAAAGCATGA AGGATATCGT GGCACAGTAG GCTTTAAATA TACTTGGAAA TAA
|
Protein sequence | MPIAVLLQLP CKMPTLKKGG QITGLYVGTN TDADTSYTST GLTKILAQDI STSKTASDYI MGVYVFGPKS KVILNDSEIK VVSKGQNSFT LKIGNFENNG KSYKGEIIST GKMQLDSTEA TNAPTILLVA DDSKLDASAD TASAEIKSAN SAVVFGVTDL VYKNALGGIG GNLSRNKSAK DQSVKLNNAV ISTTSENASL IKAASALNVD SLANSKAGLG WSNGTFTTKG DFTLSGEKSL ATAAKNGWLF EVDDGSELTA LINKKAKVVG LSSKNTSGTL NITLDDATWE LQAKENGVPT STLNKLTLNS HAILDASKPT DTASTKAQYD IQLTSDATKE DGTLNNGGII TLANNSFNDI LTIKGNYEGK NGVLKVNTEW NSPGDDNGAN AASDLLVIKG NASGNTTVKA IKADGTEDVI DGNIGSIAED LNKNSAVLIR VHGTDNGNDV ADTAEGGYKY RSTFTGEART TGAGVLKLAS RKNNNGHTEY FWTLTSINTN NINLDPVVPA YVLAPKAGLE LGYTTLSTLH ERRGENQTSK AQNQTWGRIF GKHSELNGKT RLGTQHNIYG FQFGHDFAIQ HTEEGDLRLT GGYVSYGIMN STYSDRLDDQ PQTGKGKQKG WNLGLTHTRY APSGAYVDLV GQIGFLNNQF NARNGVEVKQ KATALALSAE IGLPYALREY PTKDVWLIEP QAQLVYQMLK LNSFKDDVKY IQGGYHHGLR GRLGVRAVYN VQSVEGKYRP NSVYITANVL HDFMNGKGVT IGQDKVKETL AKTWAEVGVG GQLPVGKQSL VYADVRYEHS LSGTKHEGYR GTVGFKYTWK
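
As a quick sanity check, the start of the gene sequence can be translated and compared with the start of the protein sequence. The sketch below is illustrative only: `PARTIAL_CODON_TABLE` and `translate_prefix` are hypothetical names, and the table lists only the codons found in the first 30 bases rather than the full translation table 11.

```python
# Partial codon table: only the codons that occur in the first 30 bases of
# the gene sequence. These assignments are the same in the standard genetic
# code and in translation table 11.
PARTIAL_CODON_TABLE = {
    "ATG": "M", "CCA": "P", "ATA": "I", "GCG": "A", "GTA": "V",
    "CTT": "L", "CTA": "L", "CAG": "Q", "TTA": "L", "CCG": "P",
}


def translate_prefix(dna: str) -> str:
    """Translate a short DNA string codon by codon.

    Spaces (as used in the record's 10-base blocks) are removed first.
    Raises KeyError for any codon missing from the partial table.
    """
    dna = dna.replace(" ", "").upper()
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    codons = [dna[i:i + 3] for i in range(0, usable, 3)]
    return "".join(PARTIAL_CODON_TABLE[c] for c in codons)


# First three 10-base blocks of the Gene sequence field.
prefix = "ATGCCAATAG CGGTACTTCT ACAGTTACCG"
print(translate_prefix(prefix))  # MPIAVLLQLP
```

The output matches the first block of the protein sequence, MPIAVLLQLP. Translating the full gene would need a complete codon table for translation table 11.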