Gene YpsIP31758_2044 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYpsIP31758_2044 
Symbol 
ID5386080 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pseudotuberculosis IP 31758 
KingdomBacteria 
Replicon accessionNC_009708 
Strand
Start bp2351366 
End bp2356213 
Gene Length4848 bp 
Protein Length1615 aa 
Translation table11 
GC content41% 
IMG OID640865028 
Productadhesin/hemagglutinin 
Protein accessionYP_001401017 
Protein GI153946979 
COG category 
COG ID 
TIGRFAM ID[TIGR01901] filamentous haemagglutinin family N-terminal domain 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.105561 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAAAG ATAAATTCAA ACTTTCTCCT GCAGGGAAAT TAACCGTTTT ACTTTCATTG 
ATTCTAACGC CAGTAACGAT TAGCCATGCT GCTGAAATAG AGGCCGCTGG GAGAACGTAT
CTGAGGGGGG GGGAATATAT TCCAGATGTT CATAGCAATC CTGATGGCAT TAGTGTAATC
AATATTACGT CTCCTTCGGA ACAAGGCTTA TCGCATAATC AATATATGGA GTTTAATGTT
AATGAACATG GGGTGGTGTT TAATAATTCA CTTGAGACGG TTGCAAAAAA TGGGATAACC
TATCAGGATA ATAGAAATTT ACGAGGTTCA ACAGCGCGTA TTATTTTAAA TGAAGTTGTG
GGGTCGAATA TTTCGATATT GAACGGACAT CAAGATATCA TAGGCATGCC CGCAGATTAT
ATTCTGGCGA ATGCTAATGG CATTAGCTGT CAGGGGTGCA GTTTTTCACC CGAATTTAAA
AATGTCACAT TAGCCGTCGG TAAAGTAAAT GTAGATCGCG GTGATTTACG TAGTATAAAT
ACCCTTGGGA ATGCTAATTT ATTAAATGTA TCAACTGGTC AGGGGGGTAT TGCTGATGCA
TTGACACTTA TTGCACCTGT TATTAATACC AGTGGTCATA TTAAAGTCAA AGGCGATGCG
GAGTTCATTG CGGGTCAAAA TACGTATCAT ATTGAGAAGG ATAAAACCTC ACATGTCGTG
GCTGGTGATA GTGAAATAAA AACAATTGAC GGCTACTATC TTGGCAGCAT TTCTGCTAAC
CGTATTAATT TACTTGATAC AAGAGAAGAT AATAATATTA ATTTATTTGG TGATGTGGTC
GCTGAGGAAG CTGAGGTGGT CACCAATGGC ACATTACGAT TAAGAGGAAA GGGAGATGGT
ACACAGAGGA TAACCATAAA AAATGGAATG AAAATATCGA GTAATAATAT TGATATCACC
CAAAACATGA CTGCTGATGA AATTAAATTT TTTAACGTAA AAGAAAATAA AATAAATAAC
ACCATTATTA ATGCTGGACG TATTGATTTT GTTGCTGTTG GCGATGTTAA ATTGTCAGGT
GCTTCGTTTT TAAGTGATGA TGATCTCTCA ATTACCGCAA AGAGTTTACA TGTTGATTCA
CATTTGGTTA AACACTCATC GACGACAGGA GTGGTAGTTA CTGAAGTGAA TCCTCTTGAT
GCACCAACTA AGAAAGTAGA AAGTGAATAT AATGACCATG TTTCACAAGC CAGTTCAATT
ATTAGTCGGG GAAATGTTAA GTTATATGGG CAGGATGATT TAGTATTAAA AAATGCTAAT
GTCCAAGCCT ATGGTAATAT TGATTTATTT TCTGAAGGTG ATATTCATTT AAATGGTTCA
ACTGAAACTA ATACTACAAT AAATAATATT ACCTATATTA ATCATGATGA GGACTTGAAA
AGAGGTCATA ATAATATAAA AACCGTTACT GAAAGGTTTG TCCCTCTTAA TATGAAGGCG
AGGGGGGATA TTAATATACA GAGCAAAAAC ACGCATATCC ATGGTGCCCA AATAGCTTCA
GAGGGTGAGT TATCAATAGA TGCCAAGGAC AACGTCTATA TTGGGGTGGC AAGTATGTTA
ACTTCAGAGT TTAAAGATAT TAACTACGAT CAGTGGGGGG GGGCTCATGG TTCGGAAAAA
GACAAGATAG AAGAGTATGT TTATACCGGT AATAAGTCAG ATTTAGTCGG AGGGCGGATA
AAAATCACTG CGGGTAATGA TGCCAGAATA TTTGGTGGAA AAATAAATGG AGTAGATGGT
GGAGAGATCT CAGCTCAAAA TTATCTGAGT ATTGATGGTG TGATGGGGAC ACGCAGCTTT
AAAAGGGATC AAAAAACTGG TGGTATAATG CATACCAACA AGAGTACTTC TACTGCCGAT
AATCACTATG AAAAATTTAT CGACAGCGAA ATTAGCTCTG ATGGAGATTT CAGCATATTC
AGCAAAAAAG ACCTTTATAT TGATGGTAGT CGAATTAATG TAAACGGAAA ACTAGATATT
AATGCTAATG AGAAGTTGAC CGTACAGGCT GCGCGTCAGC AACAAAAAAT AGATGAGGAA
AAAACCCGCC TCAGCATTGA GTGGTTTGCC AAAGAAAGTA GTGATAAACA GTATCGTGCG
GGCTTTCTCA TTAATCATCA AAAAGACACT GAAAATACGC TTAGAGATGA GCACCAAATT
GCAACATTAA GTGCAGAACA GATTAAACTC ACTGCCGGAG ACGATATTAC GTTTTTTGGT
ACGGCCATTA GTACCAGCGA GGGTGATTTT ACTGTAAAAA CAGAAAAAAA CATTGGCTTC
TTTAGTGCCA AAAATCGCGC ACTGATTAAT AAAAATAGGG TTGAAAATAG TGGTGGTTTC
TATTACACAG GAGGAATCGA TAAAATTGGC AATGGGGTGC AATATACCCA TATCGATAGT
GATAGCCACC ATGATATTGA AAATAATCTT GTGGTCAAAA CCCATATTAA CGGCAATATG
AATATTAAGG CTGGCGGGGA TATTACTCAA CAGGGTGCAC AGCATCAGGT TGCCAAAAGT
TATCATGCCG ATGCAACGCA TATCAATAAT ATGGCAAGCC ATAATATTGA GATTACAAAA
ACAAATAAAT TGCAGGTTGA TGCGGGTATT GGTTTTAATA TTGACTACAG TGGATTTACC
CGCCCGATAG AGAAATCAAT AAAAACCCCT GCGAATACGC TCGATAATAT TGGTGGAAGA
GGGAGCTTGC CAGGGATTGC AGATCCAACT GCGGGCATAG ATCTTGAGGC ATCGGGCAGC
AATACCAAAT CAACAGAGAG TAATTCACTG GCGTTGGTTA CCACGATTAA AGCTCAGGAT
ATTGAGCTGG TGGCTAAAAA AGATGTGTTG GATGAAGGGA CACAGTATCA CGCCACTCAT
GGGGCTATGA AATTGAAGGC AGAGCGTCAT TTCAGTAACG CCGCAGTCAA TAGTGAAAAG
CAAACCACCC AAGGAGAAAA AGGTGAGGCG GGGGGACGTC TTGGTATCAC TGCTACTAAA
GATATAAAAG TCAGCTTGGG GGCTAAGGCT GAAACCAGCG AGGATGAGAC CTATGCTGAA
CGGATGTTAT CGGCGAACAT TAAAGCTCAG CAGGGGGTTC ATATTCGTAC TACCGGAGAT
ATCTATCATT ATGCAACTGA TATTAATGGC GGCGCCGGGG ATATTGATAT TAAGGCCGGC
AAGAACCTCT ATTTTGATCA GGTGCAGGAT AGCCAGCGCA GCAGTAATAT AAAAATTTCG
GGTAATGGGA AACTAAGTTT TGGCAAAGAG GCTAGTGGCA AGAGTTTCCG TCTCGCAGGT
GGTGGTGGAT ACGAAAAAGG CCAAAGTCAG CGAACCGAGG CAGGGGTCAG CCAAATTAAT
ACTCAGGGCA ATGTATTACT TGAGGCTGGT GCGGATCTTA CCGCAAAAGG TATACAAATA
GGCCGTCCGG ATGTTCAAGT AAATAATGTT TCTTTGGTTG CTGCTGGGCA GGTCAACATG
CCGGCAGCGG TGTTTGGTAG TGTCGATATT AATGATGGCG TGCTGGCTGA TTTCCGACTG
GGAGGCAAGC GTTCTACAGA CAGTACCTCA AAGGAAAATG TTGTAGGAGG CGGCGTTAAG
GTTGATCAAG TCAATCAGTC TGTTTCTGAC CAGCAGGGGG GGCATATTTA CAGTAAAAAT
ACGGTGTCTA TTAAATCAGA CAGTGACAGC AATCAGGCTA TTCATTTACA AGGGCTAAAA
GTTATTGCAC CAAAAGTTGA TTTGAGTGCA CGCCATGGTG GGGTATTTAT TGAGTCTGCC
TTAAGTGAAT TGCCAAAAGA GAATTGGACT TTCGGGGTTG ACCTGGATCT GGTACTGAAA
AGCGCATCGC CTAAAAAAGC AGACGGAACG ATTGACAGCG ATAAATCCAG TAAAAGTAAT
TATAAGGGGG CGGGAGTTAA AGTGATGGTT GATCTGCAGG ATGCATTTAG ACATCAAAAT
ACTTATATCA ATACGGCACA TTTTTCTCTG AATACCAAAA AAGATGCGGT AATGAAAGGG
GCGCGTATTG ATACCACACA GGCTAATATT AATGTGGGTG GCGATCTAAC AATCGAAAGT
GTCAAAAGTA AGGTAGATAG CGTCAAGGTG GATGTTGAGT TATCCCTGAG CCATACCAAC
GATAAAGGTA GCAGTGTTAC ATCTAAACTA TCGAAGCTCG GTACTAAGAA GTTCGAGAAG
GATATTAAAG AAAAAATAGA TTCAGGTATT AAAAAAGGTG AGTTGATGTA TAACAAAAAA
TCACCACCCA AAGACACAAT GGGCGGGGTT GGCTTCAGTA AAGAGTCTGG CAGCGTATAC
CTTCCAACAT TATCGGCTGA AACGAAATCG CGTAATTTTA GTGATGCAAC CGCCCGCTTT
ATGGGGGAGC AATTTAAGGG GGCCCTCACC AATCCGGAAG GTGCACAAGG GCATGCAAAG
CTGGATGTGC AGCTAGTCAA TAATGATGCT GTCGCTGAGC AATCCGGTAT TTTCGGTGAC
AGTGATGTAG TGATTACCGT GAAAGGCACG ACGAAACTGC ACGGGGCAGA AATAAGTAGT
GGCAGTAAAG ATGTTATCCT GAAGACGAAT AAGCAAGAGT TGAGTTCTAT TAAAAACAGT
TATTATAAAA ACGGTGGCGG GTTTAATGTT GCCCCAACCC TATTGGGGAC AATGAAACAG
GGTGTTAACG GTGAGTTCCC CTATGTCAAT ATTCCTGATG GCACTTCTCA TGACGATGAG
TCTATCGGGC AAGTGGTGAA TAAAAAGCAC ACCGGCGTAG GGGGTTAA
 
Protein sequence
MKKDKFKLSP AGKLTVLLSL ILTPVTISHA AEIEAAGRTY LRGGEYIPDV HSNPDGISVI 
NITSPSEQGL SHNQYMEFNV NEHGVVFNNS LETVAKNGIT YQDNRNLRGS TARIILNEVV
GSNISILNGH QDIIGMPADY ILANANGISC QGCSFSPEFK NVTLAVGKVN VDRGDLRSIN
TLGNANLLNV STGQGGIADA LTLIAPVINT SGHIKVKGDA EFIAGQNTYH IEKDKTSHVV
AGDSEIKTID GYYLGSISAN RINLLDTRED NNINLFGDVV AEEAEVVTNG TLRLRGKGDG
TQRITIKNGM KISSNNIDIT QNMTADEIKF FNVKENKINN TIINAGRIDF VAVGDVKLSG
ASFLSDDDLS ITAKSLHVDS HLVKHSSTTG VVVTEVNPLD APTKKVESEY NDHVSQASSI
ISRGNVKLYG QDDLVLKNAN VQAYGNIDLF SEGDIHLNGS TETNTTINNI TYINHDEDLK
RGHNNIKTVT ERFVPLNMKA RGDINIQSKN THIHGAQIAS EGELSIDAKD NVYIGVASML
TSEFKDINYD QWGGAHGSEK DKIEEYVYTG NKSDLVGGRI KITAGNDARI FGGKINGVDG
GEISAQNYLS IDGVMGTRSF KRDQKTGGIM HTNKSTSTAD NHYEKFIDSE ISSDGDFSIF
SKKDLYIDGS RINVNGKLDI NANEKLTVQA ARQQQKIDEE KTRLSIEWFA KESSDKQYRA
GFLINHQKDT ENTLRDEHQI ATLSAEQIKL TAGDDITFFG TAISTSEGDF TVKTEKNIGF
FSAKNRALIN KNRVENSGGF YYTGGIDKIG NGVQYTHIDS DSHHDIENNL VVKTHINGNM
NIKAGGDITQ QGAQHQVAKS YHADATHINN MASHNIEITK TNKLQVDAGI GFNIDYSGFT
RPIEKSIKTP ANTLDNIGGR GSLPGIADPT AGIDLEASGS NTKSTESNSL ALVTTIKAQD
IELVAKKDVL DEGTQYHATH GAMKLKAERH FSNAAVNSEK QTTQGEKGEA GGRLGITATK
DIKVSLGAKA ETSEDETYAE RMLSANIKAQ QGVHIRTTGD IYHYATDING GAGDIDIKAG
KNLYFDQVQD SQRSSNIKIS GNGKLSFGKE ASGKSFRLAG GGGYEKGQSQ RTEAGVSQIN
TQGNVLLEAG ADLTAKGIQI GRPDVQVNNV SLVAAGQVNM PAAVFGSVDI NDGVLADFRL
GGKRSTDSTS KENVVGGGVK VDQVNQSVSD QQGGHIYSKN TVSIKSDSDS NQAIHLQGLK
VIAPKVDLSA RHGGVFIESA LSELPKENWT FGVDLDLVLK SASPKKADGT IDSDKSSKSN
YKGAGVKVMV DLQDAFRHQN TYINTAHFSL NTKKDAVMKG ARIDTTQANI NVGGDLTIES
VKSKVDSVKV DVELSLSHTN DKGSSVTSKL SKLGTKKFEK DIKEKIDSGI KKGELMYNKK
SPPKDTMGGV GFSKESGSVY LPTLSAETKS RNFSDATARF MGEQFKGALT NPEGAQGHAK
LDVQLVNNDA VAEQSGIFGD SDVVITVKGT TKLHGAEISS GSKDVILKTN KQELSSIKNS
YYKNGGGFNV APTLLGTMKQ GVNGEFPYVN IPDGTSHDDE SIGQVVNKKH TGVGG