Gene YpsIP31758_2979 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYpsIP31758_2979 
Symbol 
ID5387181 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pseudotuberculosis IP 31758 
KingdomBacteria 
Replicon accessionNC_009708 
Strand
Start bp3352172 
End bp3358168 
Gene Length5997 bp 
Protein Length1998 aa 
Translation table11 
GC content50% 
IMG OID640865985 
Productalpha-2-macroglobulin domain-containing protein 
Protein accessionYP_001401941 
Protein GI153950405 
COG category[R] General function prediction only 
COG ID[COG2373] Large extracellular alpha-helical protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGATTTGT TGAGGTTTCT GCTGATCTCC CCCTTCGCCC TTATCAAGGG ACTCTATCGG 
TTAAGCGCTT ATTTTTTACG GTTAGTGGGC CGCCTATTGC GCCCAGTCGT CGGTAACCTG
AATTGGCGCG CGCCACAATG GATGACGAAA ACTGCCAACG GGCTACACTG TGCTTTCAAC
CGTTCAGAGC AATGGGTCGC TAAGCATCCC AAAGGGATCA GTGCGGCAAT AGTGCTGTTG
ATGGCTGCCG CCAGCGCCGC ATTCTATGGC TATCACTGGT ATCTTAATCG GCCCCAACCG
ATTGAACCGG CACCTATGGT TTATCAGGAA ACCAGTATCA GAGTCTCGGC ACCAAGAACG
GTTAACTATC AGGCGCAAAA ACCAGAAGCC CAGCCGCTTA GCCTTAATTT TATGCATTCA
GCCGCACCTA TCACCGCGAT GGGTCAGGTC ATCGATCAGG GCATCTCATT AACCCCCGCG
ATGGAGGGTG AATGGAAATG GGCAACTGAA CGCACGTTGG TGTTTACCCC TAAAAAAGCC
TGGCCGATGG GCGCTAACTA CCAAATTACG ATCGATGCAG AAAAACTTTT AGCACCACAG
ATTAAGCTCA ATCAGACCGA ACTTAATTTC ACAACGCCAG CTTTTGCTTA TCAATTGGAA
AAAGCGGAAT ATTATCAGGA TCCGCAAGAG GCCCAAAAGC GCAGCACCAT TTTCCATGTG
CAATTTAATG CCCCGGTTGA TGTTGCCAGC TTTGAAAAAC AGATCCTCTT GGGATTGGTC
GAAGGTAAAT CCAAGTCAGA GAAGAAACTT AATTTCTCCG TCGTTTATGA TGAGAAAAAG
CTTAATGCCT GGATACATTC GCAACCCTTG ATGCCAATGG ATAAAGGCGG TTCGGTCCAT
CTATCGATTA ATAAAGGGGT GAATGCCAGT GTTGCCGCCA CGCCTACGAC ACAGGCACAG
AATAAATGGG TATCCGTCCC TAACCTATAT AGCCTGGCGG TTAATAGTAT TAATGCCACG
TTGGTTGAGT CAGATAACAA TAATGGTGAG CGGGCCTTAA TTATTGCTAT CAGCGACGCG
GTTAAAGATA AAGAGATCAA AAATGCGGTC AAAGCCTGGT TACTGCCGCA ACATAATTTT
CAAGCGAAAG AGAGCGCCAA AACATCAACC GATTTCTATC CTTGGGATAT GGATGATATT
GACGATAATC TGCTGCAACA ATCAACGCCG CTGGCGCTGA CCCTCAATGA GGCCGAGCAA
GAGTATCAGC CAATATTCAG CTTTAAGTTT GATGCCCCTT CCTATCGCAC ACTGCTGATC
GAGGTTAACA ATAGCCTGAC ATCGGTGGGC GGTTATAAAA TGCCGGAAAA AATCTACCAA
ATAGTCAGGG TTCCCGATTA CCCTAAGACG CTGCGCTTTA TGTCACAAGG CTCGTTATTA
TCGATGCAGG GTGATAAGCA GATCAGCGTC GCCGCCCGTA ATATGACTGG CATGAAGCTG
GATATTAAGC GGGTTATTCC TAGCCAGTTA CAACATATTG TGTCATTTAA AAGCAGCGAA
TATTCATCAG CTCACTTTAA CCGCCTGAGT GATGAATATT TTACTGAACA CTTCCAGTAC
CAAACCGCGC TGAATAATGA CAACCCCGGC GAGATCAATT ATCAAGGGGT CGATCTGTCC
CGTTATCTTG CAAATAATCC GAGTGCTCGG CGTGGGGTGT TCTTACTTAC CCTGTCAGCT
TGGGATCCGG AGAAAAGGGA TAATCAGCAA CACAGCGAGG AAGACTACGA CGAAGACCAG
GAATGGGTCG GCGATTCACG CTTTGTGGTG ATCACGGACT TAGGCATTAT CACCAAGCAA
TCGCAGGATA GATCCCGTGA TGTGTTTGTG CAATCCATTC ACTCGGGTCT GCCCACCGCC
GATGCCAAAG TCTCTGTGGT GGCAAAAAAT GGTGTGGTCT TACTGAGCCA AATCACCGAT
AGCAAAGGGC ATGTTCATTT TCCTGCGCTG GACGCCTTTA AAAATGAACG CCAACCGGTC
ATGTTCCTGG TGGAAAAAGA AGGGGATGTC TCCTTCCTGC CCACCCGAGC CACCTATGAC
CGTAACCTTG ATTTCTCACG TTTTGATATT GATGGCGAAG AGACACCGTC CGACCCACGT
ACTCTAAGCA GCTATCTGTT TTCTGACCGG GGAGTTTATC GCCCAGGCGA TCGCTTCAAT
ATTGGTCTGA TCACCCGGAC CGCCAACTGG GCTACCGCAC TCGATGGCGT CCCCCTGCGG
GCGGAGATCC GTGACCCACG AGATACCTTG ATGAGTACCC TGCCGATAAC CTTGGACAGC
AGTGGTTTCA ATGAGCTCAG CTATACGACC GGTGAAAACT CACCTACCGG TGAATGGAAC
GTCTATCTCT ATCTGGTTGG TAAGAATAAT GAAACGTCGA TGTTGCTGGG GCACACCACC
GTAAATGTTA AAGAGTTCGA GCCTGATCGC TTAAAAGTGC AACTGCAACT GACGCCAGAG
CGTCAACAAG GCTGGGTTAA ACCGCAGGAG CTGCAAGCCA ATATCAATGT ACAAAATTTA
TTCGGTACAC CAGCACAGGA GCGCCGTGTC ACCTCTAGAC TGATCTTGCG GCCAATGTAC
CCGAGTTTTG CCCCGTTCCC TGATTACCTG TTCTATGAGA ATCGTCATAA CAGCGATGGT
TTTGAGACTG AACTGGAAGA ACAAACGACC GATCTACAAG GGATGGCGAC CATCCCATTG
GATCTGAAAT CCTATGCTGA CGCCACCTAT CAACTGCAAT TGTTGTCGGA AGCCTTTGAA
GCGGGTGGAG GCCGCTCCGT GGCCGCGACT GCGCGGGTTC TGGTTTCACC TTACGACTCT
CTGGTTGGGG TGAAAGCCGA TGGCGATCTG AGTTATATCA ACCGTGATGC CGTGCGTAAG
CTGAATATTA TTGCTGTGGA CCCGAGCCTG AATAAAATTG CGCTGCCAGA CTTGAGTCTG
TCATTGATTG AGCAGAAGTA TATTTCAGTG CTAACCAAAC AGGATTCAGG CGTTTATAAA
TATCAATCAC GGCTAAAGGA GCAGTTGGTC TCAGAGCAAC CGCTAAAAAT CAGCCCGACA
GGGACGGATT TCACCCTAGT GACCCAGCAG CCTGGTGATT TTATTCTGGT GGTTAAGGAC
AGTCAGGGGC AGGTTCTGAA CCGTATTAGT TATACGGTGG CGGGTAACGC AAACCTGACC
CGCTCACTGG ATCGCAACAC CGAATTAAAG CTAAAACTGA ATCAGGCCGA ATATCTGCAA
GGCGAAGAAA TTGAGATTGC GATTAATGCA CCTTATGCCG GTAGCGGTCT GATCACGATA
GAAAAAGATA AAGTGTATAG CTGGCAGTGG TTCCACAGTG ATACCACCAG CTCTGTGCAG
AGAATCCGCA TCCCACCGGC AATGGAAGGC AATGGCTATA TCAACGTACA ATTCGTGCGT
GATGTGAATT CCGATGAGAT CTTTATGAGC CCACTGAGTT ACGGTGTGAT GCCATTTAAG
ATCAGTACCA AAGCGCGTCA GGCGGCTATC GAGTTAGCGT CGCCGTCAGT CATTAAACCG
GGTGAAGTGT TACCGATTAA AGTGACCACC GATTCACCAC AGCGCGTGGT GGTGTTTGCC
GTCGATGAAG GTATTTTGCA GGTGGCACGC TATCGCCTGA AAGATCCACT GGACTACTTC
TTCCGTAAAC GTGAACTGAG TGTACAGAGT GCACAAATTC TCGATTTGAT CCTGCCGGAA
TTCAGCAAGC TGATGGCACT GACCTCCGCA CCCGGAGGCG ATGCCGGGGA AGGGCTGGAT
CTGCACCTCA ATCCGTTTAA ACGCAAACAA GACAAGCCGG TGGCTTATTG GTCTGGTATC
ACCGAAGTGA ATGGTGAAAC CACCTTCAAT TACCCGATTC CCGACTATTT CAATGGTAAA
ATTCGCGTGA TGGCCATCTC TGCGACCCCT GATCGCATTG GTAAAGTCCA GACCTCGACC
ACCGTGCGGG ATAACTTTAT TCTGACACCG AATGTCCCCG CGATGGTAGC ACCGGGAGAT
GAATTTGATG TCACCGTGGG TGTGAGTAAC AACCTGCAAG GATTGAAGGG TAAAGCGGTT
GATATCACCG TGCGTCTGAC ACCACCGCCA CAACTGGAAG TGGTGGGTGA AGCGCAACAC
AGCCTGTCGC TGGCAGAAAA ACGTGAAACG CTTGTCAGCT TCCGCCTGCG CGCCCGTTCA
GCATTGGGTG ATGCTCCACT GGTGTTTGAT GCCAGCTATG GCTCTCAATC CAGCCGCCGG
ACGGTCAGTA CCTCGGTACG CCCGGCGATG CCATTCCGAA CGCAATCGGT GATGGGCCGG
ATGGAGGGTA ACAAGCATAC TGTGACCAAT CTGCGCCAGA TGTTTGATAA TTATGCTCAA
CGTCAGGCGA CCGCTTCCCA CTCACCGTTG GTCTTAACCC AAGGTCTGGC GCGGTACCTG
GCTGATTACC CGTACTACAG TTCTGAGCAA ATTGTCAGCC GCTCGATTCC GTTGATTATG
CAAAGCAAAC ATCCTGAAAT GGACAGTGCC CTCAATCAGA ATGAGGTCCG TGATCAACTG
AAAAACATGC TACGTATCCT GAGCTCTCGG CAGAATAGCA CTGGTGCAAT CGGTTTGTGG
CACGCCTCCC CTACCCCTGA TCCGTTTGTC ACACCTTATG TCGTGCAATT TCTGCTGGAA
GCGAAATCTG CCGGTTACAG CTTGCCGAAT GACATCTTGG AGGGGGCCAA CAACGCACTG
CGTCTGTTAG CGGCTCGACC TTATGATGAC CTTTACTCTC TGCGTTTGCG GGCCTTTGCT
GTTTACCTGT TGACCTTGCA GGGGGAGATC ACCACCAATA CTCTGGCATC GGTGCAAAGT
ACGTTACAGC AACTTTATCC TGACAGTTGG CAGACTGATC TGAGTGCCAT TTATCTGGCC
TCATCATACC GTCTGCTCAA AATGGATGAC GAAGCCAATA AACTGCTGCA ACCCACCTGG
AAACAACTGG GTAAAGCCTA CAGCAAGGCC TGGTGGACGC AGAATTATTT TGATCCACTG
GTGCAAGATG CAACCCGGTT GTATCTGATC ACTCGCCATT TCCCAGAGAA AGTCTCTTCT
ATTCCGCCAC AAGCACTGGA AAATATGGTG CTGGCACTGA GGGATGAGCA TTACACGACC
TATTCATCCG CGATGAGCAT TCTGGCACTG GAAAGTTACA CCAGCCAGGT AGCCGCCCAG
CAAGATACGC CAGAAACCCT GCAAATCATC GAGATCAGTA AAAGCAAAGG GATCGACCCT
AACGTTATCT CAACGCTGAA CGGCCTGTTC GTTCAAGGTG ATTTTACCGG TGAGGCTAAA
GCGATTCAGT TTAACAACTA TGCCTCGGCA CCCGCTTGGT ATGTGGTCAA TCAATCAGGC
TATGACCTTC AGCCACCAAA AGACGCCATC TCTAATGGGC TGGAAATCAG CCGCAGCTAC
ACCGATGAGC AGGGTAAGCC GGTGACCCAA GTCACCTTAG GGCAGAAAGT TAACGTGCAC
CTAAAAATCC GGGCTAACGC TAAACAAGGT CAAAATAATC TGGCGATTGT CGATCTACTG
CCGGGCGGTT TTGAAGTGGT ACAACAAACG GCACCTGAAC CAGAGTTTTA TGATAATCAG
GATGATCAGG ATGATCAGGA TGATCAGGAT GAGGAAACTG GCAGCGGCTG GCAGTCGCCG
CTAATGGTAT CTGGCTCCAG TTGGTACCCT GACTACAGTG ATATTCGTGA AGATCGCGTG
ATCATTTATG GCAGTGCCAG TACCGACGTT AAAGAGTTTA TCTACCAAAT CAAATCAACC
AATACGGGTC GCTTTGTGGT GCCACCGGCT TACGGCGAAG CCATGTATGA TCGTAATGTA
CAGGCGCTGT CGGTCGGTAA AGGGCATATC CTTGTCGTTC CACCTGAGGC AAAATAG
 
Protein sequence
MDLLRFLLIS PFALIKGLYR LSAYFLRLVG RLLRPVVGNL NWRAPQWMTK TANGLHCAFN 
RSEQWVAKHP KGISAAIVLL MAAASAAFYG YHWYLNRPQP IEPAPMVYQE TSIRVSAPRT
VNYQAQKPEA QPLSLNFMHS AAPITAMGQV IDQGISLTPA MEGEWKWATE RTLVFTPKKA
WPMGANYQIT IDAEKLLAPQ IKLNQTELNF TTPAFAYQLE KAEYYQDPQE AQKRSTIFHV
QFNAPVDVAS FEKQILLGLV EGKSKSEKKL NFSVVYDEKK LNAWIHSQPL MPMDKGGSVH
LSINKGVNAS VAATPTTQAQ NKWVSVPNLY SLAVNSINAT LVESDNNNGE RALIIAISDA
VKDKEIKNAV KAWLLPQHNF QAKESAKTST DFYPWDMDDI DDNLLQQSTP LALTLNEAEQ
EYQPIFSFKF DAPSYRTLLI EVNNSLTSVG GYKMPEKIYQ IVRVPDYPKT LRFMSQGSLL
SMQGDKQISV AARNMTGMKL DIKRVIPSQL QHIVSFKSSE YSSAHFNRLS DEYFTEHFQY
QTALNNDNPG EINYQGVDLS RYLANNPSAR RGVFLLTLSA WDPEKRDNQQ HSEEDYDEDQ
EWVGDSRFVV ITDLGIITKQ SQDRSRDVFV QSIHSGLPTA DAKVSVVAKN GVVLLSQITD
SKGHVHFPAL DAFKNERQPV MFLVEKEGDV SFLPTRATYD RNLDFSRFDI DGEETPSDPR
TLSSYLFSDR GVYRPGDRFN IGLITRTANW ATALDGVPLR AEIRDPRDTL MSTLPITLDS
SGFNELSYTT GENSPTGEWN VYLYLVGKNN ETSMLLGHTT VNVKEFEPDR LKVQLQLTPE
RQQGWVKPQE LQANINVQNL FGTPAQERRV TSRLILRPMY PSFAPFPDYL FYENRHNSDG
FETELEEQTT DLQGMATIPL DLKSYADATY QLQLLSEAFE AGGGRSVAAT ARVLVSPYDS
LVGVKADGDL SYINRDAVRK LNIIAVDPSL NKIALPDLSL SLIEQKYISV LTKQDSGVYK
YQSRLKEQLV SEQPLKISPT GTDFTLVTQQ PGDFILVVKD SQGQVLNRIS YTVAGNANLT
RSLDRNTELK LKLNQAEYLQ GEEIEIAINA PYAGSGLITI EKDKVYSWQW FHSDTTSSVQ
RIRIPPAMEG NGYINVQFVR DVNSDEIFMS PLSYGVMPFK ISTKARQAAI ELASPSVIKP
GEVLPIKVTT DSPQRVVVFA VDEGILQVAR YRLKDPLDYF FRKRELSVQS AQILDLILPE
FSKLMALTSA PGGDAGEGLD LHLNPFKRKQ DKPVAYWSGI TEVNGETTFN YPIPDYFNGK
IRVMAISATP DRIGKVQTST TVRDNFILTP NVPAMVAPGD EFDVTVGVSN NLQGLKGKAV
DITVRLTPPP QLEVVGEAQH SLSLAEKRET LVSFRLRARS ALGDAPLVFD ASYGSQSSRR
TVSTSVRPAM PFRTQSVMGR MEGNKHTVTN LRQMFDNYAQ RQATASHSPL VLTQGLARYL
ADYPYYSSEQ IVSRSIPLIM QSKHPEMDSA LNQNEVRDQL KNMLRILSSR QNSTGAIGLW
HASPTPDPFV TPYVVQFLLE AKSAGYSLPN DILEGANNAL RLLAARPYDD LYSLRLRAFA
VYLLTLQGEI TTNTLASVQS TLQQLYPDSW QTDLSAIYLA SSYRLLKMDD EANKLLQPTW
KQLGKAYSKA WWTQNYFDPL VQDATRLYLI TRHFPEKVSS IPPQALENMV LALRDEHYTT
YSSAMSILAL ESYTSQVAAQ QDTPETLQII EISKSKGIDP NVISTLNGLF VQGDFTGEAK
AIQFNNYASA PAWYVVNQSG YDLQPPKDAI SNGLEISRSY TDEQGKPVTQ VTLGQKVNVH
LKIRANAKQG QNNLAIVDLL PGGFEVVQQT APEPEFYDNQ DDQDDQDDQD EETGSGWQSP
LMVSGSSWYP DYSDIREDRV IIYGSASTDV KEFIYQIKST NTGRFVVPPA YGEAMYDRNV
QALSVGKGHI LVVPPEAK