Gene Information |
Locus tag | YpsIP31758_0698 |
Symbol | |
ID | 5387886 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Yersinia pseudotuberculosis IP 31758 |
Kingdom | Bacteria |
Replicon accession | NC_009708 |
Strand | + |
Start bp | 853063 |
End bp | 857541 |
Gene Length | 4479 bp |
Protein Length | 1492 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 640863667 |
Product | YD repeat-/RHS repeat-containing protein |
Protein accession | YP_001399688 |
Protein GI | 153948079 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3209] Rhs family protein |
TIGRFAM ID | [TIGR01643] YD repeat (two copies) |
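The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes 1-based, inclusive coordinates and a single terminal stop codon that is not counted in the protein length; the function names are illustrative and are not part of the database.

```python
# Values copied from the Gene Information table above.
START_BP = 853063
END_BP = 857541


def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the stop codon


length_bp = gene_length_bp(START_BP, END_BP)
length_aa = protein_length_aa(length_bp)
print(length_bp, length_aa)  # 4479 1492
```

Both results agree with the Gene Length (4479 bp) and Protein Length (1492 aa) rows.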
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.632091 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCAATGC CAGCGGTAAA ACATTTTGAT CCGGTCATTG GGCTGGATGT CCATACGGTC ATTATTCCTC CCTCACCGGT CCCTATCCCT ATTCCCCATC CCCATGTGGG ATTTGTGCTG GATTTACGCG AGTGCGTTAA CGGCGTTAAA TCGGTGGTCG GTTCAATTGT GTTCAGTTTT GTCGCTGAGG CTGCTGCTGA TGTGATGGAG GACAACCCTG AATTATTGGC GGCGGGTATG GCGCTGGTCA GTAAGGCAGG GAGTGCACTG GAGGCCGCGG CGAATAATCC GGTAGTAAAA ACCGGGCTGG CGTTAAAAGA TGCCAAAGAT CAGCTATCCG GGTTGAAAGC GGGCATTATC GACACGCTGG GTGGCAATAT TGGCGGCGGC GGGGGGTCTA ACCGACCGGT AAAAGTGAAT GGCACATTGC GCAGTACCGT CGGGACGCAC ACGTTTCATA TTCCGGGGCT GCATTTTCCC CTCGGTGCCA TGTTCGCGCC GTTGATCCCC AGCAAAGATT CCGAATCTTT TATGGGCAGC AAAACCGTTA TGGCCAACGG TGATCCGCTG TCTTATATGG CCCTACCCGC CATGAGTTGC TGGTTTGTCG GCCTGCCCTC GACGCCGAAA AACGCAGCCC ATACCCAACG CAAGAGCCTG TCGTTACCCA CCTCCGTCAT GTTGCCGATC CCGATGGGCC GCCCGGTGGT GGTGGGCGGC ATGCCGGTGC TTAACCTGCT CGCCCTGATG ATGGGGCTGT TCAAGATATT TCGTGGATCT AAGTTGGCGA AAAAGCTGGC CGAAAAACTG GGATTAAAAT CCGGCTTCCT GAAGTGCACC ATTTTGGATG CGGAGCCGGT CAACTCGATC ACTGGCGAAG TGGTGGTTGA GCAGAACGAT TTTGTGGTGG AAGGGCGCTT CCCGCTGGTG TGGGATCGCT ATTATGGCAG CCAGAAAGCG GTGACCGGTG TCATTGGCCA GCGCTGGCAG TGCCCGGCAG ATATTCGGCT GGAGGTGTTG GTCGATCAGG GCGAACTCGG CGTGGTGGCC CGCTTCCCGG ATCATGAAAC CGTCTTCTCG CTGATGCCGA TAGAGCAGGG CTGGGATGAG CGGGTGTATG ACTGGCAGCA GGGGCACGCC CTGTATCGTT CAGGTAATCA GTTGGTCCTG CGCACCCGCC ACGAGGAAGA GTATTTCTTC ACGTTGCCCA ATGACTGGGA AAGCACGGTG ATCCCGTTTA CGGCGGCGGA TCGTCTGACG CTGCCCATTC TGAGAATGGC AGATCGCTAT GGCAATGGCT GGCAATTCAA ACGGCAGGGC AATTCGCTGT TGCTGGCGTT GATCGAACTG ACACAAGGGT TGCTTTCGGG GCGTGAAGTG CATGTCATTC AGGATGAAAA CGCCCCTATC AGCGGGATGC TCAACGATTT TGTTCTCGTG AATAAAACCA ATGGCGAGCG GCGTTTTCTG GTGGGCTACC GCCACGATGG CGAAGGTAAC CTGATTGCCA CGCTGGATGC GGACAACCAT CCCTACCAGT TTGAATATAT CGCTGACAGC CAAATGGTGC GCCATACCGA CCGCAACGGG TTGTCGTTTT ACTACCGCTA TCAGCGCCAC AGCGACGGGC TGAACCGGGT CGAGCACGCC TGGGGTGATG GTGGGCTGTT CGATTACCGT TTTCATTATG ATCGGGTCTA TCAGGAAACC CAAATCACCG ATTCATTGGG CCACACCACC TTGCTGCGCT ACGACGAACG GGGGCTGCCG TTTGCCCGCG TCACGCCATT AGGCGGGGTT 
TACAGTTATC AATATGATGC ACAGTGCCGG ACCATCGCCG AGATTGACCC AGCGGGCAAT ACCCACGGCT GGGTGTACGA CCAATATGCC AACCTGACAG AAGAGACCTT TGCCGACCGC AGCAGCGTGA AAACCGAGTA TGGCGATGAT CATAAACCAG TGCGCATTAC CGATCCCGGC GGGCGCTTCT GGCGGCAGAG CTGGGATGCA CAGGGCAAGG TGATTTCGCA GACCACGCCG GGCAATATCG CCACCCATTT TACCTATGAT GACCTTGGGC AACTGGTGAC GGTGGTGGAT GCCAGCCAGC AGCGCACGGA ACTGGCCTAT GATGCATGGG GTTTTTTGCG GGCCATTACC GATGCGCGGG GTAACGCGAC GTCATTTAAG CATGATTTTT GCGGTAATCT GCTGAAAAAA GTGGCGGCCA ATGGCGATAT CACCCGCTAT CAGTACGATA AAAAACAGCG GCTGACGGGC TGTACGCTAC CCGATGCCAG AACCATCCGC TGTGAATATG ATCGGGAAGA TAATCTACTG CTCTACAACG AGAACGGCTC ACGTATCACC CGCTTTGGCT ATTTTGGTCA GGGCCGGTTG CAAAGCCGTA CCGATCCGGA TGGCAGCCTG ACCGAGTACC TGTACGACAC CGAAGAACAG TTGATTGGGG TGAAAAATCA GCGCGGTGAA ACGTGGCAGC TCAAGCGTAA CGCCGAAGGG CGGTTAATCG AAGAAGTGGA TTACTGGGGG CAGAGCCGCG GTTATCAGTA CAATGCGGTT GGGCATTTAA CCGGCAGTCA CGACCCGCTA GGGCAGATAT TGGCGGTCAC CTGCGACAAA TTAGGGCGCA TCACCGAGAA GAACATTGCG GGTGATGAGC AGGCGTGGGA GCGCTACGAG TATAACGTTC AGGGCCAGCT TATCGGGGCG GTGAACCCGG CGGTGACGGT GACGCGGCGC TATAATCAGG ACGGCCAGTT AACGCAGGAA ATTCAGCAGC AACCGCAGGT CAGTGCGACG GTGGAATATG GCTATAATGC CGCCGGTCAG CAGGCTGAGC AGCGCCATCT GTTGCAATAT GCCGATGAGG ACGACATTCA GGTACAGCAA CGGATCCGCT ACGGCTACGA TGTGCTGGGG CAGTGCATCA GCCAGCAGAT CGATGATCAC AGGCCGATGG CGTTCAGCTA CGACAAGATT GGCCGGTTAA CCGAACAACG GCTAACGCCG AGCCTGTCGC ACCATCTCAG TTACAGCGCT GCCGGGCAAC TGGCGGGTTA CCAGACACGG CGCAAGGGCA TGGTCTGGAG TGAAACCGAC TACTTCTATG ATCAGCACGG CAATCTGACG CAGCGGGAGG ACAGCCGGCA GGGCAGTGAG CGTTATCACT ACGATGTGCT GGGGCAGATT GTGGGCTATC AGGATCCGCT GGGGACGCTG CATCGCTACC GTTATGATGC CTGCGGTGAC CGGTTCAGCA CGGTGGCTGA CAATGCGGAG GGCCGGCACA CGCAGCACGA TAATGGCACC GCCTACCAGC TCGACAAGGC GGGGCAGTTG GTATCGCGCA CCGACCGGTT TAGCCAGTTA GCCCTGCGCT GGAATACTTT TGGTCGGCTG GCCGGCATCA AAAATGAGCA CCATGATCAC GCTTATACCT ACGATGCGCT GGGGCGCCGG GTGGGCAAAC GGCACCTTCA ACGGCCGACC AAACTGCTTG ATGGGCTGCG GGTGATCGAC AAAGAGACCC CCTCGCGCGG TGAACCGGTG TGGCAGGATG AAACCTGGTT TATGTGGGAC GGTGACGTGA 
TGGTGGGCGA GTTGCAGCGT GAAGTTGCCC CCCGGGTGGT GACGCCTGAA ACCTGGGACA AACCGGATGG TATCGTCTAC AGCGCTCAGT TTTACGTGTA TCAGCCAGAC AGCTTCGAAC CCCGCGCCAT GCAGCGTTAT CAACAGGTTG CGGAAGCTGA AGACGGCGAA ATCGCGCCAT TAGGCGAAGA GCAGATTTAT TTCTACCAGA ATGACCCGAA CGGTATGCCG ATCCGCCTGC AGGATGGCGA AGGCGAAGTG GTATGGGAAG CACAGTTCAC CCCGTTCGGG CAACTGAGTG TCACGGGTAC CAGCCAACTG CGCCAGCCGC TGCGCATGCA GGGCCAATAT TACGACACGG AAAGCGGTTT ACATTATAAC CGCTACCGTT ATTATGATCC GGCCTGCGGT GTCTTTATCA GCCAGGACCC AATAGGGTTA AAAGGCGGAT TAAATCCGTA CCAGTTTGCA GTGAATACAC TGGGATGGGT GGATCCGCTG GGGTTAGCTA AAAAAACGAA TGAGGCTGGG GCATACTCTG AAGTAGGTGG TCATCATGTT CATGCACAAG CTGGATTTAA AAATGAGCCT AATTACGATA AGGGTACCGC ATTTGCTATT GGTCAGGATT ACATGAAAGA AAAAGGTTTG GATCACCAAT CTATGACCAA TTCTCAACGG CAAGGATTTA AAGAATTAAA TGAAAGTGGT AGACCAAATA CTTTGGCTGA ACATGACAAA ATTGCTAAAG ATGCTTTAAT AGCTGGGGGG GCAACAGAAG AAGAAGCTAA TGATTTAGTG AAAAAGTCCA AGGATAATTT GAAATGTCAA AATGTTTCTA AACCCAGCAA TATCCCTTGG TATAGTTAA
|
Protein sequence | MAMPAVKHFD PVIGLDVHTV IIPPSPVPIP IPHPHVGFVL DLRECVNGVK SVVGSIVFSF VAEAAADVME DNPELLAAGM ALVSKAGSAL EAAANNPVVK TGLALKDAKD QLSGLKAGII DTLGGNIGGG GGSNRPVKVN GTLRSTVGTH TFHIPGLHFP LGAMFAPLIP SKDSESFMGS KTVMANGDPL SYMALPAMSC WFVGLPSTPK NAAHTQRKSL SLPTSVMLPI PMGRPVVVGG MPVLNLLALM MGLFKIFRGS KLAKKLAEKL GLKSGFLKCT ILDAEPVNSI TGEVVVEQND FVVEGRFPLV WDRYYGSQKA VTGVIGQRWQ CPADIRLEVL VDQGELGVVA RFPDHETVFS LMPIEQGWDE RVYDWQQGHA LYRSGNQLVL RTRHEEEYFF TLPNDWESTV IPFTAADRLT LPILRMADRY GNGWQFKRQG NSLLLALIEL TQGLLSGREV HVIQDENAPI SGMLNDFVLV NKTNGERRFL VGYRHDGEGN LIATLDADNH PYQFEYIADS QMVRHTDRNG LSFYYRYQRH SDGLNRVEHA WGDGGLFDYR FHYDRVYQET QITDSLGHTT LLRYDERGLP FARVTPLGGV YSYQYDAQCR TIAEIDPAGN THGWVYDQYA NLTEETFADR SSVKTEYGDD HKPVRITDPG GRFWRQSWDA QGKVISQTTP GNIATHFTYD DLGQLVTVVD ASQQRTELAY DAWGFLRAIT DARGNATSFK HDFCGNLLKK VAANGDITRY QYDKKQRLTG CTLPDARTIR CEYDREDNLL LYNENGSRIT RFGYFGQGRL QSRTDPDGSL TEYLYDTEEQ LIGVKNQRGE TWQLKRNAEG RLIEEVDYWG QSRGYQYNAV GHLTGSHDPL GQILAVTCDK LGRITEKNIA GDEQAWERYE YNVQGQLIGA VNPAVTVTRR YNQDGQLTQE IQQQPQVSAT VEYGYNAAGQ QAEQRHLLQY ADEDDIQVQQ RIRYGYDVLG QCISQQIDDH RPMAFSYDKI GRLTEQRLTP SLSHHLSYSA AGQLAGYQTR RKGMVWSETD YFYDQHGNLT QREDSRQGSE RYHYDVLGQI VGYQDPLGTL HRYRYDACGD RFSTVADNAE GRHTQHDNGT AYQLDKAGQL VSRTDRFSQL ALRWNTFGRL AGIKNEHHDH AYTYDALGRR VGKRHLQRPT KLLDGLRVID KETPSRGEPV WQDETWFMWD GDVMVGELQR EVAPRVVTPE TWDKPDGIVY SAQFYVYQPD SFEPRAMQRY QQVAEAEDGE IAPLGEEQIY FYQNDPNGMP IRLQDGEGEV VWEAQFTPFG QLSVTGTSQL RQPLRMQGQY YDTESGLHYN RYRYYDPACG VFISQDPIGL KGGLNPYQFA VNTLGWVDPL GLAKKTNEAG AYSEVGGHHV HAQAGFKNEP NYDKGTAFAI GQDYMKEKGL DHQSMTNSQR QGFKELNESG RPNTLAEHDK IAKDALIAGG ATEEEANDLV KKSKDNLKCQ NVSKPSNIPW YS
|
| |
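As a rough illustration of how the gene sequence maps to the protein sequence, the sketch below translates the first and last bases of the gene sequence with a codon table written out in NCBI order. Translation table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may act as starts, so the standard assignments are used here. The helper name `translate` is hypothetical, and the sketch does not handle alternative start codons (not needed for this gene, which starts with ATG).

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G; first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string; whitespace between blocks is ignored."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


# First 30 and last 21 bases of the gene sequence listed above.
head = "ATGGCAATGC CAGCGGTAAA ACATTTTGAT"
tail = "AATATCCCTT GGTATAGTTA A"

print(translate(head))  # MAMPAVKHFD
print(translate(tail))  # NIPWYS*
```

The outputs match the start (MAMPAVKHFD) and end (NIPW YS) of the protein sequence; the trailing `*` marks the TAA stop codon, which is not part of the 1492-residue product.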