Gene Information |
Locus tag | YpsIP31758_3692 |
Symbol | |
ID | 5386687 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Yersinia pseudotuberculosis IP 31758 |
Kingdom | Bacteria |
Replicon accession | NC_009708 |
Strand | + |
Start bp | 4163688 |
End bp | 4167947 |
Gene Length | 4260 bp |
Protein Length | 1419 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 640866716 |
Product | RHS/YD repeat-containing protein |
Protein accession | YP_001402646 |
Protein GI | 153947628 |
COG category | [M] Cell wall/membrane/envelope biogenesis [S] Function unknown |
COG ID | [COG3209] Rhs family protein [COG4104] Uncharacterized conserved protein |
TIGRFAM ID | [TIGR01643] YD repeat (two copies) |
|
|
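The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming 1-based inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length; the function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # The terminal stop codon does not encode an amino acid.
    return codons - 1 if has_stop_codon else codons


# Values taken from the record above (Start bp, End bp).
length_bp = gene_length(4163688, 4167947)
print(length_bp)                  # 4260
print(protein_length(length_bp))  # 1419
```

Both results agree with the Gene Length (4260 bp) and Protein Length (1419 aa) fields in the record.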
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.333413 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTGGAAG CCGCTGCGCG TGTTGGCGAT GCCATTGGTC ACTCTCAGGC ATTAGCCGGT CTTATCGGTG GAACGATTTT AGGCGGGTTA ATTAATGTCG CGGGGGGCAT TCTTGGCGGA ATGCTATTTG CCGCAGGGTG TGCCTCGGCG TGTCTGGGGG TGGGTATCCT GCTGATTGGC GCATCGATTG CCGTGGGTAT GGCAGCGAAT GCTCTGGGGG AAAAGGCACG GGATGCCTGT GTTGATGCAG GAAAAAACTC GCTCAGCCCC AGCGGGGCCA TAGTCACCGG CTCCGCTAAT GTTCGCACCA ACAGTAAAGC CGCCGCCGTA GCGACCCTCA GTGCCGTCAC CTGCGATAAA GATAAAGCAC AGCAGGTGGC ACAAGGCTCC TCCTCAGTAT TTATCAATGG CTTGCCAGCC GCCCGTCGTA ACGATAAAAC CACCTGTGAT GCCAGTATCA TGGTCGGTTC CACGAATGTA TTCATTGGGG GAGGAACGGA AACTACGCTC CCCATTACAT CTGAAATTCC TGATTGGGCC TATACCGTTT CAGACCTGAC GATGTTTGCT GCCGGGTTGA TCAGTTTTGG CGGCGCAGTC AGTCGGGGGC CAGGGGCGGT ACAGAAACTG TTTGCCAAAC TGCCTGGTGC CGACAAAATT GCCAAAATAG CCTGTCGGTT GGCACCACTG GCGATTATTT TACCGGTGGT CGGTATTCTG ACTAACCCGG TGGATGTGAC CAGTGGACAG AAGTTCCTTA ACGACGAGGA TGAGCGAGAT TTTACGCTTG ATGGTGAATT GCCCCTATTC TGGCAGCGCC GATATCTGAG CAGCTATGTC TATGAGGGGG TATTGGGACG TGGCTGGAAT TTGTTTTGGG AAAGTGCGTT GAGTCGTGTA GACGACGGTA TATTGTGGCG CAATACCTAT GGAGATTATA TTCCCTTCCC AGACATACCG GCAGGTCACC AAACCTTCTG TCCCGAACAG CAGTGCTGGT TGATACATCT GGAGGACGGG CGCTGGTGTA TCCGCGATGC GGGTGAATGG GTTTACCATT ACGGAAAATT TGATGCACAA GGTCTCGCCC CGCTGGCGAA TATTACCGAT AACGTCGGTA ACCGCCAGTC GTTTCATTAC AATGACCAAC AGCAGATGGT ATCCATTACG GGCACTGGCG GGCTGTCACT GCGCTGTGAC TACCACCCAG AACGGCATCG TCTGACGGCG GTCTGGCAAC AGACGCCTGA CGGGGACATT ATTCGCGCCC GCTATCAGTA TAATGAGTCA GGGCAACTGG CAGCAGTACA ACACCGGGAT GATACGGTAG TACGCCGTTT TGGCTGGGGT GAAGATCACG GTTTATTGCT CTGGCATGAA AATGTGGCTG GATTGCGTTG TGACTACCAG TGGCAGGAGA TTGACGACAT CTGGTGTGTG GCAGAACAGC ATACCTGCGA GGGTGACGGC TACCGACTGG CCTACGATGA AGAGCGGCAC CAACGCACTG CGACCTATCA GGACGGCAGT CAGGCGGTAT GGATACTGGA TGAACAGCAC CGGGTCAGCC GTTACACCGA CCGCACTGCC CATGAATGTC AGCTCCAGTG GGACACGGCC GGGCAACCGA CGGGTTACCG TTCACCGCGG GGACATCAAC GCCAATGTCA GTGGGATGAA CTGGGCCGTC TGGTCAGTGT GACGGATGCC AATGGCGCAG AAACACGTTG GCAGTATGAG CGCAATACTG ACCGGCAGAC CTTTGTGTTC TGGCCGGACG GCACGCAAGA GCGCCAGCAG 
TGGGATGCCC AGGGGCGACT CCTGCAGGAA ACTGACCGTC TGGGGCAATC TACGCACTAT ACCTACCCGC ATCCACGTAC CTTACTGCCG GACAGCATCA CTGACGCGCT GGGAGGGCAA AGTCAGTTGC TCTGGAGTCA GCAAGGTCAA CTCACCGGCT ACACCGACTG TTCCGGTCAG CCCACGCAGT GGCGCTATGA TGCGCTGGGG CAGTTGCTCT TGCGCCGCGA TGCCTTACAA CAGGAAATTC GCTACCACTG GGATCCCGTG GGCAGGTTAA CCAAAGTGAC CCTGCCAGAC GGTTCAACGG AACAGTTTGA CTGGTCTCCG GCGGGTCAGT TAGTGCGACA CCAGCAAGGC CATAACCAGC CCCGCCATTG GCATTACAGT GTGCGCGGAC AAATCCTGAG TACCACAGAC CGCCTGAGCC GGGTTATCCG CTATCGCTAT GATGCCGAGG GACGCCTGGT CCATCTGGAC AATGACAACG GCGGCCAGTA TCACTTTAAC CGGGATGCGG AAGGTCGTTT GCTGGAAGAA CAGCGTCCGG ATGATACTCG TTATTCCTAC ACCTATAATG CCGATGGGCA GGCAACGGAT ATCACACAAC GTGGCCTGTC AGAGAACCAC GCCTCACCGC CAGAGAAACC TACCCGACTG ACGTATGACG CCGTTGGCCG ACTGATAGCG CGCCATACTC TCACGGAACA GACGTGCTAT CAGTGGGACA AGATGGGCAA TCTACTCAGT GCCATCCGCA CCCCGACCGA ACAGGGTGAA AAGCTGGGTA TTCTGACCAA TACCGTGACC TTTGAACGGG ATGCGTTGGG CCGCATTACT CAGGAGCATA ATGGGGCGGA GGCGCTGGCT TACCACTATG ATGCGCTGGG CAACCTGACC CGACTGGAAT TACCGAATGA TGACCATTTC CAGTGGCTGC ATTATGGCTC TGGGCATGTG AGTGCCATCC GCTTTAATGA CCAGTTGGTC AGCGAATTCG AGCGTGATGC ATTACACCGC GAGACCCGCC GCACACAAGG TATCCTGACG CAACAACGTC AGTATGATGT ACTGGGTCGC CGCCGCTGGC AAAGTAGTAT CAGCAGTCGT CTCACCGAGG CGCTCACCAC GCCAGAGCAG GGAATACTGT GGCGGGCCTA TCATTACGAT GAACTGCATG AACTGGCTGC GGTAGAGGAC AGCAACCGGG GCATGTTGAG CTACGGCTAT GATGAAGAGG GACGACTACG CAGTACCGTC TCGCCACACA GCGGTCAGAC GACGGTGCAT TATGACCGGG CCGATAACGC GTTGATGTTA CCGTTACAGA CGCCGGAGAG TTCGCCATAT GCGCGGAGCA GTCAACCTTA TTGTGATAAT CGGCTGACAC GTTGGGAACA GTGGCAGTAT CACTATGATG CCTTTGGTAA CCTGTCGGAG CGACTGGAAG GCTACCGTAC TCAACGCTAC CGCTATGATG GGGATAACCG GTTGGTCGGG GCCAAAGGGG ATGGGCAGAA AGGTCTCTTT GAGGCGCAAT ATCATTATGA TGCGCTGGGT CGGCGGCTCA GCAAAGTGGT CAGGACGCCA CAGGGTAACC AAGAGACTCA CTTTTTATGG CAGGGTCTGC GTCTGCTGCA AAGCCGCACG GACGAGAGCC AGCAAACCTA CTGCTATGAC CCGAATGAGG CATACACACC ACTCGCCTGC ATTGAGCGGC GCTACGGTGA AGACACCTTG TACTGGTATC ACACTGATCT GAATGGCTCG CCACAGGAAG TGACCAACGC GCAGGGTGAA ATGGTATGGT 
CGGGGCAATA TGGGGTGTTT GGGCAAGTTA CACGCCAGAC CGATGCGATG TGGCGTAACG TCAGTAAACC GCTGGGCCAA TTCAGGCAGC CATTGCGTTA TGCCGGGCAA TATCTGGATG ACGAAACGGG GTTGCACTAC ACTACCTACC GGTATTACGC GCCGGAGGTG GGAAGGTTTA TCACACCCGA TCCGATTGGC TTGGCGGGGG GTCTAAATCT TTATCAGTAT GCGCCAAATC CGTTGGGGTG GATTGATCCA TGGGGGTTGG CAGGTAGTCC AACGACAGCA ACACACATCA CTTATCAGGG TATTGATGCT ATTACAGGTA AGCCTTATGT TGGGTATGCA AGTATGCAAG GCAATCAAAT AGCACAAGAT GTGTTGAAAT ATCGCTATGC TAATGACTTT AGTCGTTTTG GTGGAACTCC TCCTGAAATT TTATACGATG GGTATGGTCA GGCAGGTAAA TATGTCACTC GTGGATTAGA GCAGCGGACA TTTGAAAATC TTGGTGGACT TGACGGTACT GCGAATAAAC AAAATCCAGT AGGGCAGGGA AATGCTAGAA GAACAGAATA CCTTAATGCG GCAGATGAAC ATCTGAGTAA TAAAAATGGT AGTAGAAAAG GAGGAGGCGG TCGATGTTAA
|
Protein sequence | MLEAAARVGD AIGHSQALAG LIGGTILGGL INVAGGILGG MLFAAGCASA CLGVGILLIG ASIAVGMAAN ALGEKARDAC VDAGKNSLSP SGAIVTGSAN VRTNSKAAAV ATLSAVTCDK DKAQQVAQGS SSVFINGLPA ARRNDKTTCD ASIMVGSTNV FIGGGTETTL PITSEIPDWA YTVSDLTMFA AGLISFGGAV SRGPGAVQKL FAKLPGADKI AKIACRLAPL AIILPVVGIL TNPVDVTSGQ KFLNDEDERD FTLDGELPLF WQRRYLSSYV YEGVLGRGWN LFWESALSRV DDGILWRNTY GDYIPFPDIP AGHQTFCPEQ QCWLIHLEDG RWCIRDAGEW VYHYGKFDAQ GLAPLANITD NVGNRQSFHY NDQQQMVSIT GTGGLSLRCD YHPERHRLTA VWQQTPDGDI IRARYQYNES GQLAAVQHRD DTVVRRFGWG EDHGLLLWHE NVAGLRCDYQ WQEIDDIWCV AEQHTCEGDG YRLAYDEERH QRTATYQDGS QAVWILDEQH RVSRYTDRTA HECQLQWDTA GQPTGYRSPR GHQRQCQWDE LGRLVSVTDA NGAETRWQYE RNTDRQTFVF WPDGTQERQQ WDAQGRLLQE TDRLGQSTHY TYPHPRTLLP DSITDALGGQ SQLLWSQQGQ LTGYTDCSGQ PTQWRYDALG QLLLRRDALQ QEIRYHWDPV GRLTKVTLPD GSTEQFDWSP AGQLVRHQQG HNQPRHWHYS VRGQILSTTD RLSRVIRYRY DAEGRLVHLD NDNGGQYHFN RDAEGRLLEE QRPDDTRYSY TYNADGQATD ITQRGLSENH ASPPEKPTRL TYDAVGRLIA RHTLTEQTCY QWDKMGNLLS AIRTPTEQGE KLGILTNTVT FERDALGRIT QEHNGAEALA YHYDALGNLT RLELPNDDHF QWLHYGSGHV SAIRFNDQLV SEFERDALHR ETRRTQGILT QQRQYDVLGR RRWQSSISSR LTEALTTPEQ GILWRAYHYD ELHELAAVED SNRGMLSYGY DEEGRLRSTV SPHSGQTTVH YDRADNALML PLQTPESSPY ARSSQPYCDN RLTRWEQWQY HYDAFGNLSE RLEGYRTQRY RYDGDNRLVG AKGDGQKGLF EAQYHYDALG RRLSKVVRTP QGNQETHFLW QGLRLLQSRT DESQQTYCYD PNEAYTPLAC IERRYGEDTL YWYHTDLNGS PQEVTNAQGE MVWSGQYGVF GQVTRQTDAM WRNVSKPLGQ FRQPLRYAGQ YLDDETGLHY TTYRYYAPEV GRFITPDPIG LAGGLNLYQY APNPLGWIDP WGLAGSPTTA THITYQGIDA ITGKPYVGYA SMQGNQIAQD VLKYRYANDF SRFGGTPPEI LYDGYGQAGK YVTRGLEQRT FENLGGLDGT ANKQNPVGQG NARRTEYLNA ADEHLSNKNG SRKGGGGRC
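The sequences above are displayed in space-separated blocks of 10 characters. The following is a minimal sketch of how such a block-formatted string could be normalized and its GC fraction computed; the helper names are hypothetical, and the short input string is only the first two blocks of the gene sequence, used for illustration.

```python
def normalize_sequence(block_formatted: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(block_formatted.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string (0.0 for an empty string)."""
    if not dna:
        return 0.0
    gc_count = sum(1 for base in dna if base in "GC")
    return gc_count / len(dna)


# First two 10-base blocks of the gene sequence shown in the record.
sample = normalize_sequence("ATGCTGGAAG CCGCTGCGCG")
print(len(sample))          # 20
print(sample[:3] == "ATG")  # True, the CDS begins with a start codon
print(gc_fraction(sample))
```

Applied to the full gene sequence, `gc_fraction` could be compared against the GC content field of the record.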
|
| |