Gene YpsIP31758_3692 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYpsIP31758_3692 
Symbol 
ID5386687 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pseudotuberculosis IP 31758 
KingdomBacteria 
Replicon accessionNC_009708 
Strand
Start bp4163688 
End bp4167947 
Gene Length4260 bp 
Protein Length1419 aa 
Translation table11 
GC content55% 
IMG OID640866716 
ProductRHS/YD repeat-containing protein 
Protein accessionYP_001402646 
Protein GI153947628 
COG category[M] Cell wall/membrane/envelope biogenesis
[S] Function unknown 
COG ID[COG3209] Rhs family protein
[COG4104] Uncharacterized conserved protein 
TIGRFAM ID[TIGR01643] YD repeat (two copies) 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.333413 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTGGAAG CCGCTGCGCG TGTTGGCGAT GCCATTGGTC ACTCTCAGGC ATTAGCCGGT 
CTTATCGGTG GAACGATTTT AGGCGGGTTA ATTAATGTCG CGGGGGGCAT TCTTGGCGGA
ATGCTATTTG CCGCAGGGTG TGCCTCGGCG TGTCTGGGGG TGGGTATCCT GCTGATTGGC
GCATCGATTG CCGTGGGTAT GGCAGCGAAT GCTCTGGGGG AAAAGGCACG GGATGCCTGT
GTTGATGCAG GAAAAAACTC GCTCAGCCCC AGCGGGGCCA TAGTCACCGG CTCCGCTAAT
GTTCGCACCA ACAGTAAAGC CGCCGCCGTA GCGACCCTCA GTGCCGTCAC CTGCGATAAA
GATAAAGCAC AGCAGGTGGC ACAAGGCTCC TCCTCAGTAT TTATCAATGG CTTGCCAGCC
GCCCGTCGTA ACGATAAAAC CACCTGTGAT GCCAGTATCA TGGTCGGTTC CACGAATGTA
TTCATTGGGG GAGGAACGGA AACTACGCTC CCCATTACAT CTGAAATTCC TGATTGGGCC
TATACCGTTT CAGACCTGAC GATGTTTGCT GCCGGGTTGA TCAGTTTTGG CGGCGCAGTC
AGTCGGGGGC CAGGGGCGGT ACAGAAACTG TTTGCCAAAC TGCCTGGTGC CGACAAAATT
GCCAAAATAG CCTGTCGGTT GGCACCACTG GCGATTATTT TACCGGTGGT CGGTATTCTG
ACTAACCCGG TGGATGTGAC CAGTGGACAG AAGTTCCTTA ACGACGAGGA TGAGCGAGAT
TTTACGCTTG ATGGTGAATT GCCCCTATTC TGGCAGCGCC GATATCTGAG CAGCTATGTC
TATGAGGGGG TATTGGGACG TGGCTGGAAT TTGTTTTGGG AAAGTGCGTT GAGTCGTGTA
GACGACGGTA TATTGTGGCG CAATACCTAT GGAGATTATA TTCCCTTCCC AGACATACCG
GCAGGTCACC AAACCTTCTG TCCCGAACAG CAGTGCTGGT TGATACATCT GGAGGACGGG
CGCTGGTGTA TCCGCGATGC GGGTGAATGG GTTTACCATT ACGGAAAATT TGATGCACAA
GGTCTCGCCC CGCTGGCGAA TATTACCGAT AACGTCGGTA ACCGCCAGTC GTTTCATTAC
AATGACCAAC AGCAGATGGT ATCCATTACG GGCACTGGCG GGCTGTCACT GCGCTGTGAC
TACCACCCAG AACGGCATCG TCTGACGGCG GTCTGGCAAC AGACGCCTGA CGGGGACATT
ATTCGCGCCC GCTATCAGTA TAATGAGTCA GGGCAACTGG CAGCAGTACA ACACCGGGAT
GATACGGTAG TACGCCGTTT TGGCTGGGGT GAAGATCACG GTTTATTGCT CTGGCATGAA
AATGTGGCTG GATTGCGTTG TGACTACCAG TGGCAGGAGA TTGACGACAT CTGGTGTGTG
GCAGAACAGC ATACCTGCGA GGGTGACGGC TACCGACTGG CCTACGATGA AGAGCGGCAC
CAACGCACTG CGACCTATCA GGACGGCAGT CAGGCGGTAT GGATACTGGA TGAACAGCAC
CGGGTCAGCC GTTACACCGA CCGCACTGCC CATGAATGTC AGCTCCAGTG GGACACGGCC
GGGCAACCGA CGGGTTACCG TTCACCGCGG GGACATCAAC GCCAATGTCA GTGGGATGAA
CTGGGCCGTC TGGTCAGTGT GACGGATGCC AATGGCGCAG AAACACGTTG GCAGTATGAG
CGCAATACTG ACCGGCAGAC CTTTGTGTTC TGGCCGGACG GCACGCAAGA GCGCCAGCAG
TGGGATGCCC AGGGGCGACT CCTGCAGGAA ACTGACCGTC TGGGGCAATC TACGCACTAT
ACCTACCCGC ATCCACGTAC CTTACTGCCG GACAGCATCA CTGACGCGCT GGGAGGGCAA
AGTCAGTTGC TCTGGAGTCA GCAAGGTCAA CTCACCGGCT ACACCGACTG TTCCGGTCAG
CCCACGCAGT GGCGCTATGA TGCGCTGGGG CAGTTGCTCT TGCGCCGCGA TGCCTTACAA
CAGGAAATTC GCTACCACTG GGATCCCGTG GGCAGGTTAA CCAAAGTGAC CCTGCCAGAC
GGTTCAACGG AACAGTTTGA CTGGTCTCCG GCGGGTCAGT TAGTGCGACA CCAGCAAGGC
CATAACCAGC CCCGCCATTG GCATTACAGT GTGCGCGGAC AAATCCTGAG TACCACAGAC
CGCCTGAGCC GGGTTATCCG CTATCGCTAT GATGCCGAGG GACGCCTGGT CCATCTGGAC
AATGACAACG GCGGCCAGTA TCACTTTAAC CGGGATGCGG AAGGTCGTTT GCTGGAAGAA
CAGCGTCCGG ATGATACTCG TTATTCCTAC ACCTATAATG CCGATGGGCA GGCAACGGAT
ATCACACAAC GTGGCCTGTC AGAGAACCAC GCCTCACCGC CAGAGAAACC TACCCGACTG
ACGTATGACG CCGTTGGCCG ACTGATAGCG CGCCATACTC TCACGGAACA GACGTGCTAT
CAGTGGGACA AGATGGGCAA TCTACTCAGT GCCATCCGCA CCCCGACCGA ACAGGGTGAA
AAGCTGGGTA TTCTGACCAA TACCGTGACC TTTGAACGGG ATGCGTTGGG CCGCATTACT
CAGGAGCATA ATGGGGCGGA GGCGCTGGCT TACCACTATG ATGCGCTGGG CAACCTGACC
CGACTGGAAT TACCGAATGA TGACCATTTC CAGTGGCTGC ATTATGGCTC TGGGCATGTG
AGTGCCATCC GCTTTAATGA CCAGTTGGTC AGCGAATTCG AGCGTGATGC ATTACACCGC
GAGACCCGCC GCACACAAGG TATCCTGACG CAACAACGTC AGTATGATGT ACTGGGTCGC
CGCCGCTGGC AAAGTAGTAT CAGCAGTCGT CTCACCGAGG CGCTCACCAC GCCAGAGCAG
GGAATACTGT GGCGGGCCTA TCATTACGAT GAACTGCATG AACTGGCTGC GGTAGAGGAC
AGCAACCGGG GCATGTTGAG CTACGGCTAT GATGAAGAGG GACGACTACG CAGTACCGTC
TCGCCACACA GCGGTCAGAC GACGGTGCAT TATGACCGGG CCGATAACGC GTTGATGTTA
CCGTTACAGA CGCCGGAGAG TTCGCCATAT GCGCGGAGCA GTCAACCTTA TTGTGATAAT
CGGCTGACAC GTTGGGAACA GTGGCAGTAT CACTATGATG CCTTTGGTAA CCTGTCGGAG
CGACTGGAAG GCTACCGTAC TCAACGCTAC CGCTATGATG GGGATAACCG GTTGGTCGGG
GCCAAAGGGG ATGGGCAGAA AGGTCTCTTT GAGGCGCAAT ATCATTATGA TGCGCTGGGT
CGGCGGCTCA GCAAAGTGGT CAGGACGCCA CAGGGTAACC AAGAGACTCA CTTTTTATGG
CAGGGTCTGC GTCTGCTGCA AAGCCGCACG GACGAGAGCC AGCAAACCTA CTGCTATGAC
CCGAATGAGG CATACACACC ACTCGCCTGC ATTGAGCGGC GCTACGGTGA AGACACCTTG
TACTGGTATC ACACTGATCT GAATGGCTCG CCACAGGAAG TGACCAACGC GCAGGGTGAA
ATGGTATGGT CGGGGCAATA TGGGGTGTTT GGGCAAGTTA CACGCCAGAC CGATGCGATG
TGGCGTAACG TCAGTAAACC GCTGGGCCAA TTCAGGCAGC CATTGCGTTA TGCCGGGCAA
TATCTGGATG ACGAAACGGG GTTGCACTAC ACTACCTACC GGTATTACGC GCCGGAGGTG
GGAAGGTTTA TCACACCCGA TCCGATTGGC TTGGCGGGGG GTCTAAATCT TTATCAGTAT
GCGCCAAATC CGTTGGGGTG GATTGATCCA TGGGGGTTGG CAGGTAGTCC AACGACAGCA
ACACACATCA CTTATCAGGG TATTGATGCT ATTACAGGTA AGCCTTATGT TGGGTATGCA
AGTATGCAAG GCAATCAAAT AGCACAAGAT GTGTTGAAAT ATCGCTATGC TAATGACTTT
AGTCGTTTTG GTGGAACTCC TCCTGAAATT TTATACGATG GGTATGGTCA GGCAGGTAAA
TATGTCACTC GTGGATTAGA GCAGCGGACA TTTGAAAATC TTGGTGGACT TGACGGTACT
GCGAATAAAC AAAATCCAGT AGGGCAGGGA AATGCTAGAA GAACAGAATA CCTTAATGCG
GCAGATGAAC ATCTGAGTAA TAAAAATGGT AGTAGAAAAG GAGGAGGCGG TCGATGTTAA
 
Protein sequence
MLEAAARVGD AIGHSQALAG LIGGTILGGL INVAGGILGG MLFAAGCASA CLGVGILLIG 
ASIAVGMAAN ALGEKARDAC VDAGKNSLSP SGAIVTGSAN VRTNSKAAAV ATLSAVTCDK
DKAQQVAQGS SSVFINGLPA ARRNDKTTCD ASIMVGSTNV FIGGGTETTL PITSEIPDWA
YTVSDLTMFA AGLISFGGAV SRGPGAVQKL FAKLPGADKI AKIACRLAPL AIILPVVGIL
TNPVDVTSGQ KFLNDEDERD FTLDGELPLF WQRRYLSSYV YEGVLGRGWN LFWESALSRV
DDGILWRNTY GDYIPFPDIP AGHQTFCPEQ QCWLIHLEDG RWCIRDAGEW VYHYGKFDAQ
GLAPLANITD NVGNRQSFHY NDQQQMVSIT GTGGLSLRCD YHPERHRLTA VWQQTPDGDI
IRARYQYNES GQLAAVQHRD DTVVRRFGWG EDHGLLLWHE NVAGLRCDYQ WQEIDDIWCV
AEQHTCEGDG YRLAYDEERH QRTATYQDGS QAVWILDEQH RVSRYTDRTA HECQLQWDTA
GQPTGYRSPR GHQRQCQWDE LGRLVSVTDA NGAETRWQYE RNTDRQTFVF WPDGTQERQQ
WDAQGRLLQE TDRLGQSTHY TYPHPRTLLP DSITDALGGQ SQLLWSQQGQ LTGYTDCSGQ
PTQWRYDALG QLLLRRDALQ QEIRYHWDPV GRLTKVTLPD GSTEQFDWSP AGQLVRHQQG
HNQPRHWHYS VRGQILSTTD RLSRVIRYRY DAEGRLVHLD NDNGGQYHFN RDAEGRLLEE
QRPDDTRYSY TYNADGQATD ITQRGLSENH ASPPEKPTRL TYDAVGRLIA RHTLTEQTCY
QWDKMGNLLS AIRTPTEQGE KLGILTNTVT FERDALGRIT QEHNGAEALA YHYDALGNLT
RLELPNDDHF QWLHYGSGHV SAIRFNDQLV SEFERDALHR ETRRTQGILT QQRQYDVLGR
RRWQSSISSR LTEALTTPEQ GILWRAYHYD ELHELAAVED SNRGMLSYGY DEEGRLRSTV
SPHSGQTTVH YDRADNALML PLQTPESSPY ARSSQPYCDN RLTRWEQWQY HYDAFGNLSE
RLEGYRTQRY RYDGDNRLVG AKGDGQKGLF EAQYHYDALG RRLSKVVRTP QGNQETHFLW
QGLRLLQSRT DESQQTYCYD PNEAYTPLAC IERRYGEDTL YWYHTDLNGS PQEVTNAQGE
MVWSGQYGVF GQVTRQTDAM WRNVSKPLGQ FRQPLRYAGQ YLDDETGLHY TTYRYYAPEV
GRFITPDPIG LAGGLNLYQY APNPLGWIDP WGLAGSPTTA THITYQGIDA ITGKPYVGYA
SMQGNQIAQD VLKYRYANDF SRFGGTPPEI LYDGYGQAGK YVTRGLEQRT FENLGGLDGT
ANKQNPVGQG NARRTEYLNA ADEHLSNKNG SRKGGGGRC