Gene YpsIP31758_0341 details

Gene Information

Locus tag: YpsIP31758_0341
Symbol:
ID: 5387008
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Yersinia pseudotuberculosis IP 31758
Kingdom: Bacteria
Replicon accession: NC_009708
Strand:
Start bp: 400246
End bp: 404829
Gene Length: 4584 bp
Protein Length: 1527 aa
Translation table: 11
GC content: 55%
IMG OID: 640863311
Product: YD repeat-/RHS repeat-containing protein
Protein accession: YP_001399335
Protein GI: 153950146
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG3209] Rhs family protein
TIGRFAM ID: [TIGR01643] YD repeat (two copies)
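
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, not part of the original record. It assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function name `expected_lengths` is hypothetical.

```python
def expected_lengths(start_bp, end_bp):
    """Return (gene length in bp, protein length in aa) for a CDS.

    Assumptions (not stated in the record): coordinates are 1-based and
    inclusive, and the final codon is a stop codon that is not translated.
    """
    gene_length = end_bp - start_bp + 1
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_length = gene_length // 3 - 1  # drop the stop codon
    return gene_length, protein_length


# Values taken from the Start bp and End bp fields of this record.
gene_length, protein_length = expected_lengths(400246, 404829)
print(gene_length, protein_length)
```

Under these assumptions the computed values agree with the Gene Length (4584 bp) and Protein Length (1527 aa) fields listed in the record.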


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACGGTCG CACAAAGTAT TGGCACCGGG TATGGCGCGG CTGGCGCGCA ATCGGCATTA 
CGTCAAACCG CGCTGGGGCA ACAGAGCCCG GCGCGGACGG ATTATCAGGT CAGTAATCCC
AATGTGGGGA ATATCGCGCG AGCGAGTGAT TCTCTGCTGA ATGTGGCGGA AAGTCAGCAG
TTTAACACTC TGGTGACCGC CGGTTTTGGG ATGCAGGCGA TTGCCGCCGG GGTGGCCGTT
CCCTTTACCG TCGGGCTGGC CGCCGGGATG GCCGGGGGAT ATGTCGGTGC CAAAATCGGC
AATAAGTTAG GTCACGGTAT TGCCCGTGCG CTTAATTTTA ATCAAGTGGC GACGGAAGGA
GAAAGCCCCG CGCACCTGGG CCATCCCATT GCCCATCAGA AAAAAGATTG GGGAGTGTGG
GGGGCCATTG GCGGTATTTT ACTGGGGGCA GCGGCGGCGG CATTGGTGGT CGTCACCTTT
GGTACGGGGT TGGTGGTGAT TGCCGCGGCG GCGGCCGCAG CCGGATTAAT CGGCGGTATC
GCCGCCGCGA CCGGGGCGGC ATTAGGACAA TACGGTGATA ACAAAGGCGT GATTGCTGAA
GGCTCCGCCA ACGTCTTCTT TGAAGGCCAG CCGGTGGCCC GGGTGGGGGA TAAGATCCAA
TGCAGCGATC ACCCCAGTTC ACCGCCGCCG ATGATTGCCG AGGGCGCAAA AACCGTGTTT
GCCAACCAAA AGCAGATCGC GCGTCTGGGG CACCGTACCA CCTGCGATGG CAATATCAAT
GCGGGATGCG GTTCGCTCGC CATCACGCAG GAAACGGCTT ATGTGTATGA AGTGGCAGAC
AGCCGTAATC CGTATTTACG CTGGTCAGCG GTTATTCTGA GTTTCTTGCC AATACAGAAA
AAATTTGAGC AAGGGTTTCG CTCCTTAAAA AAGCCGCCCA ATACCGCGGT CAACGCTACC
CATAACTGCC CGACCGGCAG TGATCCTGTT GATATGGTCT CCGGCGATTA CCTTCAGGTC
TGGCCCGTGA TTGATATTGC CGGTGTTTTG CCTGTCCGGT TACAGCGTAC TTATCGTTCC
GGCGATTATT TCACTGTCTC GGGTTGCTTT GGTCATAAAT GGGCGGATAG CTGGTCACAA
CACTTGGTCG TGCACGAAGA CAACATTGAT TACATTGACG AGGAAGGGGT GGGGCTGAGC
TTCTTTACGC CAGAGAATAA AGTTCAGGCC GTCAATCTGT ATAACCTGCG TTATGAGTTA
GTGGGGGAGC GTCATGGCGA ACTCAGGGTT TTTGACCGGT CAACTCAGCA GACACTGCAT
TTTAATCAAC AGCAACAACA GCAGCGTTAT TTATCGGCTA TCACCGACCG CAAAGGGAAC
CGAATCGATT TTCGTTACCA ACAGGGGGAA CTGATATCGG TCGAGCACAG TGACGGCTAT
GTGCTGGAGA TCGACAGCCG TGGCAGGACA ATACACGCGG TCGAGCTTGT CACGCAGGAG
AAAAGACAAA AGCTGTTGCA GAGTACATTC AGTGAGCGCG GTTACCTCGT TCAGTGCCAG
AGTTTCCAGT ATGGCACGCT GTCGCATGAA TATGATCCGA AGGGGTATAT GGTGCGCTGG
CGGGATACGG ATAGCACGGA TGTGGCTGTC CGTTATGATA TTTCAGGGCG GGTGGTGGCA
CTAAAAACAT CGACCGGTTT TTTTGCTGAC CATTTTATTT ATCATGACAA AGAACGCTAT
ACGATCTATC GCGATGGGGA AGGGGGAGAA ACCGGTTATC ACTATAATGA AAACAATCTG
CTGATAAAGT TAGTCGATCC GTTAGGCAAT ATCACCCTCA CCGACTGGGA CTTGACCCAG
AAAATAAAAG AAACCGATGC ATTAGGGCGT ATCACCCGGT TTATTTACAA TGAGCGCGGT
GATTTAACCG CCGTGATCCA CCCGGATGAA ACCCGTACCG AATATGAGTA TAACCCGTCT
GGTGTGGTCA CTGCGTTTAC GTCTTCGGCG GGGGATAGCT GGCAATATCA GTATGACAGG
CAGGGATTAT TACGACAGGT AACTTATCCT TCCGGCCAGA CGATGTCGTT TCGCTACGGC
AAGAAGGGAG AAGTGCTGCG CAAGATCATC GCGGAAGATC AGGTGTGGCG TTACCACTAT
GATCACCACG GTTGTTTAAG CACCATTATC GATCCGAAGG GGAACAGTAC AGCCGTCACA
CTGGATGTGC TGGGGCGGTT GTTTTCTCAC CAAAACGCCC TCGGGGAGCT CACCCGTTAC
ACACACAGTG ACGCGCATGC CAGTCCGGCG GGCAGCGTCA CAAAAATGGT CATGCCGGAT
GGTGTGGAGC AGGCCATTGC CTATGACAGC GAGAAGCGTA TTGCCGCGCT GACCGATGGC
GCGGGGAAAA CCACCCGTTA CGAATACGGG GGTTTTGATT TACTGACCGG CCTGATACGC
CCGGACGGGC AGCGGCTGAC CTTCGGCTAT GACACACTGA CGCGCCTGAA TCAGGTGACC
AATGCGTCAG GTGACACCTA TCGTTATACC CGTGATCGGG CCGGGCAGGT GATCAGCGAA
ACCGATTTTA CCGGGCGCAC GGTTCATTAT CAGTATGATG CGGTGGGGCG GCGTATCGGG
GCGCGTTACC CGGATCAGCG CCTTGTTCGC TGGCACTACT CGATGCAGGA TCAGGTGCTG
GCGCAGCAGA CCTGGCACTG TGATGCACTG AGTTCCACGC TGGTTGGCAC GGTCAGCTAC
GGTTATGACC GTGCCGGTCG TTTACTGAGC GCGACCAATG CGGATGCGGT GGTGGAGTTT
GATTACGACG AGGCGGGTCA GTTGGTTGCG GAGCGGCTGA ATGGCCGGGA GGTGCGGCAT
CAGTGGGATG CGCTAAACGG CACCCCTGTT GCCCGACAGG TGGGTGAGCT GGGGCTGACG
TTTGTGTACG GCGCACAGGG CGAACTGACC CGGCTACAGC TGGCGGGCCA TCAGCCCCTT
CAGTTGCAGC ATGACCGGTT GGGGCGAGAG ACGGTACGTG AAAGCGCGGC CGGGTTTATT
CAGGCCTGCA ATTATACCCC GAGCGGGCTG TTGGCCCATC AGGCGGCCGG GCGTAACTCG
GCATTATTCC AGCAACAGTT GATCGCCCCG GAGAGCCCGG CCTTGCACGG CAGTGCGGTC
AACCGCAGTT GGCAGTATGA CCGGGCTTAT AACGTGGTGG GGATGGATGA CGGGCGTTGG
GGTAAAACGC AGTACCAATA TGACCGCAAT GACCAGGTCG TCCGCGCTGA TTTTGGTGGT
TTTTTGCCGT TGCAGGAGCA GTTTAGCTAC GATGTGAACC AGAACCTGCG CGAGCATCGC
TGTTTACCGC GTGGGGCACA GGCGGTTTTG GCGCAGGCGA GCCAGCAACA GCAGGCGGGC
CGGGTGGTGA AGCGCGGTGA CAGTGAGTAC CGTTACGATG CTGGCGGGCG ACTGGTGGAG
AAACGCAGCC AAAAAGACGG CTACCGGCCA CAGCTGTGGC GCTACCGCTG GAATGAGCAG
GACCAGTTAT CCGAATTGAT CACGCCGACG GGGGCGCGCT GGCGCTACGG TTATGATGCC
TTTGGGCGAC GTATTCGAAA ACTGCGGGTA GTGGATACCC CGCCGCTCAA TGAGATGGAC
GCTCCTTCCA CCGGTCCGGC CACGGCATCA CTGGCCGGTT ACGCGTACCT GTGGAGCGGC
GACCAATTAA TCGAAGAGGT GCCGGTGTAT GCCGACGGCA CGGTGGCGTA TGAGCAGGGG
ATCCACTGGC TGTATGCGCC GGGCGGATTA ACGCCGATGG CCCGCTACGC GCAAGGCAAA
CTGCATTATG TGGTGGCGGA TCATCTGGGT ACGCCGCGGG AGTTGTTGAA TGAGCAGGGC
AAGGTGGTGT GGGCGAGCCG TCTGAGCACG TGGGGGCAGG CGGAATTATG GCGACAGGCG
GCGAATGAGG AGGATCGGGT CAGCTGTAAC CTGCGTTTTG CGGGTCAGTA CGCGGATGCA
GAATCCGGGT TGCATTACAA CCGGTTCCGC TACTATGATG GCGAGACGGG ACAATATCTA
TGCCCGGACC CGATAGGGCT GGCGGGGGGA TTAAACCCGT ACGGGTATGT GCATAATCCG
GTGAAGTATG TGGATCCGCT GGGGTTGTGT AAAACTGATG TAGCAAGAGA GCGCCAAGCC
CAAATGCTGC AAGATGATGT AGGTTATAAT ATAAGCCCTA AGAGTTGGGA TCAGTTCCCT
TCAATTGGTA GAGATGGTTC GTTTATTACG GATAAAAAAG GGGCCCTTAA ATATTTTAAT
GGTATGCAAA CTGGTAATGT AACTATTTCT AAGTCAGTAG CAGCATCTAT TGAAAAGGAT
ATGGGATTAA GTCTTGGTTC TTTAAACGGT GGATTTAATA TAAGAAAAAT TGACGGCATT
TCTAATATGC AACCACGAAG CCCATTGAGT GGTAATGATT ACTTCCTCGG TCCAGGTCAG
CATTTACCGG GGGGCGCTCC TGAAATGGTT ATAAACTCTG TGCCGACATC AACACCAGTC
ACTATAAGGG TAAGTGTAAA ATGA
 
Protein sequence
MTVAQSIGTG YGAAGAQSAL RQTALGQQSP ARTDYQVSNP NVGNIARASD SLLNVAESQQ 
FNTLVTAGFG MQAIAAGVAV PFTVGLAAGM AGGYVGAKIG NKLGHGIARA LNFNQVATEG
ESPAHLGHPI AHQKKDWGVW GAIGGILLGA AAAALVVVTF GTGLVVIAAA AAAAGLIGGI
AAATGAALGQ YGDNKGVIAE GSANVFFEGQ PVARVGDKIQ CSDHPSSPPP MIAEGAKTVF
ANQKQIARLG HRTTCDGNIN AGCGSLAITQ ETAYVYEVAD SRNPYLRWSA VILSFLPIQK
KFEQGFRSLK KPPNTAVNAT HNCPTGSDPV DMVSGDYLQV WPVIDIAGVL PVRLQRTYRS
GDYFTVSGCF GHKWADSWSQ HLVVHEDNID YIDEEGVGLS FFTPENKVQA VNLYNLRYEL
VGERHGELRV FDRSTQQTLH FNQQQQQQRY LSAITDRKGN RIDFRYQQGE LISVEHSDGY
VLEIDSRGRT IHAVELVTQE KRQKLLQSTF SERGYLVQCQ SFQYGTLSHE YDPKGYMVRW
RDTDSTDVAV RYDISGRVVA LKTSTGFFAD HFIYHDKERY TIYRDGEGGE TGYHYNENNL
LIKLVDPLGN ITLTDWDLTQ KIKETDALGR ITRFIYNERG DLTAVIHPDE TRTEYEYNPS
GVVTAFTSSA GDSWQYQYDR QGLLRQVTYP SGQTMSFRYG KKGEVLRKII AEDQVWRYHY
DHHGCLSTII DPKGNSTAVT LDVLGRLFSH QNALGELTRY THSDAHASPA GSVTKMVMPD
GVEQAIAYDS EKRIAALTDG AGKTTRYEYG GFDLLTGLIR PDGQRLTFGY DTLTRLNQVT
NASGDTYRYT RDRAGQVISE TDFTGRTVHY QYDAVGRRIG ARYPDQRLVR WHYSMQDQVL
AQQTWHCDAL SSTLVGTVSY GYDRAGRLLS ATNADAVVEF DYDEAGQLVA ERLNGREVRH
QWDALNGTPV ARQVGELGLT FVYGAQGELT RLQLAGHQPL QLQHDRLGRE TVRESAAGFI
QACNYTPSGL LAHQAAGRNS ALFQQQLIAP ESPALHGSAV NRSWQYDRAY NVVGMDDGRW
GKTQYQYDRN DQVVRADFGG FLPLQEQFSY DVNQNLREHR CLPRGAQAVL AQASQQQQAG
RVVKRGDSEY RYDAGGRLVE KRSQKDGYRP QLWRYRWNEQ DQLSELITPT GARWRYGYDA
FGRRIRKLRV VDTPPLNEMD APSTGPATAS LAGYAYLWSG DQLIEEVPVY ADGTVAYEQG
IHWLYAPGGL TPMARYAQGK LHYVVADHLG TPRELLNEQG KVVWASRLST WGQAELWRQA
ANEEDRVSCN LRFAGQYADA ESGLHYNRFR YYDGETGQYL CPDPIGLAGG LNPYGYVHNP
VKYVDPLGLC KTDVARERQA QMLQDDVGYN ISPKSWDQFP SIGRDGSFIT DKKGALKYFN
GMQTGNVTIS KSVAASIEKD MGLSLGSLNG GFNIRKIDGI SNMQPRSPLS GNDYFLGPGQ
HLPGGAPEMV INSVPTSTPV TIRVSVK
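
The relationship between the gene sequence and the protein sequence above can be illustrated with a short translation routine. This is a sketch only, not the pipeline used to produce the record. It assumes the sequence is read in frame from its first base, and it uses the standard codon-to-amino-acid assignments, which translation table 11 shares for elongation codons (table 11 differs in which codons may act as start codons). The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Standard codon assignments in TCAG order; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate an in-frame DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in this record) is ignored.
    """
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 24 bases of the gene sequence in this record.
print(translate("ATGACGGTCG CACAAAGTAT TGGCACCGGG"))
print(translate("ACTATAAGGG TAAGTGTAAA ATGA"))
```

The two fragments translate to the first ten residues (MTVAQSIGTG) and the last seven residues (TIRVSVK) of the protein sequence listed above.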