Gene YpsIP31758_1823 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYpsIP31758_1823 
Symbol 
ID5386174 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pseudotuberculosis IP 31758 
KingdomBacteria 
Replicon accessionNC_009708 
Strand
Start bp2105710 
End bp2108694 
Gene Length2985 bp 
Protein Length994 aa 
Translation table11 
GC content54% 
IMG OID640864807 
Productputative toxin protein 
Protein accessionYP_001400798 
Protein GI153947357 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG3209] Rhs family protein 
TIGRFAM ID[TIGR01643] YD repeat (two copies) 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.0293647 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGAACAT CATTGTTCAG TAAAACCCCT TCGGTCACGG TTCTTGATAA CCGCGGCCTG 
ACCGTGCGCG ACATCGCATA CCACCGCCAT CCCGACTCGC CGGATGTTAC CAGTGAGCGT
ATCACCCACC ATCAGTACGA CGCCCGCGGC TTTCTGACCC AGAGCGCCGA CCCGCGCCTG
CATGGTGCCG GGCTGATGAA CTTCAGCTAC CTGACGGACC TGACCGGCCG CATCCTGCGC
ACGCAGGGGG CGGATAACGG TACCACCGTC AGCCTGAATG ATGCTGCCGG ACGACCATTT
ATCTCGGCCA GTAATATCAG CACCAGCGAC GACGGTACTG AAGACAGAGG CCAGGCGATG
ACCCGCACCT GGCAGTATGA GGAGGCTTCA CTGCCGGGAC GTCTGGTGAG CGTAACCGAA
CAGGTGACCG GCAAAGCCAC CCGCATCACG GAGCGCTTCG TGTACGCTGC CAATACTGAC
GCGGAAAAAA GCCTGAACCT GGCCGGCGCG TGCGTCAGCC ATTACGACAC TGCGGGACTG
GTGCAGCCGG ACAGCATTGC CCTGACCGGT GTACCGCTCT CTGTCACCCG CCGCCTGATG
AAAAGTGCGG ATAACCCGGA CGCCGTGGCC GACTGGCAGG GGGCGGACGC CTCCGCCTGG
AACGACCAAC TGGACGGCGA AACGCATACC ACCCTGACCA CCGCGGACGC CACCGGGGCG
GTGCTGACCA CCACCGATGC GAAAGGTAAC CTGCAGCGCA TGGCGTACGA CGTGGCGGGC
CTGCTGTCGG GCAGTTGGCT GACCCTGAAG GATGGCACAG AGCAGGTCAT CGTGAAGTCC
CTGACATACT CAGCCGCCGG GCAGAAGCTG CGCGAGGAGC ACGGCAACGG CGTGGTGACC
ACGTACGAAT ACGAGCCGGA AACGCAGCGC CTGGTCGGGA TTAAAACGGA ACGTCCCGCC
GGGCACGCCT CCGGGGCGAA GGTGCTGCAG GACCTGCGTT ACGAGTACGA CCCGGTGGGC
AACGTGCTGA AGGTCACTAA CGATGCGGAA GAGACGCGCT TCTGGCGCAA TCAGAAAGTG
GTGCCTGAGA ACACGTACAC CTACGACAGC CTGTACCAGT TGGTCAGCGC CACCGGGCGC
GAGATGGCAA ACGCCGGCCA GCAGAGCTGC AGCTTACCGT CCACCACCGT CCCCCTTCCT
GCCGACAGCT CCGCGTATAC CCGCTACTCC CGAACCTATA CCTACGACGA AGCCGGCAAC
CTGACGCAAA TTAGGCACAA TGCCCCGGCC ACCAACAACA GCTACACCAC AAAAATCACC
GTCAGTGACC GCAGCAACCG GGGTGTACTA AGCACACTGA CCGAAAATGC CGCAGACGTG
GACGCGCTGT TCACGGCAGG CGGGCAGCAG GCTCAGTTGC AGCCGGGACA GCATCTTATC
TGGACGGCAC GTAATGAGCT GCTGAAGGTG ACGCCAGTGG TCCGCGACGG CAGCACGAAC
GACAGCGAAA GCTACCGCTA TGATGCAGCC AGCCAGCGAA TCCTGAAGGT CAGCAGGCAG
AAAACGAACA CCAGCATGCA GACACAGCGG GTGCTGTACC TGCCAGGGCT GGAGCTGAGG
AGTACAAAAT CCGGCGATAC GGAAACGGAA GGCCTGCAGA TTATCACAGT GGGTGAGGCG
GGCCGTGCGC AGGTGCGGGT GCTGCACTGG GAGAGCGGCA GGCCGGATGA AATCACTGAC
GACCAGATAC GTTACAGTTA CGACAATCTG GCCGGTAGCT GCAGCCTGGA ACTGGGCGGT
GACGGCAATA TCATCAGCGC GGAGGAGTAT TATCCGTACG GCGGCACGGC GGTCTGGGCG
GTGCGGCGCG CGGTGGAAGC GGATTACAAA ACCGTCCGCT ACTCTGGTAA GGAACGGGAT
GCGACGGGGT TGTACTACTA CGGGTACCGG TACTATCAGC CGTGGGCGGG CAGATGGCTC
TCCGCAGACC CGGCTGGCAT GGTGGACGGG CTGAATCTGT TCAGGATGGC GCGCAATAAT
CCTGTTGCGT TTATTGATCG TAATGGTCTT AATTCAGAAT TGTTATATTC TCAGGCATTC
AAGCGGACGG CAAATAAATA TAACGTAATT ATTGGTGTAA GGGCACCTAA TCCATTAGGT
GAAACTTTAT TAAAAGAAGG CTTTCCATCA AAAAATTTCC ATATGAAGGC AAAATCCAGT
CCAACAGGAC CAACTGCAGG TTTTATTGCA GAAGACCCGA TTTACTCTAA GGTGTCACCT
TCTGCATATA AAAAACAGAG AGCATCAATT GATAAGGCAA AAGCTTTGGG CTCAGAATCT
ATAGATTTAT TCATTAGCAA ATCACGAATT AATGAGTTAA TAGAGACTGG GAATTTAAAT
TTTCTAGGCG AAAATCGTTA CTCAGCGAAG TATCCGTATG GTACTCAAGA GTTTGAGATT
GGAAATAATG GAAGAGTTTT AAATTCAGAG GGAAAACCTG TTAAAGTAAT GACTAATCCA
CCGGAGATTG GAGAGAGAAA AAGCAATAGT TCCCCCATAA CAGCCGATTA CGATTTGTTT
GCAATTATAC CAAGTGTTAA TCAGTCAGTT AATGAAAGAC CCCTGACAGT GCCTCATAAG
TTATTACGTG GTAATTTTTC TCTCCCCTTC ACCAGTCCAA AAGGGAAAAA TGGCATGAGT
GAAGATGTAA ATATGGGAAA TCTTCATCAT TTTGGGAAAA CGATCGTCAA CAGTTTAAAT
AAAGAGATTA ACGCAGAGGG ATATGCTGGC GGTAAATTGG TCTGGCATAA CGATGAAGCT
GGAAATCCAT TTAGCCCTGG ATTTGATGAG AATGACAAAC CAATTTTCTT CCTCCCATCA
GGAGGTATGT TCCAGGCAAA AAATAAAAGT GAATTGCTTG GTTTTTATTC CAGATTGCGG
AGGAGTGGAT ACACTCCAGA GCATAGTCCA ATTTTTGGTT TTTAA
 
Protein sequence
MRTSLFSKTP SVTVLDNRGL TVRDIAYHRH PDSPDVTSER ITHHQYDARG FLTQSADPRL 
HGAGLMNFSY LTDLTGRILR TQGADNGTTV SLNDAAGRPF ISASNISTSD DGTEDRGQAM
TRTWQYEEAS LPGRLVSVTE QVTGKATRIT ERFVYAANTD AEKSLNLAGA CVSHYDTAGL
VQPDSIALTG VPLSVTRRLM KSADNPDAVA DWQGADASAW NDQLDGETHT TLTTADATGA
VLTTTDAKGN LQRMAYDVAG LLSGSWLTLK DGTEQVIVKS LTYSAAGQKL REEHGNGVVT
TYEYEPETQR LVGIKTERPA GHASGAKVLQ DLRYEYDPVG NVLKVTNDAE ETRFWRNQKV
VPENTYTYDS LYQLVSATGR EMANAGQQSC SLPSTTVPLP ADSSAYTRYS RTYTYDEAGN
LTQIRHNAPA TNNSYTTKIT VSDRSNRGVL STLTENAADV DALFTAGGQQ AQLQPGQHLI
WTARNELLKV TPVVRDGSTN DSESYRYDAA SQRILKVSRQ KTNTSMQTQR VLYLPGLELR
STKSGDTETE GLQIITVGEA GRAQVRVLHW ESGRPDEITD DQIRYSYDNL AGSCSLELGG
DGNIISAEEY YPYGGTAVWA VRRAVEADYK TVRYSGKERD ATGLYYYGYR YYQPWAGRWL
SADPAGMVDG LNLFRMARNN PVAFIDRNGL NSELLYSQAF KRTANKYNVI IGVRAPNPLG
ETLLKEGFPS KNFHMKAKSS PTGPTAGFIA EDPIYSKVSP SAYKKQRASI DKAKALGSES
IDLFISKSRI NELIETGNLN FLGENRYSAK YPYGTQEFEI GNNGRVLNSE GKPVKVMTNP
PEIGERKSNS SPITADYDLF AIIPSVNQSV NERPLTVPHK LLRGNFSLPF TSPKGKNGMS
EDVNMGNLHH FGKTIVNSLN KEINAEGYAG GKLVWHNDEA GNPFSPGFDE NDKPIFFLPS
GGMFQAKNKS ELLGFYSRLR RSGYTPEHSP IFGF