Gene Information |
Locus tag | YpsIP31758_1823 |
Symbol | |
ID | 5386174 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Yersinia pseudotuberculosis IP 31758 |
Kingdom | Bacteria |
Replicon accession | NC_009708 |
Strand | - |
Start bp | 2105710 |
End bp | 2108694 |
Gene Length | 2985 bp |
Protein Length | 994 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 640864807 |
Product | putative toxin protein |
Protein accession | YP_001400798 |
Protein GI | 153947357 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3209] Rhs family protein |
TIGRFAM ID | [TIGR01643] YD repeat (two copies) |
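The coordinate, gene length, and protein length fields above are arithmetically related, and a short check can make that relationship explicit. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the CDS length includes the terminal stop codon; neither convention is stated in the record itself, although both are consistent with its values. The function names are hypothetical and used only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return cds_length_bp // 3 - 1


# Values taken from the record above.
start_bp, end_bp = 2105710, 2108694
length = gene_length_bp(start_bp, end_bp)
print(length)                     # 2985
print(protein_length_aa(length))  # 994
```

Both results agree with the Gene Length (2985 bp) and Protein Length (994 aa) fields. The strand field does not affect either calculation, since only the span of the feature is used.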
| |
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.0293647 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGAACAT CATTGTTCAG TAAAACCCCT TCGGTCACGG TTCTTGATAA CCGCGGCCTG ACCGTGCGCG ACATCGCATA CCACCGCCAT CCCGACTCGC CGGATGTTAC CAGTGAGCGT ATCACCCACC ATCAGTACGA CGCCCGCGGC TTTCTGACCC AGAGCGCCGA CCCGCGCCTG CATGGTGCCG GGCTGATGAA CTTCAGCTAC CTGACGGACC TGACCGGCCG CATCCTGCGC ACGCAGGGGG CGGATAACGG TACCACCGTC AGCCTGAATG ATGCTGCCGG ACGACCATTT ATCTCGGCCA GTAATATCAG CACCAGCGAC GACGGTACTG AAGACAGAGG CCAGGCGATG ACCCGCACCT GGCAGTATGA GGAGGCTTCA CTGCCGGGAC GTCTGGTGAG CGTAACCGAA CAGGTGACCG GCAAAGCCAC CCGCATCACG GAGCGCTTCG TGTACGCTGC CAATACTGAC GCGGAAAAAA GCCTGAACCT GGCCGGCGCG TGCGTCAGCC ATTACGACAC TGCGGGACTG GTGCAGCCGG ACAGCATTGC CCTGACCGGT GTACCGCTCT CTGTCACCCG CCGCCTGATG AAAAGTGCGG ATAACCCGGA CGCCGTGGCC GACTGGCAGG GGGCGGACGC CTCCGCCTGG AACGACCAAC TGGACGGCGA AACGCATACC ACCCTGACCA CCGCGGACGC CACCGGGGCG GTGCTGACCA CCACCGATGC GAAAGGTAAC CTGCAGCGCA TGGCGTACGA CGTGGCGGGC CTGCTGTCGG GCAGTTGGCT GACCCTGAAG GATGGCACAG AGCAGGTCAT CGTGAAGTCC CTGACATACT CAGCCGCCGG GCAGAAGCTG CGCGAGGAGC ACGGCAACGG CGTGGTGACC ACGTACGAAT ACGAGCCGGA AACGCAGCGC CTGGTCGGGA TTAAAACGGA ACGTCCCGCC GGGCACGCCT CCGGGGCGAA GGTGCTGCAG GACCTGCGTT ACGAGTACGA CCCGGTGGGC AACGTGCTGA AGGTCACTAA CGATGCGGAA GAGACGCGCT TCTGGCGCAA TCAGAAAGTG GTGCCTGAGA ACACGTACAC CTACGACAGC CTGTACCAGT TGGTCAGCGC CACCGGGCGC GAGATGGCAA ACGCCGGCCA GCAGAGCTGC AGCTTACCGT CCACCACCGT CCCCCTTCCT GCCGACAGCT CCGCGTATAC CCGCTACTCC CGAACCTATA CCTACGACGA AGCCGGCAAC CTGACGCAAA TTAGGCACAA TGCCCCGGCC ACCAACAACA GCTACACCAC AAAAATCACC GTCAGTGACC GCAGCAACCG GGGTGTACTA AGCACACTGA CCGAAAATGC CGCAGACGTG GACGCGCTGT TCACGGCAGG CGGGCAGCAG GCTCAGTTGC AGCCGGGACA GCATCTTATC TGGACGGCAC GTAATGAGCT GCTGAAGGTG ACGCCAGTGG TCCGCGACGG CAGCACGAAC GACAGCGAAA GCTACCGCTA TGATGCAGCC AGCCAGCGAA TCCTGAAGGT CAGCAGGCAG AAAACGAACA CCAGCATGCA GACACAGCGG GTGCTGTACC TGCCAGGGCT GGAGCTGAGG AGTACAAAAT CCGGCGATAC GGAAACGGAA GGCCTGCAGA TTATCACAGT GGGTGAGGCG GGCCGTGCGC AGGTGCGGGT GCTGCACTGG GAGAGCGGCA GGCCGGATGA AATCACTGAC GACCAGATAC GTTACAGTTA CGACAATCTG GCCGGTAGCT GCAGCCTGGA ACTGGGCGGT 
GACGGCAATA TCATCAGCGC GGAGGAGTAT TATCCGTACG GCGGCACGGC GGTCTGGGCG GTGCGGCGCG CGGTGGAAGC GGATTACAAA ACCGTCCGCT ACTCTGGTAA GGAACGGGAT GCGACGGGGT TGTACTACTA CGGGTACCGG TACTATCAGC CGTGGGCGGG CAGATGGCTC TCCGCAGACC CGGCTGGCAT GGTGGACGGG CTGAATCTGT TCAGGATGGC GCGCAATAAT CCTGTTGCGT TTATTGATCG TAATGGTCTT AATTCAGAAT TGTTATATTC TCAGGCATTC AAGCGGACGG CAAATAAATA TAACGTAATT ATTGGTGTAA GGGCACCTAA TCCATTAGGT GAAACTTTAT TAAAAGAAGG CTTTCCATCA AAAAATTTCC ATATGAAGGC AAAATCCAGT CCAACAGGAC CAACTGCAGG TTTTATTGCA GAAGACCCGA TTTACTCTAA GGTGTCACCT TCTGCATATA AAAAACAGAG AGCATCAATT GATAAGGCAA AAGCTTTGGG CTCAGAATCT ATAGATTTAT TCATTAGCAA ATCACGAATT AATGAGTTAA TAGAGACTGG GAATTTAAAT TTTCTAGGCG AAAATCGTTA CTCAGCGAAG TATCCGTATG GTACTCAAGA GTTTGAGATT GGAAATAATG GAAGAGTTTT AAATTCAGAG GGAAAACCTG TTAAAGTAAT GACTAATCCA CCGGAGATTG GAGAGAGAAA AAGCAATAGT TCCCCCATAA CAGCCGATTA CGATTTGTTT GCAATTATAC CAAGTGTTAA TCAGTCAGTT AATGAAAGAC CCCTGACAGT GCCTCATAAG TTATTACGTG GTAATTTTTC TCTCCCCTTC ACCAGTCCAA AAGGGAAAAA TGGCATGAGT GAAGATGTAA ATATGGGAAA TCTTCATCAT TTTGGGAAAA CGATCGTCAA CAGTTTAAAT AAAGAGATTA ACGCAGAGGG ATATGCTGGC GGTAAATTGG TCTGGCATAA CGATGAAGCT GGAAATCCAT TTAGCCCTGG ATTTGATGAG AATGACAAAC CAATTTTCTT CCTCCCATCA GGAGGTATGT TCCAGGCAAA AAATAAAAGT GAATTGCTTG GTTTTTATTC CAGATTGCGG AGGAGTGGAT ACACTCCAGA GCATAGTCCA ATTTTTGGTT TTTAA
|
Protein sequence | MRTSLFSKTP SVTVLDNRGL TVRDIAYHRH PDSPDVTSER ITHHQYDARG FLTQSADPRL HGAGLMNFSY LTDLTGRILR TQGADNGTTV SLNDAAGRPF ISASNISTSD DGTEDRGQAM TRTWQYEEAS LPGRLVSVTE QVTGKATRIT ERFVYAANTD AEKSLNLAGA CVSHYDTAGL VQPDSIALTG VPLSVTRRLM KSADNPDAVA DWQGADASAW NDQLDGETHT TLTTADATGA VLTTTDAKGN LQRMAYDVAG LLSGSWLTLK DGTEQVIVKS LTYSAAGQKL REEHGNGVVT TYEYEPETQR LVGIKTERPA GHASGAKVLQ DLRYEYDPVG NVLKVTNDAE ETRFWRNQKV VPENTYTYDS LYQLVSATGR EMANAGQQSC SLPSTTVPLP ADSSAYTRYS RTYTYDEAGN LTQIRHNAPA TNNSYTTKIT VSDRSNRGVL STLTENAADV DALFTAGGQQ AQLQPGQHLI WTARNELLKV TPVVRDGSTN DSESYRYDAA SQRILKVSRQ KTNTSMQTQR VLYLPGLELR STKSGDTETE GLQIITVGEA GRAQVRVLHW ESGRPDEITD DQIRYSYDNL AGSCSLELGG DGNIISAEEY YPYGGTAVWA VRRAVEADYK TVRYSGKERD ATGLYYYGYR YYQPWAGRWL SADPAGMVDG LNLFRMARNN PVAFIDRNGL NSELLYSQAF KRTANKYNVI IGVRAPNPLG ETLLKEGFPS KNFHMKAKSS PTGPTAGFIA EDPIYSKVSP SAYKKQRASI DKAKALGSES IDLFISKSRI NELIETGNLN FLGENRYSAK YPYGTQEFEI GNNGRVLNSE GKPVKVMTNP PEIGERKSNS SPITADYDLF AIIPSVNQSV NERPLTVPHK LLRGNFSLPF TSPKGKNGMS EDVNMGNLHH FGKTIVNSLN KEINAEGYAG GKLVWHNDEA GNPFSPGFDE NDKPIFFLPS GGMFQAKNKS ELLGFYSRLR RSGYTPEHSP IFGF
|
| |
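The gene sequence above is displayed in space-separated blocks of ten bases, so it needs to be cleaned before any computation. The sketch below shows one way to strip that formatting, compute a GC percentage, and check that the sequence ends with a stop codon. The helper names are hypothetical, the stop codon set (TAA, TAG, TGA) is the one used by translation table 11, and the short string in the usage lines is only the first 30 bases of the record, not the full gene.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons under translation table 11


def clean_sequence(text: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


def ends_with_stop(seq: str) -> bool:
    """True if the final three bases form a stop codon."""
    return len(seq) >= 3 and seq[-3:] in STOP_CODONS


# First three blocks of the gene sequence in this record, for illustration only.
prefix = clean_sequence("ATGAGAACAT CATTGTTCAG TAAAACCCCT")
print(len(prefix))                   # 30
print(prefix.startswith("ATG"))      # True
print(round(gc_percent(prefix), 1))  # GC% of this 30-base prefix only
```

Pasting the complete gene sequence into `clean_sequence` would allow the reported GC content (54%) and gene length (2985 bp) to be compared against values computed directly from the bases. Note that the displayed sequence already starts with ATG, so it appears to be given in coding orientation even though the gene lies on the minus strand.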