Gene Information |
Locus tag | YpsIP31758_0795 |
Symbol | |
ID | 5387624 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Yersinia pseudotuberculosis IP 31758 |
Kingdom | Bacteria |
Replicon accession | NC_009708 |
Strand | + |
Start bp | 954541 |
End bp | 957963 |
Gene Length | 3423 bp |
Protein Length | 1140 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 640863758 |
Product | hypothetical protein |
Protein accession | YP_001399779 |
Protein GI | 153949709 |
COG category | [S] Function unknown |
COG ID | [COG3523] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
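The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is only a consistency check on the values in this record. It assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 954541
end_bp = 957963

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS length includes the terminal stop codon,
# which does not contribute an amino acid to the protein.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # prints "3423 0 1140"
```

Under those assumptions the result matches the listed Gene Length (3423 bp) and Protein Length (1140 aa).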
|
|
Plasmid Coverage information |
Num covering plasmid clones | 34 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAAGCGCA TTGCCTTACC GATTAAAAAG CCAGAAGTCT GGTTCTGGAT TGTTGCTCTG TTATTTTTGC TGGCGGGTGC CGTGTTGTGC TGGCTGGTCT GGCAACATCC AGAACGGGTA GGATTAATTC AGGGAACACC GCAACGTGAT CGCTGGCTGA CGGGACTGGT GGTGGGAACA GGGATACTGA CCTTGTGCGC ATTACTGTCG TATGTGGGGA CCCGATTATC CGGCAGAAAA CACTTTGATG AGATACGGCA GCAGGCACAA GGTGATGATG TGTCGTTGCT GAAAGAAAAC GCGAAGGTGG GAGATATTCA GCAGAGCGAA CTGTCCCATT TGAAAACCCG TCTGCGCCGC CGTTACGGGC TGTTCTGGCG CTACAAAGTC CGGTTGCTGA TGGTGGTGGG AGAGCCTGAC GAAATCGCTG CATTGGCTCC GCAACTGGCG GAACAGGGCT GGCTGGAAGG GCAGCGTACG GTGCTGATTC ACGGTGATAG TGTCCAAAGT CCAGCCGATG AAACCGGGCT GAGTGAGTGG CGTAAACTGC GCCGTGGCCG CCCGTTTGAT GGCATTGTTT GGGCGATAAC CGCCGCGCAA AGTAGTCATC CACAGTGGAT GGACAACGGC CTGCGGACGC TGGAGAAAAT GGGCGCAACC TTGCGCTATC AACCGCCGGT GTATCTGTGG CAGGTCTGCG GCAGCGGCTG GCCTCAGGAT ACGCGGGTGG AGCAGCCGGT CGGGGTGGTG TTCCCGGCGA AGGCCACGCC AGAGCAGGTT GAGCTTCAAC TGCGCGCTCT GTTGCCTCAG TTACGTGAGC AGGGGATGCA GCAGCTTTCT GTCGAACCGC ACCATGATTT CTTGTTGCGT CTGGCGCAGT CGCTGGAGCA GGGCGATGCC ACCCGCTGGC GGCAGCGTCT GACGCCGTGG TTTACCGAGT ATGCGGCGCG TATTCCCCTG CGTGGGTTGA TGTTCAGTTT GCCGGAGGTA TCAACATCGG CGGCGCGTGT GCACGAGAAA AACTGGACTG CATCTGACCG CTGGCAGGGG ATACTGGATG ACTGTCGTTC TGCCCGTGGA CGCCGTGTCG GGCTGCCGTG GGAGCAGACA TTGTGCTACA GCTTGCTGGC ACTGATAGTG CTGTGGGGCG TGGGCAGTGT CGTGTCATTT GCGGTCAACC GTCACCAAAT GGTCTCTGCC GCGCAGCAGG CACAGCAACT GGCGCAGTCG CAAGCGGTCT CTGACCAGCA ACTCATGGCC CTACAAGCCC TGCGCAATGA TATCGGGCGC TTACAATCTC GCGTGGCGCA GGGGGCACCT TGGTATCAGC GCTTTGGCCT GGATCATAAT GCGCCGCTGC TTGAGGTCCT GATGCCGTGG TATGGCCAGG CCAACAACCG CATTCTTAGG GATGCTACCG CGCAAAGCCT GCATCAGAAA CTCAGTGAAC TGGCGGAGCT GCCCGCCAAC AGCCCGCAAC GTGCGGTACT GGCAAAAACA GGCTATGACC AGTTAAAGGC CTATCTGATG ATGTCACGTC CGGAAAAAGC CGATGCTGCA TTTTATGCAC AGGTGATGCA GACCACCGAG CCTGTGCGTG CCGGTGTATC TCCTGGTCTG TGGCAGAGTC TGGCCCCAGA CTTATGGCAA TTTTATGCGC AAAACCTGCC TGCTCAGCCG GACTGGAAAA TTAAACCGGA TACCGGGTTG GTAAGCCAGG TACGGCAGGT CTTACTGGGG CAGATTGGTC AGCGCAATGC AGAAAGTACG CTGTATGAAA ACATGCTGCT ATCAGTGCGT 
CGCAACTATG CCGACATGAC GCTGACGGAC ATGACCGGCG ACACTGATGC TCAGCGTCTG TTCCAGACCT CGGAATCCGT GCCCGGTATG TTTACCCGCA AGGCCTGGGA TGAGCAAATC CAGCAGGCAA TAGATAAAAC GGTGGCCTCC CGCCGCGAAG AGATTGACTG GGTATTGAGT GATAACCGCC GGGCGATATC CGAAGATATC TCACCGGAAG CGCTGAAAAA ACGCCTGACC GAACGTTATT TCACCGATTT TGCTGGCAGT TGGCTGAGTT TCCTCAACAG TTTGCACTGG AATGAGGCGC ATAACCTGTC GGATGTGATT GACCAACTGA CCTTGATGAG TGATGTACGC CAGTCGCCGC TGATTGCGCT AATGAACACG CTGGCGTGGC AAGGGCAGAC CGGGCAGCAG AATCAGGCAT TATCGGATTC GCTGGTGAAG TCTGCCAAGG CGCTGATGAA TAAAGACCAG GCTCCGGCGA TTGACCAGAG TGCCGGTGGG CCAGTAGGGC CACTGGACGA GACCTTTGGC CCGTTGCTGG CACTGATGGG CAAAGGCGAT GCACAAAACA GGCTGTCGTC GGACAGCTCG CTGAGCCTGC AAACGTTGCT CACCCGCGTG ACCCGGGTGC GGCTTAAACT CCAGCAAGTG GTTAATGCCT CGAACCCACA AGAGATGACT CAGGTGCTGG CCCAGACCGT TTTCCAGGGG AAAAGTGTCG ACCTGACGGA CACGCAGGAG TACGGCAGCC TGATTGCCGC CAGTCTGGGG GAAGAGTGGA GCAGTTTCGG GCAGACGATG TTTGTTCAGC CGCTGACGCA GGCGTGGGAG ACCGTGTTGC AACCTTCGTC GGCCAGCCTT AACGACCAGT GGAAAAACGC GGTGGTGGCC AATTGGAAAT CAGCCTTTGA CGGGCGTTAC CCGTTTGCCG CCAGTAAAAG CGATGCTTCA CTGCCGATGC TGGCCGAATT TATTCGTAAG GATAGCGGGC GTATCGACAG CTTCCTGACC CGTGAGCTGG GTGGCGTGCT GCATAAAGAA GGGACGCGCT GGGTTCCGAG TAAAGTGAAC AGCCAGGGGC TAACGTTTAA CTCGGACTTT CTGGCGGCCA TTAATCAACT GAGTCAAGTC TCCGATATTC TGTTCACTGA CGGCAGTCAG GGGCTGCGCT TTGAGCTACT GGCACGCCCG GTTCCGAATG TGGTGGAAAC CCATTTAGCG ATTGATGGAC AGAAATTGCA TTATTTCAAC CAGATGGAAA GTTGGCAGAG TTTCCGCTGG CCGGGTGATA CCTACAAACC CGGCACGCTA TTGACCTGGA CTGGCGTGAA TTCCGGGGCC CGTTTATACG GTGATTATCA GGGGACATGG GGATTGATCC GCTGGCTGGA ACAGGCAAAA CAGAAAAAGC TGGATGAAGG GCGTTATCAA CTGACCTTCA CCACCGCGGA TAACCTACCA CTGCAATGGA TATTACGCAC TGAGCTGGGC AAAGGTCCAT TAGGTCTGCT GCAACTGCGT AACTTTACCC TGCCTGCGCA AATTTTTCTG ATACAGAGCG CCCCTTCAGC GGCATCTGAT CGGACCGATG ATGAGGATAT GGCAGAGGAT TAA
|
Protein sequence | MKRIALPIKK PEVWFWIVAL LFLLAGAVLC WLVWQHPERV GLIQGTPQRD RWLTGLVVGT GILTLCALLS YVGTRLSGRK HFDEIRQQAQ GDDVSLLKEN AKVGDIQQSE LSHLKTRLRR RYGLFWRYKV RLLMVVGEPD EIAALAPQLA EQGWLEGQRT VLIHGDSVQS PADETGLSEW RKLRRGRPFD GIVWAITAAQ SSHPQWMDNG LRTLEKMGAT LRYQPPVYLW QVCGSGWPQD TRVEQPVGVV FPAKATPEQV ELQLRALLPQ LREQGMQQLS VEPHHDFLLR LAQSLEQGDA TRWRQRLTPW FTEYAARIPL RGLMFSLPEV STSAARVHEK NWTASDRWQG ILDDCRSARG RRVGLPWEQT LCYSLLALIV LWGVGSVVSF AVNRHQMVSA AQQAQQLAQS QAVSDQQLMA LQALRNDIGR LQSRVAQGAP WYQRFGLDHN APLLEVLMPW YGQANNRILR DATAQSLHQK LSELAELPAN SPQRAVLAKT GYDQLKAYLM MSRPEKADAA FYAQVMQTTE PVRAGVSPGL WQSLAPDLWQ FYAQNLPAQP DWKIKPDTGL VSQVRQVLLG QIGQRNAEST LYENMLLSVR RNYADMTLTD MTGDTDAQRL FQTSESVPGM FTRKAWDEQI QQAIDKTVAS RREEIDWVLS DNRRAISEDI SPEALKKRLT ERYFTDFAGS WLSFLNSLHW NEAHNLSDVI DQLTLMSDVR QSPLIALMNT LAWQGQTGQQ NQALSDSLVK SAKALMNKDQ APAIDQSAGG PVGPLDETFG PLLALMGKGD AQNRLSSDSS LSLQTLLTRV TRVRLKLQQV VNASNPQEMT QVLAQTVFQG KSVDLTDTQE YGSLIAASLG EEWSSFGQTM FVQPLTQAWE TVLQPSSASL NDQWKNAVVA NWKSAFDGRY PFAASKSDAS LPMLAEFIRK DSGRIDSFLT RELGGVLHKE GTRWVPSKVN SQGLTFNSDF LAAINQLSQV SDILFTDGSQ GLRFELLARP VPNVVETHLA IDGQKLHYFN QMESWQSFRW PGDTYKPGTL LTWTGVNSGA RLYGDYQGTW GLIRWLEQAK QKKLDEGRYQ LTFTTADNLP LQWILRTELG KGPLGLLQLR NFTLPAQIFL IQSAPSAASD RTDDEDMAED
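The gene sequence begins with GTG while the protein sequence begins with M. This is consistent with translation table 11, in which GTG can act as an initiation codon and is then read as methionine. The following is a minimal sketch of that relationship, not the database's own pipeline. The function name `translate_cds` is hypothetical, and the sketch assumes the input is a complete CDS that starts at a valid initiation codon.

```python
from itertools import product

# Amino acid assignments in NCBI order (bases T, C, A, G). For sense codons,
# table 11 uses the same assignments as the standard code; it differs in
# which codons are accepted as starts.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(dna: str) -> str:
    """Translate a CDS, reading the first codon as Met (hypothetical helper).

    Assumption: the first codon is a valid initiation codon, so it is
    emitted as M regardless of its usual assignment (GTG is normally V).
    """
    dna = "".join(dna.split()).upper()  # drop the 10-base block spacing
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append("M" if i == 0 else aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence in this record.
first_blocks = "GTGAAGCGCA TTGCCTTACC GATTAAAAAG"
print(translate_cds(first_blocks))  # prints "MKRIALPIKK"
```

The output matches the first block of the protein sequence listed above. Passing the full gene sequence with the same whitespace layout should work the same way, since whitespace is stripped before translation.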