Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | YpsIP31758_2494 |
Symbol | |
ID | 5384473 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Yersinia pseudotuberculosis IP 31758 |
Kingdom | Bacteria |
Replicon accession | NC_009708 |
Strand | - |
Start bp | 2817531 |
End bp | 2820572 |
Gene Length | 3042 bp |
Protein Length | 1013 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 640865485 |
Product | hypothetical protein |
Protein accession | YP_001401463 |
Protein GI | 153950521 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 0.645652 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGACTTA AAGACCAGCT TCGCCAGAAT GCCGCCGCCA GCCAGCAGGC GACGTGCACG CCGTCAGGGT GTAAAGCCTG TCAGCGACAA GGGTTACCGA TATTTCCGCT GCGTGTAGCG GCGGTACCGA AAGCGTTGGT GAACTCTGGC TGGCAGCCGG CGGTGCCGCA GCAGGATATT GAACTGACCG GCGGTGAATT TAAATACGCA CTACGTACCC TGCGCATGGG GTATTTGTAT GTGTTGCTGG ATAAAACCAT CTGGCAGGGG TATCAGGTGA CAGCCGAAGG GTATCTGCGG CAGTTCAATC CGCTGGCGAT GCCGGAAGGC GAAACAGTCG AGCCGCTGAG CACCGCGTGT CTGACGCACG GGCACGATAT CGCCGCCAGT TTCATCAACA TAGATGACAC AAAATATCAG GTGGCCTGGC TGGCGTTTAG CAGTGACCCG TGGAGCGAAA CGGTACTGGC GGAGTATAAA AGCGGTCAAC GCCCGGCGAG CCGCTTCACA CAAATTGTGC TGTCAGAACT GAAAGCCAGC CCGGCCAGCA TACCCGAAGC ACTGGCGCTG AACCCTTCGC TCTCTGCGCT GAAGTCCCAT GTGGCGGAGT TTGCGACAAA ACTCTTCTCG AATACCGAGA AAGTGGCGGG TGAGCCGCTG GGTGGTGCGC ATGGTTTTTA TCCACGTCTG GACAGCGAAA TGGCGCTGGG GCTGCGTATT GCTCAGTTGG GCGCGCAATA TCGCTGCCAG ATAGCGGCGC TGGCACTGGA TGACACGGTG GGCGTGGTGC AGGAGCTGAA TAACAGCCGT ATGCAGATTG TGGAAGCGCG TCAGGTGTAT AGCCAACAAC CCCAAATGTT GCATAAAAAT ACGATTTCTC AGGCGATTGA ACAGTATCTT ACGCATTTGA AAACGGCTGT AGAAGACAGC AGCCAACCAC GCTATGAAAA AGCCGGGAGT TATCCCGTTT GGGGTGGCAC AGTCATCCCT AAAGAACAGG TAGCGCAGGA AAGCTGGGCT GAACAGTACG CCCGCCTGCT AAAAAGTTAT AACGAGCCGG TGCGTGCTGC TTTTGCCAAA AAATATGAAG CGCAAATGAA CGGTTATCAG CAGCGGATTG AGGCCATCGG CAGTGACCTG GCTGGCTGGT ATCGGTCGCG TAAATGGCTG AATGTCATCA AAGACGATTA CGCTCCGGAA GGGCGTCCTC CCTGCTGGGC TGCCCAATTT AATACTCTGA CGGCCTGCCT TCAGGGCGGT TCGATGGATA CACAGACGGA CGCGGTCTGG CAAGAGTGGC TGAAACAGCC CGATTCCCCC GCCTATGCCG GGCTTTTGGT CGATCAATCG TCCCTTTTGG CGAACGTATT CGATGGCAGC AATGGCTATT CTTATCTGAA AACCGGACTG GGCTCTGATG AGATGAGCCG CTATCTGGAA TCCCCCGGGG TGCAGAACGC CATATCCACC CGTCTGGCGG CCTTAAGCGG GGCGGCCAGT AGAGTTATGT CAACACTGGA TGAGGCCGCG CAAGCCGGTT ATACCCGGGT GATGCAAGGG AGCATTTACG CGACCACCGG GCAGCAAATA ACCTTGTTTA AAGTGACGAC CACCGTGGAG AAATTTCAAC AATACGTCCG GGCGGCGTCA TGGTTACCCC CTTCAGTGGC AGAGAATCAT GCGACATTTG GGGGCATATA CAGCCGGGGT ACACAGGCAG TGAACACTTC GGCAGCCGGG GCGATGATGG AGATAACCGA CCCGGTTTTA CTGAGGAAAA GTATGACGGC GTATATCAGC AGCCCCGTGT CGCTGGAGGA ACTGAAAACC ACGCTGAAAA GTGCCGGGAC CAACCCGGAG GCGATAGTAC GTGCCCCGCT AAAAAACCCC AATCTGCTGA ACCACTTTTC TGACCTGAGA ATTGCTTCAA CGTCACTGCG TGCTGATGCC GGGAGTGTGG TGACCCCCCT GAACATGCCA GAGACGGACG AGGTGCTTAA AGAGCGTACC CGTCGTTATG CCAGCGGCAA TGGGCTGGGG ATGGTGCTGT CTGCCGGTAT GATGTGGCTC CAGTTGCGGG ACTGGAATGA GAATGCAAAA AATCTGGAGA AAGCCATTGG TAATGATACC GATGCCAGTC TGAAGTATTA CATCAACCGT TTGATGGTGC TCAGTGCCGC CACCGAGATC GCAGGCTTTA GTCGGATGCT GACGATCAAA AATAACTGGA ATGTCTTACC CGAACACTTT GTTCATCCAT TGATCCGAAT TGGCGGTGTG ATAGCCGGTA TGGCGGCGGT TGTGGAGGGG GTGAGGATGG GGATGTTGGG ATGGGAAGCC GATAAAAATG GGGATGATGA ATCGGCCCAA TTATACTACG TTGCTGCAAT CGTTACAGTT GTAGGGGGCG GAGTTAGTGG TATCGGAGCT TTGTTGGGTT CGTTTGCCTT ATTAGGGCCT GCTGGGATTG GCGCACTATT AATATTGACT GGCGCACTGT TTGCCTTTGA GGCCAGCCAG CTTAGAAGTA CGCCGTTTGA AGTCTGGTTG CGTCGCAGTT GTTTTGGGAT CCCACATGAC AAAGATGTGG TGTGGCATGA GGACAGCTTG CAGGATTTAA ATGCCTCACT GACCGCTTTT AATGCCATTG TAAATGGCAT GGCGGTTGAG GTGGGTTATG AAGGGTTGTC TGAATTACAG GGTATCCGCT ATACCAAACT GGAATTGCGC CTGAGCCTGC CGGGGGGTAA AGAGGCGACA TCAGCCTGGG AGCTGCGCCT GACCGGAGGG GAGGAAAATA CAGTACTGCT AGCCGAAACA CATAATGTAC CGGGAAAACC GAACCACAGG CTGGCGGCTC CAACGTCAGA ATATTATTCC GGGCGTTATA AACGGGCCGC AGAAGGCAAT AATCTGGAAA TCAGGGCTGA GGTCTGGGTT AATGAGAGTC GCTATGGCAA AGCAACATTG GACGTCAATT ACTGGCCGGA TAAAACCGAC CCGCAGTATC AACTGTCTCT GGTTGTGAAT GCTGAGAAAT AA
|
Protein sequence | MGLKDQLRQN AAASQQATCT PSGCKACQRQ GLPIFPLRVA AVPKALVNSG WQPAVPQQDI ELTGGEFKYA LRTLRMGYLY VLLDKTIWQG YQVTAEGYLR QFNPLAMPEG ETVEPLSTAC LTHGHDIAAS FINIDDTKYQ VAWLAFSSDP WSETVLAEYK SGQRPASRFT QIVLSELKAS PASIPEALAL NPSLSALKSH VAEFATKLFS NTEKVAGEPL GGAHGFYPRL DSEMALGLRI AQLGAQYRCQ IAALALDDTV GVVQELNNSR MQIVEARQVY SQQPQMLHKN TISQAIEQYL THLKTAVEDS SQPRYEKAGS YPVWGGTVIP KEQVAQESWA EQYARLLKSY NEPVRAAFAK KYEAQMNGYQ QRIEAIGSDL AGWYRSRKWL NVIKDDYAPE GRPPCWAAQF NTLTACLQGG SMDTQTDAVW QEWLKQPDSP AYAGLLVDQS SLLANVFDGS NGYSYLKTGL GSDEMSRYLE SPGVQNAIST RLAALSGAAS RVMSTLDEAA QAGYTRVMQG SIYATTGQQI TLFKVTTTVE KFQQYVRAAS WLPPSVAENH ATFGGIYSRG TQAVNTSAAG AMMEITDPVL LRKSMTAYIS SPVSLEELKT TLKSAGTNPE AIVRAPLKNP NLLNHFSDLR IASTSLRADA GSVVTPLNMP ETDEVLKERT RRYASGNGLG MVLSAGMMWL QLRDWNENAK NLEKAIGNDT DASLKYYINR LMVLSAATEI AGFSRMLTIK NNWNVLPEHF VHPLIRIGGV IAGMAAVVEG VRMGMLGWEA DKNGDDESAQ LYYVAAIVTV VGGGVSGIGA LLGSFALLGP AGIGALLILT GALFAFEASQ LRSTPFEVWL RRSCFGIPHD KDVVWHEDSL QDLNASLTAF NAIVNGMAVE VGYEGLSELQ GIRYTKLELR LSLPGGKEAT SAWELRLTGG EENTVLLAET HNVPGKPNHR LAAPTSEYYS GRYKRAAEGN NLEIRAEVWV NESRYGKATL DVNYWPDKTD PQYQLSLVVN AEK
|
| |