Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | YPK_2222 |
Symbol | |
ID | 6089656 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Yersinia pseudotuberculosis YPIII |
Kingdom | Bacteria |
Replicon accession | NC_010465 |
Strand | + |
Start bp | 2462363 |
End bp | 2463583 |
Gene Length | 1221 bp |
Protein Length | 406 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641597287 |
Product | imidazolonepropionase |
Protein accession | YP_001720956 |
Protein GI | 170024451 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG1228] Imidazolonepropionase and related amidohydrolases |
TIGRFAM ID | [TIGR01224] imidazolonepropionase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.201457 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTATCAG TAACTCACTG TGACAGCTTA TGGTTCGGGG CCGATATCAT TACGATGCGC GGGGGAAATT ATCAGTTGAT TCCGCAAGGG GCAATCGCTG TCACTGGCGA TAAGATAGTC TGGATTGGGC CACATGCCGA ATTACCGCCT ATTCATGCCG CACGTCAGGT CGTATATGAA GGTGGTCTTA TCACCCCCGG ATTGATTGAC TGTCACACCC ATCTCGTGTT TGGCGGTGAT CGTAGCAATG AATTTGAGCA ACGCCTTAAC GGGGTCAGCT ATGCCGAAAT TGCTGCTAAT GGCGGTGGTA TTATTTCAAC CGTCAGAGCC ACACGCCAAG CTAGCGAACA GCAACTACTG GAACAAGCCC TATTTCGTCT GAAGCCCTTA CTTGCTGAAG GGGTGACTAC GATTGAGATT AAGTCTGGCT ATGGCCTTAA TCTTGAAAGT GAAATAAAAA TGTTGCGAGT GGCCCGCCGA TTGGGGGAGT TACTGCCTAT TGACGTCAAA ACGACTTGTT TGGCCGCCCA TGCGCTACCG CCCGAGTTTA TCGGGCAGCC TGATGATTAT ATTGATGTCG TATGTAATAG CATTATTCCT CAGGTGGCAG TTGAAAACTT AGCCGATGCC GTGGACGCAT TTTGCGAACA TTTAGCTTTT TCACCGGCTC AAGTTGAGCG AGTATTTTTA GCCGCACAAA AAGCCGGGCT ACCTGTAAAA CTGCACGCAG AGCAACTTTC TGCTCTCCGT GGCGCGACTC TGGCCGCTAA ATTCCATGCG ATATCGGCAG ACCATTTGGA GTACGCAACC GAATCTGATG TCCAGGCTAT GGCAAATGCG GGTACTGTCG CAGTCTTACT ACCAGGTGCC TACTACTTAT TGCGGGAAAC ACAATGCCCC CCAATTGATC TGTTCCGCCA GTATAAGGTC CCCATGGCAC TGGCCAGTGA TGCCAACCCA GGGACATCTC CGGTACTTTC ACTACGCTTG ATGCTCAATA TGGCTTGCAC GTTATTCCGC ATGACACCAG AAGAAGCACT GGCTGGTGTC ACGTGCCACG CAGCTCAAGC TCTTGGTGTA CAACAGACTC AAGGTACGTT GGAGACAGGG AAATTGGCTA ACTGGGTGCA TTGGCCCTTA TCACACCCAG CCGAGTTAGC TTATTGGTTA GGAGGGCAAT TACCTGCCAC TGTCGTATTC CGAGGAGAAG TACGCCCATG A
|
Protein sequence | MVSVTHCDSL WFGADIITMR GGNYQLIPQG AIAVTGDKIV WIGPHAELPP IHAARQVVYE GGLITPGLID CHTHLVFGGD RSNEFEQRLN GVSYAEIAAN GGGIISTVRA TRQASEQQLL EQALFRLKPL LAEGVTTIEI KSGYGLNLES EIKMLRVARR LGELLPIDVK TTCLAAHALP PEFIGQPDDY IDVVCNSIIP QVAVENLADA VDAFCEHLAF SPAQVERVFL AAQKAGLPVK LHAEQLSALR GATLAAKFHA ISADHLEYAT ESDVQAMANA GTVAVLLPGA YYLLRETQCP PIDLFRQYKV PMALASDANP GTSPVLSLRL MLNMACTLFR MTPEEALAGV TCHAAQALGV QQTQGTLETG KLANWVHWPL SHPAELAYWL GGQLPATVVF RGEVRP
|
| |