Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mmwyl1_3900 |
Symbol | |
ID | 5368200 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Marinomonas sp. MWYL1 |
Kingdom | Bacteria |
Replicon accession | NC_009654 |
Strand | - |
Start bp | 4397086 |
End bp | 4398306 |
Gene Length | 1221 bp |
Protein Length | 406 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 640806288 |
Product | imidazolonepropionase |
Protein accession | YP_001342732 |
Protein GI | 152997897 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG1228] Imidazolonepropionase and related amidohydrolases |
TIGRFAM ID | [TIGR01224] imidazolonepropionase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 40 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACAAACG ATACTCCAAT GAAATTAGAC AGCTTATGGC GTGGCGCTCA TATCGCCACC ATGAAAGATG GCCTGTATAG CGTGATTGAA AACGCAGCGA TAGGCGTCGT CGGCGGCCGT ATCGTTTGGA TTGGCGAAGC GCAAGATCTG CCAGACTATG AGACTCAAAG CGAACATGAT CTGGATGGCG GTTGGATTAC ACCTGGCTTG ATCGATTGTC ACACGCACCT TGTTTTTGGT GGTAACCGTG CAGGTGAATT CGAACAACGA TTGAATGGCG TAAGTTATCA AGAAATCGCC AAGCAGGGCG GCGGTATAGC ATCCTCTGTA AAGGCCACTC GTGACGCCAG CGAAGAGGAG TTAATCGCCA GTGCTTCTCG TCGTCTGAAA AGTTTAATAG CCGATGGTGT GACCACAGTT GAGATCAAAT CAGGTTATGG ACTGTCACTG GATGCGGAGT TGAAAATGCT GCGAGTGGCA GGTCAACTAG GTAATGATTT TCCGGTGACT GTAAAGCGTA CTTGTTTGGC GGCACATGCC ATGCCTCCTG AGTTTGTGGA GAAAGACGAC TACATAGATT ATTTATGCGA GACCTTGCTG CCGAAGGCAG CTAAGCTAGG CATGGCGGAT GCCGTGGATG CATTTTGTGA AGGCATTGCT TTTAGCACAG AACAAGTTGC CCGCTATTTT AAAACGGCGG AATCTTTAGG CTTACCAGTG AAAATTCATG CGGAGCAGTT GTCGTCATTA GGTGGAACGG CCATGGCCGC GTCCTTCAAA GCCTTGTCGG CCGATCATAT CGAATTCATT GAAGAATCAG ACGTCAAAGC CATGGCAGAA TCGGGCACAG TGGCGGTGTT ATTGCCTGGG GCATTTTTTA CCCTAAAAGA AACTCAATGC CCTCCAATTG ATTTACTTCG ACAATATGGC GTTCCCATGG CGGTTGCTAC TGACGCTAAC CCTGGTACTT CGCCAGCTTT ATCGCTTCGG CTCATGATGA ATATGTCTTG TACTTTGTTT GCTCTGACGC CTGAAGAAGC CCTTGCCGGT GCAACCATTC ATGCGGCAAA AGCATTGGGT ATGGCAGATA CTCATGGTAG TTTAGAAGTT GGTAAGGTCG CCGACTTTGT GTGCTGGGAG GTAGAAAGTC CTGGAGAGCT AAGTTATTGG TTAGGTGGCG ATTTATTAAA AGCTCGTGTG TACCAAGGCG AGAAAGAATA A
|
Protein sequence | MTNDTPMKLD SLWRGAHIAT MKDGLYSVIE NAAIGVVGGR IVWIGEAQDL PDYETQSEHD LDGGWITPGL IDCHTHLVFG GNRAGEFEQR LNGVSYQEIA KQGGGIASSV KATRDASEEE LIASASRRLK SLIADGVTTV EIKSGYGLSL DAELKMLRVA GQLGNDFPVT VKRTCLAAHA MPPEFVEKDD YIDYLCETLL PKAAKLGMAD AVDAFCEGIA FSTEQVARYF KTAESLGLPV KIHAEQLSSL GGTAMAASFK ALSADHIEFI EESDVKAMAE SGTVAVLLPG AFFTLKETQC PPIDLLRQYG VPMAVATDAN PGTSPALSLR LMMNMSCTLF ALTPEEALAG ATIHAAKALG MADTHGSLEV GKVADFVCWE VESPGELSYW LGGDLLKARV YQGEKE
|
| |