Gene Mmwyl1_3900 details

Gene Information

Locus tag: Mmwyl1_3900
Symbol:
ID: 5368200
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Marinomonas sp. MWYL1
Kingdom: Bacteria
Replicon accession: NC_009654
Strand:
Start bp: 4397086
End bp: 4398306
Gene Length: 1221 bp
Protein Length: 406 aa
Translation table: 11
GC content: 48%
IMG OID: 640806288
Product: imidazolonepropionase
Protein accession: YP_001342732
Protein GI: 152997897
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG1228] Imidazolonepropionase and related amidohydrolases
TIGRFAM ID: [TIGR01224] imidazolonepropionase
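
The coordinate and length fields in this record are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (the usual convention for such records, but not stated here) and that the coding sequence ends in a single stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 4397086
end_bp = 4398306

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# A CDS should be a whole number of codons (3 bases each).
codons, remainder = divmod(gene_length, 3)

# Assumption: the final codon is a stop codon and encodes no residue.
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 1221 0 406
```

The result matches the listed Gene Length (1221 bp) and Protein Length (406 aa).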


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 40
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACAAACG ATACTCCAAT GAAATTAGAC AGCTTATGGC GTGGCGCTCA TATCGCCACC 
ATGAAAGATG GCCTGTATAG CGTGATTGAA AACGCAGCGA TAGGCGTCGT CGGCGGCCGT
ATCGTTTGGA TTGGCGAAGC GCAAGATCTG CCAGACTATG AGACTCAAAG CGAACATGAT
CTGGATGGCG GTTGGATTAC ACCTGGCTTG ATCGATTGTC ACACGCACCT TGTTTTTGGT
GGTAACCGTG CAGGTGAATT CGAACAACGA TTGAATGGCG TAAGTTATCA AGAAATCGCC
AAGCAGGGCG GCGGTATAGC ATCCTCTGTA AAGGCCACTC GTGACGCCAG CGAAGAGGAG
TTAATCGCCA GTGCTTCTCG TCGTCTGAAA AGTTTAATAG CCGATGGTGT GACCACAGTT
GAGATCAAAT CAGGTTATGG ACTGTCACTG GATGCGGAGT TGAAAATGCT GCGAGTGGCA
GGTCAACTAG GTAATGATTT TCCGGTGACT GTAAAGCGTA CTTGTTTGGC GGCACATGCC
ATGCCTCCTG AGTTTGTGGA GAAAGACGAC TACATAGATT ATTTATGCGA GACCTTGCTG
CCGAAGGCAG CTAAGCTAGG CATGGCGGAT GCCGTGGATG CATTTTGTGA AGGCATTGCT
TTTAGCACAG AACAAGTTGC CCGCTATTTT AAAACGGCGG AATCTTTAGG CTTACCAGTG
AAAATTCATG CGGAGCAGTT GTCGTCATTA GGTGGAACGG CCATGGCCGC GTCCTTCAAA
GCCTTGTCGG CCGATCATAT CGAATTCATT GAAGAATCAG ACGTCAAAGC CATGGCAGAA
TCGGGCACAG TGGCGGTGTT ATTGCCTGGG GCATTTTTTA CCCTAAAAGA AACTCAATGC
CCTCCAATTG ATTTACTTCG ACAATATGGC GTTCCCATGG CGGTTGCTAC TGACGCTAAC
CCTGGTACTT CGCCAGCTTT ATCGCTTCGG CTCATGATGA ATATGTCTTG TACTTTGTTT
GCTCTGACGC CTGAAGAAGC CCTTGCCGGT GCAACCATTC ATGCGGCAAA AGCATTGGGT
ATGGCAGATA CTCATGGTAG TTTAGAAGTT GGTAAGGTCG CCGACTTTGT GTGCTGGGAG
GTAGAAAGTC CTGGAGAGCT AAGTTATTGG TTAGGTGGCG ATTTATTAAA AGCTCGTGTG
TACCAAGGCG AGAAAGAATA A
 
Protein sequence
MTNDTPMKLD SLWRGAHIAT MKDGLYSVIE NAAIGVVGGR IVWIGEAQDL PDYETQSEHD 
LDGGWITPGL IDCHTHLVFG GNRAGEFEQR LNGVSYQEIA KQGGGIASSV KATRDASEEE
LIASASRRLK SLIADGVTTV EIKSGYGLSL DAELKMLRVA GQLGNDFPVT VKRTCLAAHA
MPPEFVEKDD YIDYLCETLL PKAAKLGMAD AVDAFCEGIA FSTEQVARYF KTAESLGLPV
KIHAEQLSSL GGTAMAASFK ALSADHIEFI EESDVKAMAE SGTVAVLLPG AFFTLKETQC
PPIDLLRQYG VPMAVATDAN PGTSPALSLR LMMNMSCTLF ALTPEEALAG ATIHAAKALG
MADTHGSLEV GKVADFVCWE VESPGELSYW LGGDLLKARV YQGEKE
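
The sequences above are printed in blocks of ten characters, so the spaces and line breaks must be stripped before any analysis. The following is a minimal sketch, not the database's own method, of how the gene sequence could be cleaned, its GC fraction computed, and its translation compared with the listed protein. It assumes that translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code (the tables differ in permitted start codons, which does not matter here because the gene starts with ATG). The helper names `clean`, `gc_content`, and `translate` are hypothetical.

```python
# Codon table in the conventional T, C, A, G ordering; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Return the fraction of G and C bases (expects a non-empty sequence)."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First 30 bases of the gene sequence, exactly as formatted in the record.
first_row = "ATGACAAACG ATACTCCAAT GAAATTAGAC"
print(translate(first_row))  # MTNDTPMKLD
```

The ten residues produced from the first 30 bases match the start of the listed protein sequence. Passing the full gene sequence to `translate` and `gc_content` allows the same comparison against the complete protein and the GC content field.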