Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TM1040_3004 |
Symbol | |
ID | 4078034 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008044 |
Strand | + |
Start bp | 3172003 |
End bp | 3173220 |
Gene Length | 1218 bp |
Protein Length | 405 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 638008333 |
Product | imidazolonepropionase |
Protein accession | YP_614998 |
Protein GI | 99082844 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG1228] Imidazolonepropionase and related amidohydrolases |
TIGRFAM ID | [TIGR01224] imidazolonepropionase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.783832 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.525072 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCGATA CATACGTGTT GAGCGGTGCT CGGTTGGCGA CCATGGAGGC CGAGGGGTCT TATGGGCTTG TGGAAGACGG GGCGATCGCC ATTCAGGGGG ATGAAATTCT CTGGTGCGGC GCGCGGGGCG CGCTACCGGA CACCTACGCA GCTTGCCCCA GCACCGACCT CAATGGACGT CTGGTCACGC CGGCCTTCAT TGATTGTCAC ACGCACATCG TTTTTGGCGG AGACCGTGCG GGCGAGTTCG AGATGCGCCT CGAGGGGGCA ACCTATGAAG AGGTGGCCAA GGCTGGCGGC GGGATTGTGT CGACCGTCAC GGCCACGCGT GCCGCAAGCC TGGATGCGCT TGTGACCGGG GCCTTGCCGC GGCTGGATGC ACTGATTGCG GAAGGGGTGA GCACGGTAGA AGTCAAATCC GGCTACGGGC TCGACCGCGA GACCGAGCTC AATATGCTGC GCGCGGCGCG TCGTCTGGCT GAGCACAGAG ATGTTACGGT CAAAACCACC TTTCTGGGCG CGCATGCAGT CCCTGCAGAG TATGCGGGTC GGGCAGATGC CTATCTTGAT GAGGTCTGCC TGCCGACGCT GCGCGCAGCG CATGCCGAGG GGTTGGTCGA TGCGGTAGAT GGCTTTTGTG AAGGGATCGC TTTTTCGGCG GCGCAGATCG CAAAGGTTTT TGACGTGGCA GCAGAGTTGG GTTTGCCGGT GAAGCTCCAT GCCGAACAGC TGTCGCATCA GGGCGGCACC AAATTGGCGG CAGAGCGCGG CGCACTGTCT GTGGATCATG TGGAATACGC CACAGAAGCG GACGCACGGG CAATGGCGGC TTCGGGGTCC GTTGCGGTCC TGTTGCCCGG CGCGTTCTAC ACAATCCGCG AAACGCAAGT CCCGCCCGTG GCAGAGTTTC GCATGCATGG CGTGCCGATG GCACTGGCGA CAGACTGCAA CCCCGGATCA TCGCCGCTCA CGTCGCTTCT GCTGACGCTC AATATGGGCT GCACATTGTT TCGCCTGACC CCGGAAGAGG CGCTTGCGGG TGTGACCCGT AATGCCGCCC GCGCGCTGGG CATGCAGGAT CGTGGGCGCA TCGCCCCCGG GCTTCGTGCG GATCTGGCGG TCTGGGATGT CTCTCGCCCG GCAGAGCTGG CTTATCGCAT TGGTTTCAAC CCGCTCTACG CGCGTGTCAT GGGCGGCAAA ATGGAGGTTC GTACATGA
|
Protein sequence | MRDTYVLSGA RLATMEAEGS YGLVEDGAIA IQGDEILWCG ARGALPDTYA ACPSTDLNGR LVTPAFIDCH THIVFGGDRA GEFEMRLEGA TYEEVAKAGG GIVSTVTATR AASLDALVTG ALPRLDALIA EGVSTVEVKS GYGLDRETEL NMLRAARRLA EHRDVTVKTT FLGAHAVPAE YAGRADAYLD EVCLPTLRAA HAEGLVDAVD GFCEGIAFSA AQIAKVFDVA AELGLPVKLH AEQLSHQGGT KLAAERGALS VDHVEYATEA DARAMAASGS VAVLLPGAFY TIRETQVPPV AEFRMHGVPM ALATDCNPGS SPLTSLLLTL NMGCTLFRLT PEEALAGVTR NAARALGMQD RGRIAPGLRA DLAVWDVSRP AELAYRIGFN PLYARVMGGK MEVRT
|
| |