Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | lpp0766 |
Symbol | hutI |
ID | 3116936 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Legionella pneumophila str. Paris |
Kingdom | Bacteria |
Replicon accession | NC_006368 |
Strand | + |
Start bp | 858681 |
End bp | 859892 |
Gene Length | 1212 bp |
Protein Length | 403 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 637579469 |
Product | imidazolonepropionase |
Protein accession | YP_123104 |
Protein GI | 54296735 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG1228] Imidazolonepropionase and related amidohydrolases |
TIGRFAM ID | [TIGR01224] imidazolonepropionase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTTGCCT GTGATGAATT ATTGCTCAAT GCAAGCACAA TCGATGCCAC AGGGTTGCAG CTTTCTAATC AAGCCATTGT TATAAGAAAA GGCAGAATAG AGTGGTGTGG TTCTGAGGAC CAGTTACCTG CCCATTTTCA GGAAAGTGCC AAGTCAAGAA AGGATTGTCA CGGCCAATTA ATTACACCAG GATTAATTGA TTGTCATACT CATCTGGTTT ATGCAGGCCA CAGAGCCGCA GAGTTTCGAT TAAAATTGCA AGGCGTCAGT TACGCTGATA TCGCCAAATC AGGCGGGGGT ATATTATCCA CAGTACAAAT GACACGAGAT GCCTCTGAAG AAGAGTTAGT TGATCAATCA TTGCCAAGAC TTCTTGCGCT AAAAAATGAA GGGGTTACTA CTGTTGAAAT TAAGTCAGGC TACGGCCTTG ATTTGCAAAA TGAATTGAAA ATGCTCAGGG TTGCCAGGCA ATTAGGGGAA GTGGCTGGAA TCAGGGTAAA AACAACTTTT CTGGGGGCGC ATGCTGTAGG TCCCGAGTTT AAAGGAAATA GTCAGGCTTA TGTCGATTTT CTTTGTAATG AGATGTTGCC TGCAGCTAAA AATATGGATT TGGTAGATGC CGTGGATGTT TTTTGTGAAT CCATAGCTTT TTCTATAAGG CAGGCTGAGC AAATTTTTCA AGCCGCTAAG GATTTAAATT TACCAATAAA ATGCCATGCT GAACAATTAT CTAACATGGG GGCCAGTTCT TTGGCTGCTC GTTATGGTGC GTTATCCTGT GATCATCTGG AGTTTTTGGA TGAAAACGGC GCGTTAAATA TGGTAAAAGC CAATACGGTT GCCGTCTTAC TTCCTGGCGC TTTTTATTTC CTTAAAGAAA AACAAAAACC ACCTGTTGAT TTATTGCGTC AGGTTGGTGT TGGTATGGCC ATTGCCACGG ATTCTAACCC GGGTTCCTCT CCAACGACTT CTTTGTTATT GATGATGAGT ATGGCTTGCC AATTTTTCTC TATGTCTATA CCTGAAGTTT TATCTGCAGT GACGTATCAG GCTTCCAGAG CTTTAGGAAT GGAAAAGGAT ATTGGGAGCA TTGAAGCAGG CAAGATTGCT GATTTGGTTT TATGGTCTAT AAAAGATAGT GCTGCGCTAT GTTATTATTT CGCTTATCCG TTACCTCATC AAACGATGGT GGCTGGTGAA TGGGTATCCT GA
|
Protein sequence | MFACDELLLN ASTIDATGLQ LSNQAIVIRK GRIEWCGSED QLPAHFQESA KSRKDCHGQL ITPGLIDCHT HLVYAGHRAA EFRLKLQGVS YADIAKSGGG ILSTVQMTRD ASEEELVDQS LPRLLALKNE GVTTVEIKSG YGLDLQNELK MLRVARQLGE VAGIRVKTTF LGAHAVGPEF KGNSQAYVDF LCNEMLPAAK NMDLVDAVDV FCESIAFSIR QAEQIFQAAK DLNLPIKCHA EQLSNMGASS LAARYGALSC DHLEFLDENG ALNMVKANTV AVLLPGAFYF LKEKQKPPVD LLRQVGVGMA IATDSNPGSS PTTSLLLMMS MACQFFSMSI PEVLSAVTYQ ASRALGMEKD IGSIEAGKIA DLVLWSIKDS AALCYYFAYP LPHQTMVAGE WVS
|
| |