Gene Information |
Locus tag | RSP_2934 |
Symbol | hutI |
ID | 3720674 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides 2.4.1 |
Kingdom | Bacteria |
Replicon accession | NC_007493 |
Strand | + |
Start bp | 1612029 |
End bp | 1613216 |
Gene Length | 1188 bp |
Protein Length | 395 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 640071120 |
Product | imidazolonepropionase |
Protein accession | YP_352995 |
Protein GI | 77463491 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG1228] Imidazolonepropionase and related amidohydrolases |
TIGRFAM ID | [TIGR01224] imidazolonepropionase |
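
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the original record: the function names are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive coordinates and that the final codon of the unspliced CDS is a stop codon (the gene sequence listed in this record ends in TGA).

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based, inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length_bp(1612029, 1613216)
print(length, protein_length_aa(length))  # prints "1188 395"
```

Under these assumptions the computed values agree with the Gene Length (1188 bp) and Protein Length (395 aa) fields.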
| |
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.277878 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATGATTC TCGGCAATCT GCGGGTGGCA ACGCTGAGCG ACGGCTACGG GCTGATCCCC GACGCCGCGA TCCTGATCGA GGGCGGCCGC ATCCAATGGG TCGGTCCCGA GGCCCACCTG CCGCCCTCCG CTGCGCCCCG GCACGACATG GGCGGGCGGC TCTGCACGCC CGCGCTGATC GACTGCCATA CCCATGCGGT CTTCGCAGGC ACCCGCGCGG CCGAGTTCGA GATGCGGCTG AAGGGCGCCT CCTATGCCGA GGTGGCGGCA GCGGGCGGCG GCATCGTCTC GACGGTGATG GCCACCCGCG CCGCCGGCGC CGACGAGCTT CTGGCGGCGA GCCTGCCCCG GATCGATGCG ATGCTGGCGG GCGGCGTCGG CACGGTCGAG ATCAAGTCGG GCTACGGGCT CGACATCGAG ACCGAGCTCC GGATGCTGCG CGTCGCGCGC CGGATCGGTG AGCTGCGCAA GGTCCGGGTC CGCACGAGCT TTCTCGGCGC CCATGCCGTG CCGCCGGACC ATCGCGGCCG TCCCGACGCC TATCTGGCCG AAGTCGTGCT GCCCGCGCTG AAGGTGGCGC AGGACGAGGG ACTCGTCGAT GCGGTGGATG GGTTCTGCGA GGGCATCGCC TTCTCGCCCG CGCAGATCGC CCATCTCTTC GCACAGGCGC ACAAGCTGCG GCTGCCGGTG AAGCTTCATG CCGAGCAGCT TTCGAACCTC GGCGGCGCGG CGCTGGCGGC GCGCCACGAT GCGCTCTCGG CCGATCATCT CGAATATCTC GATGCCGAGG GCGTGGCGGC GCTCGCCGCG GCGGGAACCG TGGCCGTGCT GCTGCCCGGC GCCTTCTACG CACTGCGCGA GACGCAGGCA CCCCCGGTGG CGGCGCTCCG CGCGGCGGGC GTGCCGATGG CGGTGGCGAC CGACCTGAAC CCCGGCACCT CTCCGCTGGG CGCGTTGGGG CTCGCCATGA ACATGGCCTG CACCCTCTTC CGCCTGACGC CAGAGGAGGC TCTGGCCGGC ACCACGATCC ATGCCGCCCG TGCACTCGGG CTCTCCGACA CCGGCCGCAT CGCGCCGGGC TTCCGCGCCG ATCTCGCCAT CTGGGAGGCC GAGCATCCGG CCGAACTCAG CTGGCGCATC GGCCCCGCGC CCCTCCATGC CCGCCTCCAC GAGGGAGAGT TCGTCTGA
Protein sequence | MMILGNLRVA TLSDGYGLIP DAAILIEGGR IQWVGPEAHL PPSAAPRHDM GGRLCTPALI DCHTHAVFAG TRAAEFEMRL KGASYAEVAA AGGGIVSTVM ATRAAGADEL LAASLPRIDA MLAGGVGTVE IKSGYGLDIE TELRMLRVAR RIGELRKVRV RTSFLGAHAV PPDHRGRPDA YLAEVVLPAL KVAQDEGLVD AVDGFCEGIA FSPAQIAHLF AQAHKLRLPV KLHAEQLSNL GGAALAARHD ALSADHLEYL DAEGVAALAA AGTVAVLLPG AFYALRETQA PPVAALRAAG VPMAVATDLN PGTSPLGALG LAMNMACTLF RLTPEEALAG TTIHAARALG LSDTGRIAPG FRADLAIWEA EHPAELSWRI GPAPLHARLH EGEFV