Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rru_A1303 |
Symbol | |
ID | 3833611 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodospirillum rubrum ATCC 11170 |
Kingdom | Bacteria |
Replicon accession | NC_007643 |
Strand | + |
Start bp | 1537507 |
End bp | 1538751 |
Gene Length | 1245 bp |
Protein Length | 414 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 637825393 |
Product | imidazolonepropionase |
Protein accession | YP_426391 |
Protein GI | 83592639 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG1228] Imidazolonepropionase and related amidohydrolases |
TIGRFAM ID | [TIGR01224] imidazolonepropionase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCGCAAG CACCCTGGGA CAGCCTGTGG CTCAACGGCT CGCTGGCCAC CATGGCCGCA GGCGGGGCGG CCTATGGCGC GATCGAGGAT GGCGCCATCG CCGTGGCCGA CGGGCGCCTG GTTTTCGTCG GCCGCCGCGC CGATCTGCCC CCGGGCGCCG AGGATCGGGC CGCCAGCCTT CACGATTTGG GGGGGCGCTG GGTCACTCCG GGACTGATCG ATTGCCATAC CCACCTTGTT TACGCCGGCA GCCGCGCCCG TGAGTTCGAG CTTCGGCTGC AAGGCGCCAG CTATGAGGAG ATCGCCCGCG CCGGCGGCGG CATCGTCTCC ACCGTCGGCG CGACGCGGGC CGCCAGCCTT GACCAACTGA TCGCCAGCGC CTTGCCCCGG CTTGACGCCC TGCTGGCCGA AGGGGTGACG ACGCTCGAGA TCAAATCGGG CTATGGGCTG ACGATTGAAA GCGAACGGCG CCTGTTGCAG GCCGCCCGCG CGCTGGGTCG GCGGCGCCCC GTTGATGTGG TGACGACCTT TCTTGGCGCC CATGCCCTGC CGCCGGAATT TTCCGGGGAT GGCAATGGCT ATATCGATCA CCTCTGCGCG TCGATGCTGC CCGCCCTGGC GGCCGAGGGG CTGGTCGACG CCGTCGATGT CTTTTGCGAG ACCATCGCCT TTTCCCTTGC CCAGGCCACC CGGGTTTTTG AAGCGGCGCG GGGCTTGGGC CTGCCGGTCA AGGTCCATGC CGAACAGCTT GGCCTGCTGG GCGGGGCGGC GCTGGCGGCG GGCTTTGGCG CGCTGTCGGC CGATCATGTC GAATACCTCG ACGAGGCCGG GGTGCGGGCC CTGGCGGCGG CGGGAACGGT GGCGGTGCTG CTGCCCGGCG CCTTTCATAT GCTGCGCGAA ACCCAAAGGC CGCCGGTCGA CGCCCTGCGC CGCTTCGGCG TGGCGATCGC CATCGCCACC GATTGCAACC CCGGCACCTC GCCCGTCACC TCGCCGCTGC TCATGCTCAA CATGGCCTGC GTGCTCTTCC GCCTGACCCC CGAGGAGGCC CTGGCCGGGC TGACCCGCAA TGGCGCCAAG GCGCTGGGCT TAAGCGATCG CGGCGTGCTG GCCCCCGGTT TGCGCGCCGA TTTCGCCCTG TGGGATATCG GCCATCCCGC TGAACTGGCC TATGCGCTGG GGCTCAACCC CTGTCATGCC GTGGTGCGGG CCGGCGAACC ACGGCCGCCG CGCGCCCCCT TTTGA
|
Protein sequence | MAQAPWDSLW LNGSLATMAA GGAAYGAIED GAIAVADGRL VFVGRRADLP PGAEDRAASL HDLGGRWVTP GLIDCHTHLV YAGSRAREFE LRLQGASYEE IARAGGGIVS TVGATRAASL DQLIASALPR LDALLAEGVT TLEIKSGYGL TIESERRLLQ AARALGRRRP VDVVTTFLGA HALPPEFSGD GNGYIDHLCA SMLPALAAEG LVDAVDVFCE TIAFSLAQAT RVFEAARGLG LPVKVHAEQL GLLGGAALAA GFGALSADHV EYLDEAGVRA LAAAGTVAVL LPGAFHMLRE TQRPPVDALR RFGVAIAIAT DCNPGTSPVT SPLLMLNMAC VLFRLTPEEA LAGLTRNGAK ALGLSDRGVL APGLRADFAL WDIGHPAELA YALGLNPCHA VVRAGEPRPP RAPF
|
| |