Gene Information |
Locus tag | RPB_0408 |
Symbol | |
ID | 3908846 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | + |
Start bp | 452026 |
End bp | 452763 |
Gene Length | 738 bp |
Protein Length | 245 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 637882294 |
Product | 1-(5-phosphoribosyl)-5-[(5-phosphoribosylamino)methylideneamino]imidazole-4-carboxamide isomerase |
Protein accession | YP_484030 |
Protein GI | 86747534 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0106] Phosphoribosylformimino-5-aminoimidazole carboxamide ribonucleotide (ProFAR) isomerase |
TIGRFAM ID | [TIGR00007] phosphoribosylformimino-5-aminoimidazole carboxamide ribotide isomerase |
|
|
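The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the gene length includes a single terminal stop codon; both assumptions fit the listed values, but the record does not state them. The function names are illustrative only.

```python
# Values copied from the Gene Information table above.
START_BP = 452026
END_BP = 452763


def gene_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


length = gene_length(START_BP, END_BP)
print(length, protein_length(length))  # prints: 738 245
```

The results match the Gene Length (738 bp) and Protein Length (245 aa) fields.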
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.545772 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.604429 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATTCTCT TCCCCGCGAT CGATCTCAAG AACGGCCAGT GCGTGCGGCT CGAACAGGGC GACATGGCGC GCGCCACCGT GTTCAACCTC GATCCGACCG CGCAGGCGAA GAGCTTTGCG GCGCAGGGTT TTCAGTATCT CCATGTGGTC GATCTCGACG GCGCCTTCGC CGGCAAGCCG ATGAACGCAC AGGCCGTGGA ATCGATGCTG AAAGTCGTGT CGATGCCGGT CCAGCTCGGC GGCGGCATCC GCGACCTCGC GACAGTCGAG GCCTGGCTGT CCAAAGGCAT CGCCCGCGTC ATCATCGGCA CCGCGGCGGT GCGGGATCCG GCGCTGGTGA AGCAAGCTGC GAAGTCGTTC CCCGGCCGCG TCGCGGTCGG CCTAGATGCC CGCGACGGCA AGGTTGCGGT CGAAGGCTGG GCGGAGAGCT CGCAGGTCAC CGCGCTGGAA ATTGCGCAAC GCTTCGAAGA CGCCGGCGTC GCCGCGATCA TCTTCACCGA CATTGCCCGC GACGGCCTGC TCAAGGGCAT CAACTGGGAT GCGACGATCG CGCTCGCCGA GGCCATCAGT ATTCCGGTAA TCGCCTCCGG GGGGCTCGCC TCGATCGAGG ATGTGAAGGC GATGCTGAGC CCGCGCGCTC ACAAACTGGA AGGCGCGATC GCCGGCCGTG CGCTGTATGA CGGCCGGCTC GACCCGGCGG AAGCGCTGGC GCTGATCGGC GCCGCCAGAG CGGCTTGA
|
Protein sequence | MILFPAIDLK NGQCVRLEQG DMARATVFNL DPTAQAKSFA AQGFQYLHVV DLDGAFAGKP MNAQAVESML KVVSMPVQLG GGIRDLATVE AWLSKGIARV IIGTAAVRDP ALVKQAAKSF PGRVAVGLDA RDGKVAVEGW AESSQVTALE IAQRFEDAGV AAIIFTDIAR DGLLKGINWD ATIALAEAIS IPVIASGGLA SIEDVKAMLS PRAHKLEGAI AGRALYDGRL DPAEALALIG AARAA
|
| |
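The protein sequence and GC content can, in principle, be recomputed from the gene sequence. The following is a minimal sketch rather than the database's own method: it assumes the sequence is pasted with its space-separated blocks of 10 bases, and it relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard code (the two differ only in permitted start codons). The helper names `clean`, `gc_content`, and `translate` are hypothetical.

```python
from itertools import product

# Codon assignments shared by NCBI translation tables 1 and 11,
# listed in the conventional T, C, A, G order for each codon position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-base blocks and upper-case the bases."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence from the record above.
print(translate("ATGATTCTCT TCCCCGCGAT CGATCTCAAG"))  # prints: MILFPAIDLK
```

Passing the full gene sequence to `translate` and `gc_content` allows a comparison against the Protein sequence and GC content fields listed in the record.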