Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rru_A3655 |
Symbol | purH |
ID | 3837111 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodospirillum rubrum ATCC 11170 |
Kingdom | Bacteria |
Replicon accession | NC_007643 |
Strand | - |
Start bp | 4194420 |
End bp | 4196000 |
Gene Length | 1581 bp |
Protein Length | 526 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 637827779 |
Product | bifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase |
Protein accession | YP_428736 |
Protein GI | 83594984 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful) |
TIGRFAM ID | [TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGCTCCATT CCCTGCCCAT CCGCCGCGCC CTGATCAGCG TTTCCGACAA GGGCGGGCTT GTGCCCTTCG CCCGTTTCCT CGCCGATCAC GACATCGAGA TCTTGTCGAC CGGAGGCAGC GCCAAGGCGC TGGCCGATGC CGGCATTCCG GTGACCGAGG TCGCCGATTT CACCGGTTTC CCGGAAATGC TCGATGGCCG GGTCAAGACC CTGCATCCGA AGATCCACGG CGGCATCCTG GGCATCCGCG ACAATCCCGA GCACCAGCGG GCGATGGCCG CCCATGAGAT CTTGCCGATC GATCTGGTGG TGGTGAACCT CTATCCCTTC GAAGCCACGG TGGCCAAGGG CGCCGCCTTC GAGGACTGCG TCGAGAACAT CGACATCGGT GGGCCGGCCC TGATCCGCGC CGCCGCCAAG AACCACGAGG CGGTCACCGT CGTCGTCGAT CCCGAGGATT ACCAGCCGGT GATGGACGCC ATGACCGCCG AGGGCGGCGC CACCACGCTG GAGCTGCGGC GCAAGCTGGC TTCGGCCGCC TTCGCCCGCT GCGGCGCCTA TGACGGCGCC ATCAGCCGCT GGTTCCAGGG GCAGGTCGGC GACGAGACTC CGCGTCATAT CGTTTTCGCC GGCCGCCTGC GCCAGACCCT GCGCTATGGC GAGAACCCCC ATCAGAAGGC GGCGTTCTAT GGTCACGGCA TCGCCCGCCC GGGGGTGGCC AGCGCCGAGC AGCTTCAGGG CAAGGAGCTG AGCTACAACA ACATCAATGA TACCGACGCC GCCTTTGATC TGGTCTGCGA ATTCGCCGAG CCGGCGGTGG CGATCATCAA GCACGCCAAC CCCTGCGGCG TCGCCCAGGG CGCCAGCGTC GTCGAAGCCT ATAAGGCCGC CCTCGCCTGC GATCCGGTCA GCGCCTTTGG CGGCATCGTC GCCCTCAACC GGCCGATCGA TCGCGACTCG GCGGTGGAAA TCACCAAGAT CTTCACCGAG GTGGTCATCG CCCCCGATGC CGACGCCGAG GCGCGGGCGA TTTTCGCGGC CAAGAAAAAC CTGCGCCTGC TGCTGACCGG CGTGGTCGCC GATACCACGG CGCCCGGGCT GACCGTGCGC TCGGTCGCCG GCGGCATGCT GGTCCAGGAC CGCGACGCCG CCGATCTGCT GTCGGCCGAT CTCAAGGTGG TCAGCAAGCG CACGCCGACC GAACGCGAAC TGGCCGACAT GCTGATCGCC TTCAAGGTCT GCAAGCACGT CAAATCCAAC GCCATCGTCT ATGTCAAGGA TGGCGCCACG GTGGGCATCG GCGCCGGCCA GATGAGCCGG GTCGACAGCG CCCGCATCGC CTCGTGGAAG GCCGATGAGG CCGCCGAGGC GGCCGGGCTC GCCCAATCGC CGACCCAGGG GTCGGTCGTC GCCTCCGACG CCTTCTTCCC CTTCGCCGAT GGCCTGCTGG CCGCGGCCAA GGCCGGGGCA ACGGCGGTGA TCCAGCCCGG CGGCAGCATG CGCGACGACG AGGTCATCAA AGCCGCCGAC GAGGCCGGCT TGGCGATGGT CTTCACCGGT TTGCGCCACT TCCGCCATTA G
|
Protein sequence | MLHSLPIRRA LISVSDKGGL VPFARFLADH DIEILSTGGS AKALADAGIP VTEVADFTGF PEMLDGRVKT LHPKIHGGIL GIRDNPEHQR AMAAHEILPI DLVVVNLYPF EATVAKGAAF EDCVENIDIG GPALIRAAAK NHEAVTVVVD PEDYQPVMDA MTAEGGATTL ELRRKLASAA FARCGAYDGA ISRWFQGQVG DETPRHIVFA GRLRQTLRYG ENPHQKAAFY GHGIARPGVA SAEQLQGKEL SYNNINDTDA AFDLVCEFAE PAVAIIKHAN PCGVAQGASV VEAYKAALAC DPVSAFGGIV ALNRPIDRDS AVEITKIFTE VVIAPDADAE ARAIFAAKKN LRLLLTGVVA DTTAPGLTVR SVAGGMLVQD RDAADLLSAD LKVVSKRTPT ERELADMLIA FKVCKHVKSN AIVYVKDGAT VGIGAGQMSR VDSARIASWK ADEAAEAAGL AQSPTQGSVV ASDAFFPFAD GLLAAAKAGA TAVIQPGGSM RDDEVIKAAD EAGLAMVFTG LRHFRH
|
| |