Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RSP_1100 |
Symbol | purH |
ID | 3720859 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides 2.4.1 |
Kingdom | Bacteria |
Replicon accession | NC_007493 |
Strand | - |
Start bp | 2857807 |
End bp | 2859396 |
Gene Length | 1590 bp |
Protein Length | 529 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 640072332 |
Product | bifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase |
Protein accession | YP_354187 |
Protein GI | 77464683 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful) |
TIGRFAM ID | [TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.759839 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGAACC TCGTTCCCGT TGGCCGCGCC CTTCTGTCGG TTTCCGACAA GTCGGGTCTC CTCGACCTCG CCCGTGCCCT GGCAGATCTG GAGGTGGAGC TGATCTCGAC CGGCGGCACG GCGGCCGCGC TGCGGGCGGC GGGGCTGAAG GTGCGGGACG TCGCCGAGGT CACGGGCTTT CCCGAGATGA TGGACGGCCG GGTCAAGACC CTGCATCCGA TGGTCCACGG CGGGCTTCTG GCGCTGCGCG ACGACGACGA GCATCTGGTG GCGATGGCCG CCCACGGGAT CGAGCCGATC GACCTCCTGG TGGTGAACCT CTATCCGTTC GAAGCGGCCG TCGCGCGGGG CGCCTCCTAC GACGACTGCA TCGAGAACAT CGACATCGGC GGGCCGGCCA TGATCCGGGC CGCGGCCAAG AACCACCGTT TCGTGAACGT CGTGACCGAC ACGGCCGACT ACAAGGCGCT GCTCGACGAG CTGCGCGCCC ATGACGGCGC CACGAGGCTC TCCTTCCGCC AGAAACTCGC GCTGACCGCC TATGCGCGCA CCGCGGCCTA CGACACGGCC GTCTCGACCT GGATGGCGGG CGCGCTGAAG GCCGAGGCGC CGCGCCGCCG CTCCTTCGCG GGCACGCTCG CCCAGACCAT GCGCTACGGC GAGAACCCGC ACCAGAAGGC CGCCTTCTAC ACCGACGGCT CGGCCCGTCC GGGCGTCGCC ACCGCGAAAC AGTGGCAGGG CAAGGAGCTT TCCTACAACA ACATCAACGA CACCGACGCG GCCTTCGAGC TGGTGGCCGA GTTCGACCCG GCCGAGGGCC CGGCCTGCGT CATCGTCAAG CACGCCAACC CCTGCGGCGT GGCCCGGGGC GCGACACTGG CCGAAGCCTA TGCCCGCGCC TTCGACTGCG ACCGCGTCTC GGCCTTCGGC GGCATCATCG CGCTGAACCA GCCGCTCGAT GCGGCCACGG CCGAAAAGAT CACCGAGATC TTCACCGAGG TGGTGATCGC CCCCGGCGCC GACGAGGAAG CCCGCGCGAT CTTCGCCGCC AAGAAGAACC TCCGGCTGCT GACGACCGAG GCGCTGCCCG ATCCGCTGGC GCCGGGGCTC GCCTTCAAGC AGGTGGCGGG CGGCTTCCTC GTGCAGGACC GCGACGCGGG CCATGTCGAT GCGCTCGACC TGAAGGTGGT GACGAAGCGC GCGCCCTCGG ACGCGGAACT CGCCGACCTC CTCTTTGCCT GGACCGTGGC CAAGCATGTG AAATCGAACG CCATCGTCTA TGTGAAGGAC GGGGCCACCG TGGGCGTGGG GGCGGGGCAG ATGAGCCGCG TCGACTCGAC CCGGATCGCT GCGCGCAAGT CGCAGGACAT GGCGCAGGCG CTGGGTCTGG CCCAGCCGCT GACGCAAGGG TCCGTCGTGG CCTCCGACGC CTTCTTCCCC TTCGCCGACG GCCTGCTCGC CGCGGCCGAG GCGGGCGCCA CCGCGATCAT CCAGCCCGGC GGCTCGATGC GCGACGACGA GGTGATCGCG GCGGCCGACG AGGCGGGGCT CGCCATGGTC TTCACCGGCC AGCGTCACTT CCGGCACTGA
|
Protein sequence | MTNLVPVGRA LLSVSDKSGL LDLARALADL EVELISTGGT AAALRAAGLK VRDVAEVTGF PEMMDGRVKT LHPMVHGGLL ALRDDDEHLV AMAAHGIEPI DLLVVNLYPF EAAVARGASY DDCIENIDIG GPAMIRAAAK NHRFVNVVTD TADYKALLDE LRAHDGATRL SFRQKLALTA YARTAAYDTA VSTWMAGALK AEAPRRRSFA GTLAQTMRYG ENPHQKAAFY TDGSARPGVA TAKQWQGKEL SYNNINDTDA AFELVAEFDP AEGPACVIVK HANPCGVARG ATLAEAYARA FDCDRVSAFG GIIALNQPLD AATAEKITEI FTEVVIAPGA DEEARAIFAA KKNLRLLTTE ALPDPLAPGL AFKQVAGGFL VQDRDAGHVD ALDLKVVTKR APSDAELADL LFAWTVAKHV KSNAIVYVKD GATVGVGAGQ MSRVDSTRIA ARKSQDMAQA LGLAQPLTQG SVVASDAFFP FADGLLAAAE AGATAIIQPG GSMRDDEVIA AADEAGLAMV FTGQRHFRH
|
| |