Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TM1040_0091 |
Symbol | purH |
ID | 4078757 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008044 |
Strand | + |
Start bp | 94226 |
End bp | 95815 |
Gene Length | 1590 bp |
Protein Length | 529 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 638005378 |
Product | bifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase |
Protein accession | YP_612086 |
Protein GI | 99079932 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful) |
TIGRFAM ID | [TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.587177 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCGACC TCCACCCCGT CCGCCGCGCC CTTCTGTCCG TCTCTGACAA AACCGGGCTG ATCGAGCTGG GTAAATCCCT CGCTGAGCGC GGGGTTGAAC TGCTCTCGAC CGGTGGCACT GCCAAAGCGC TGCGCGATGC CGGGCTGACC GTGAAGGACG TCTCCGAGGT GACCGGCTTT CCCGAGATGA TGGATGGCCG CGTAAAAACC CTGCATCCGA TGGTGCATGG CGGCCTTCTG GCTCTGCGCG ACAATGACGC GCATGTGGCC GCGATGACCG ATCATGGCAT TGGCGAAATC GATCTCTTGG TGGTGAACCT CTACCCCTTT GAGGCGGCGC TGAAGCGCGG TGCGGCCTAT GACGAAATGA TCGAGAACAT CGACATCGGT GGTCCCGCGA TGATCCGCGC GGCGGCCAAG AACCACGCGT TTGTCAATGT GGTGGTGGAT GTTGAGGATT ACGGCGTCCT CTTGGAGGAG CTGGACCAGA ACGACGGTCA GACCTCCTTT GCCTTCCGTC AGTGGCTGGC ACAGAACGCC TATGCGCGCA CCGCTGCCTA TGATGCGGCT GTGTCGAACT GGATGGCCGG AGCGATCGGT CTTGATGCGC CGCGCCGCCG TGCCTTTGCT GGTCAGATTG CGCAGACGCT GCGCTATGGC GAGAACCCGC ATCAGGACGC GGCCTTCTAC ACCGATGGCA CCGAGCGTGT GGGCGTGGCG ACCGCAGAGC AGTTGCAGGG CAAGGAACTC TCCTACAACA ACATCAACGA CACCGACGCA GCCTTTGAAC TCGTGAGCGA ATTCGCCCCC GAGGACGGCC CGGCCGTGGC GATCATCAAA CACGCCAACC CCTGCGGCGT GGCGCGTGGC GCAACCCTCT TGGAGGCATA CAACAAGGCG TTTGACTGCG ATCGCACCTC AGCATTCGGG GGCATCGTTG CGCTCAACAT GCCGCTTGAT GCCGAGACCG CAGAGGCAAT CACCCAGATC TTTACCGAAG TGGTGATCGC ACCGGGGGCC TCGGATGAGG CCAAGGCGAT CTTTGCGGCG AAGAAGAACC TGCGCCTCTT GATCACCGAG GGCCTGCCTA ACCCGCAGGA CGCAGGCCTG ACCACCCGTC AGGTTTCGGG CGGGATGCTG GTGCAGGACA AGGACGTTGG CCACCGGGCC ATGGACGACC TGAAAGTGGT GACCGAAAAG GCACCGACCG AAGAGCAGAT GGCGGATCTG CTCTTTGCCT GGAAGGTCGC CAAACATGTG AAATCCAACG CGATTGTCTA TGTCAAAGAC GGCCAGACGG TGGGCGTGGG CGCAGGCCAG ATGAGCCGCG TCGACTCCGC CACGATTGCA GGTGTCAAAG CACAGCGCAT GGCGGATGCG ATGGAACTGC CCGAAAGCCT CGCCAAAGGC TCCGCAGTGG CGTCGGATGC TTTCTTTCCC TTCGCTGATG GCCTGATGGA AGCAGCTTCC AATGGCGCCA CCTGCGTCAT TCAGCCCGGT GGCTCCATGC GTGATGACGA GGTCATCAAG GCGGCGAATG ACGCGGGGCT TGCCATGGTC TTTACCGGTA TGCGCCACTT CCGCCACTAA
|
Protein sequence | MTDLHPVRRA LLSVSDKTGL IELGKSLAER GVELLSTGGT AKALRDAGLT VKDVSEVTGF PEMMDGRVKT LHPMVHGGLL ALRDNDAHVA AMTDHGIGEI DLLVVNLYPF EAALKRGAAY DEMIENIDIG GPAMIRAAAK NHAFVNVVVD VEDYGVLLEE LDQNDGQTSF AFRQWLAQNA YARTAAYDAA VSNWMAGAIG LDAPRRRAFA GQIAQTLRYG ENPHQDAAFY TDGTERVGVA TAEQLQGKEL SYNNINDTDA AFELVSEFAP EDGPAVAIIK HANPCGVARG ATLLEAYNKA FDCDRTSAFG GIVALNMPLD AETAEAITQI FTEVVIAPGA SDEAKAIFAA KKNLRLLITE GLPNPQDAGL TTRQVSGGML VQDKDVGHRA MDDLKVVTEK APTEEQMADL LFAWKVAKHV KSNAIVYVKD GQTVGVGAGQ MSRVDSATIA GVKAQRMADA MELPESLAKG SAVASDAFFP FADGLMEAAS NGATCVIQPG GSMRDDEVIK AANDAGLAMV FTGMRHFRH
|
| |