Gene Information |
Locus tag | Strop_3806 |
Symbol | purH |
ID | 5060284 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora tropica CNB-440 |
Kingdom | Bacteria |
Replicon accession | NC_009380 |
Strand | - |
Start bp | 4360946 |
End bp | 4362514 |
Gene Length | 1569 bp |
Protein Length | 522 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 640476064 |
Product | bifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase |
Protein accession | YP_001160615 |
Protein GI | 145596318 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful) |
TIGRFAM ID | [TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase |
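
The coordinate and length fields above can be cross-checked against each other. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length counts the stop codon; neither convention is stated in the record itself. The function names are hypothetical helpers, not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 4360946
END_BP = 4362514
GENE_LENGTH_BP = 1569
PROTEIN_LENGTH_AA = 522


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the length includes the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_bp = gene_length(START_BP, END_BP)
computed_aa = protein_length(computed_bp)
print(computed_bp, computed_aa)  # 1569 522
```

Under these assumptions the computed values agree with the Gene Length (1569 bp) and Protein Length (522 aa) fields.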
Plasmid Coverage information |
Num covering plasmid clones | 33 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | GTGAGTTCCA GTCAGGACGA GCGCCGCCCG ATCCGGCGGG CGCTGGTCAG CGTCTACGAC AAGGCCGGTC TGGCCGAGCT GGCCCAGGCG TTGCACGACG CCGGCGTGGA GATCGTCTCG ACCGGAAGCA CCGCGTCGGC CATCGCCGGT GCCGGCGTGC CGGTCACCGC GGTGGATTCC GTGACCGGGT TCCCGGAGAT CCTCGACGGC CGGGTCAAGA CCCTGCACCC CAAGATCCAC GGCGGTCTCC TCGCTGACCT GCGCAAGGAG TCGCACGTTG GGCAGCTCGC CGAGCACGGC ATCGGGGCAA TCGACCTGCT GGTGTCCAAC CTGTACCCGT TTCAGGCGAC CGTCGCCTCC GGGGCGGGGC AGGACGAGTG TGTCGAGCAG ATCGACATCG GCGGGCCGGC GATGGTGCGG GCCGCTGCCA AGAACCACGC CTCGGTCGCC GTGGTGACCG ACCCGTCGGG CTACCCGCAG CTGCTGACGG CGGTACGGGT GGGTGGTTTC ACCCTGGCGC AGCGTCGGGC GCTCGCGGCC CGCGCGTTCG CGGTGATCGC CGACTACGAC GTGGCCGTCG CCGAGTGGTG CGCGCGGGAG TTGGTCGAGG ACGCGCCGTG GCCGAGCTTC GCCGGGCTGG CGCTGCGCCG CGACGCGGTG CTGCGGTACG GGGAGAACCC GCACCAGGCA GCCGCCCTCT ACACCGACCC GTCGAGCCCG GCCGGGCTTG CCCAGGCCGA GCAGCTGCAC GGCAAGGAGA TGTCGTACAA CAACTACGTT GACGCCGACG CCGCCTGGCG GGCCGCCAAC GACTTCTCCG ACCAGCCGGC GGTGGCGATC ATCAAGCACG CCAACCCGTG TGGCATCGCG GTTGGCGTGG ACGTCGCAGA GGCGCACCGC AAGGCGCACG CCTGCGACCC CGTGTCCGCC TTCGGTGGCG TGATCGCCGT GAACCGACCG GTCGGCGTCG AGCTCGCCGG GCAGGTGTCG GAGGTGTTCA CCGAGGTGGT TGTCGCTCCG GAGTTCGAAC CCGACGCGCT CGAGGTACTG CGGGGCAAGA AGAACGTGCG CCTGCTGCGC GCCCCGGCCT ACGCCCCGGC GTCGGCGGAG TGGCGACCGG TCACCGGTGG GATGCTGGTG CAGGTGCGGG ACAAGGTGGA CGCCGCGGGT GACGACCCGG CCACCTGGCA GCTGGCGACC GGCGAGGCCG CCGACGAGGC GACCCTGCGG GACCTGGCCT TCGCCTGGCG GGCGGTCCGA GCGGTGAAGA GCAACGCGAT TCTGCTCGCC CGCGACGGCG CGACCGTCGG TGTGGGTATG GGGCAGGTCA ACCGGGTGGA CTCGGCCCGG CTGGCGGTGG ATCGGGCCGG TGCCGAACGG GCGCGGGGCG CGGTGTGTGC CTCGGACGCG TTCTTCCCGT TCGCCGACGG GCCGAAGATC CTCATCGACG CCGGAGTGCG GGCGATCGTC CAACCTGGCG GGTCGATCCG GGACGAGGAG GTCATCGCCG CTGCCAAGGC GGCCGGCGTG ACCATGTACC TGACCGGCAC CCGTCACTTT TTCCACTGA
Protein sequence | MSSSQDERRP IRRALVSVYD KAGLAELAQA LHDAGVEIVS TGSTASAIAG AGVPVTAVDS VTGFPEILDG RVKTLHPKIH GGLLADLRKE SHVGQLAEHG IGAIDLLVSN LYPFQATVAS GAGQDECVEQ IDIGGPAMVR AAAKNHASVA VVTDPSGYPQ LLTAVRVGGF TLAQRRALAA RAFAVIADYD VAVAEWCARE LVEDAPWPSF AGLALRRDAV LRYGENPHQA AALYTDPSSP AGLAQAEQLH GKEMSYNNYV DADAAWRAAN DFSDQPAVAI IKHANPCGIA VGVDVAEAHR KAHACDPVSA FGGVIAVNRP VGVELAGQVS EVFTEVVVAP EFEPDALEVL RGKKNVRLLR APAYAPASAE WRPVTGGMLV QVRDKVDAAG DDPATWQLAT GEAADEATLR DLAFAWRAVR AVKSNAILLA RDGATVGVGM GQVNRVDSAR LAVDRAGAER ARGAVCASDA FFPFADGPKI LIDAGVRAIV QPGGSIRDEE VIAAAKAAGV TMYLTGTRHF FH
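
The relationship between the gene sequence, the protein sequence and the GC content field can be sketched in a few lines of Python. This is a minimal illustration, not the database's own pipeline. It assumes the gene sequence is shown 5' to 3' in coding orientation (as the GTG start and TGA stop suggest), uses the codon assignments of NCBI translation table 11, and forces the first codon to methionine on the assumption that it is a valid initiator. The helper names (clean, translate_cds, gc_fraction) are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino-acid assignments for NCBI translation table 11, in TCAG codon order.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-base blocks and normalise case."""
    return "".join(seq.split()).upper()


def translate_cds(seq: str) -> str:
    """Translate a CDS up to the first stop codon.

    Assumption: the first codon is an initiator and is read as M,
    even when it is an alternative start such as GTG.
    """
    seq = clean(seq)
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append("M" if i == 0 else aa)
    return "".join(residues)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


# First 30 bases of the gene sequence, copied from the record.
five_prime = "GTGAGTTCCA GTCAGGACGA GCGCCGCCCG"
print(translate_cds(five_prime))  # MSSSQDERRP
```

The translated excerpt matches the first ten residues of the protein sequence. Passing the full gene sequence to translate_cds and gc_fraction in the same way allows the complete protein sequence and the GC content field to be checked.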