Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Saro_0106 |
Symbol | purH |
ID | 3915992 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_007794 |
Strand | + |
Start bp | 109446 |
End bp | 111035 |
Gene Length | 1590 bp |
Protein Length | 529 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 640442831 |
Product | bifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase |
Protein accession | YP_495389 |
Protein GI | 87198132 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful) |
TIGRFAM ID | [TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGACCGAGG TTACCATCAA GCGGGCGCTG CTTTCGGTGT CCGACAAGTC CGGCCTTGTC GATCTGGGCA AGGCGCTGGC CGCGCGCGGC GTCGAACTGC TGTCCACCGG CGGTACTGCC AAGGCGCTGC GCGATGCCGG CCTGGCGGTG AAGGACGTGT CCGAGCACAC CGGGTTTCCC GAAATGATGG ACGGCCGCGT CAAGACGCTG CACCCGACGA TCCACGGAGG CCTGCTGGCC GTACGAGACA ATCCCGAACA CGCCGCTGCG ATGGCCGAGC ACGCGATCGG TGCGATCGAT CTCGTCGTGG TCAACCTCTA CCCGTTCGAA GCGACCGTGG CCAAGGGTGC GGAGCGTGAG GAGGTGATCG AGAACATCGA CATCGGCGGA CCTTCGATGG TCCGTTCGGC GGCGAAGAAC CACGAATACG TCACCATCCT GACGGATGCG GCCGACTATG CCGCGTTCCT TGGCGAACTC GACCAGACCG GCGGCGCGAC GACACTCGCC TTCCGCCGCA AGATGGCCGC CAAGGCCTAT GCCGCAACCG CCGCCTATGA TGCGGCAATC TCGCAGTGGT TCGCGACCGT GGACCAGCAG GAACACTTCC CACCCGTCCT CGCCGCCGCG CACACCCGCG TGACCACGCT GCGCTATGGC GAGAACCCTC ACCAGGATGC CGCACTCTAC GTTCCGCGCA ATCCCGCCGT CACCGGCCTG CCGCAGGCGA CCCAGGTGCA GGGCAAGGAG CTGTCCTACA ACAACTACAA CGATGCCAAC GCAGCGCTTG AACTGGTCGC CGAGTTCGCG GGCGCGAAGC CGACGGTGGT GATCGTGAAG CACGCCAACC CCTGCGGCGT GGCAACGGCG GAAACGCTGC TTGAAGCGTG GAAGGACGCG CTCGAGTGCG ACTCGGTCTC GGCATTCGGC GGGATCGTGG CGACGAACGT TCCGCTCGAC GGGCCAACCG CAAACGCAAT CTGCGAGATA TTCACCGAAG TGGTTGTGGC GCCGGGCGCA GATGACGCGG CGAAGGAAGC CTTCGCACGC AAGAAGAACC TGCGCCTGCT GCTGATCGAT TCGCTTCCCG ATGCGGGCCG CAAGGGCCTC GTCACCGTGC CGATCGCGGG CGGTCTACTG GTGCAGGACC GCGATGCGGG CAAGATCGAC AAGTCCGACC TCAAGGTGGT GACCAAGCGC GCGCCGACCG AACGGGAGCT GGAGGACGCG CTCTTCGCCT GGACCGTGGC AAAGCACGTC AAGTCGAACG CCATCGTCTA TGCCAAGGAC GGCGTGACGG CCGGCATTGG CGCAGGCCAG ATGAACCGCC GCGACAGCTC GCGCATCGCC GCCGCGAAGG CCAGGGAAGC GGCCGAAACC CATCACTTCA GCGAAGTGCG CACGGTAGGC TCGGCAGTAG CCTCCGACGC GTTCTTCCCC TTTGCCGATG GCCTGATGGC GGCCGTGGAA GCGGGAGCCA CCTGCGTGAT CCAGCCCGGT GGCTCGATCC GCGACGAGGA TGTGATCAAG GCTGCTGATG AAGCCGGCCT TGCCATGGTC TTCACCGGAA TGCGCCACTT CCGCCACTGA
|
Protein sequence | MTEVTIKRAL LSVSDKSGLV DLGKALAARG VELLSTGGTA KALRDAGLAV KDVSEHTGFP EMMDGRVKTL HPTIHGGLLA VRDNPEHAAA MAEHAIGAID LVVVNLYPFE ATVAKGAERE EVIENIDIGG PSMVRSAAKN HEYVTILTDA ADYAAFLGEL DQTGGATTLA FRRKMAAKAY AATAAYDAAI SQWFATVDQQ EHFPPVLAAA HTRVTTLRYG ENPHQDAALY VPRNPAVTGL PQATQVQGKE LSYNNYNDAN AALELVAEFA GAKPTVVIVK HANPCGVATA ETLLEAWKDA LECDSVSAFG GIVATNVPLD GPTANAICEI FTEVVVAPGA DDAAKEAFAR KKNLRLLLID SLPDAGRKGL VTVPIAGGLL VQDRDAGKID KSDLKVVTKR APTERELEDA LFAWTVAKHV KSNAIVYAKD GVTAGIGAGQ MNRRDSSRIA AAKAREAAET HHFSEVRTVG SAVASDAFFP FADGLMAAVE AGATCVIQPG GSIRDEDVIK AADEAGLAMV FTGMRHFRH
|
| |