Gene Information |
Locus tag | Maqu_3450 |
Symbol | purH |
ID | 4657125 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Marinobacter aquaeolei VT8 |
Kingdom | Bacteria |
Replicon accession | NC_008740 |
Strand | + |
Start bp | 3811958 |
End bp | 3813538 |
Gene Length | 1581 bp |
Protein Length | 526 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 639813429 |
Product | bifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase |
Protein accession | YP_960708 |
Protein GI | 120556357 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful) |
TIGRFAM ID | [TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase |
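The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive coordinates and that the final codon of the CDS is a stop codon; both assumptions agree with the values in this record, but the record itself does not state them. The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(3811958, 3813538)
print(length)                     # 1581
print(protein_length_aa(length))  # 526
```

Both results match the Gene Length (1581 bp) and Protein Length (526 aa) fields.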
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.52631 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGGCAAACC AGGCAAACAC CCCCGTCCGT CGCGCCCTGA TCAGCGTGAG TGATAAAACC GGCATCGTCG ATTTCGGCCG TGCCCTCACC GAACGCGGCG TCGAGCTGCT CTCCACCGGC GGCACCTTCC GCCTGCTCAA GGAAAACAAC GTACCGGTCA CCGAAGTGTC CGATTACACG GGTTTCCCGG AAATGATGGA TGGCCGGGTG AAAACCCTGC ACCCGAAAAT CCACGGCGGC ATTCTCGGCC GTCGTGGCAC CGACGATGCC ATCATGGCTG AGCACGGAAT CAACCCCATC GACATAGTGG TGGTGAACCT GTACCCGTTC GAGGACACCG TGGCCAACCC GGACTGCGAC CTGGCCACCG CCATTGAAAA CATCGACATT GGCGGCCCTA CCATGGTCCG TGCCGCCGCC AAGAACCACA ACGATGTTGC CATTGTCGTG AACGCCTCCG ACTACAGCCG TGTTCTGAAA GAGCTGGCGG ACAACGACGG CGAGCTGACC TACAGCACGC GCTTCGACCT CGCCGTGAAG GCCTTCGAGC ACACCGCCGG CTACGACTGT GCTATTGCCA ACTATCTGGG CGGGCGTACC CCGGATAACG ACAACGCCGA CTTCCCCCGC ACCTTCAATG CCCAGTTCGT GAAAGTTCAG GATATGCGCT ATGGCGAAAA CCCGCATCAG CGCGCTGCCT TCTACGCCGA GCGCCACCCG AAGGAAGCCT GTGTCGCCAC CGCCAAACAG CTTCAGGGCA AGGAACTGAG CTACAACAAC GTAGCCGACA CCGATGCGGC GCTGGAGTGC GTCAAACCCT TCGCCGACCC GGCCTGCGTC ATCGTCAAGC ACGCCAATCC CTGTGGCGTG GCCATCGGTG CCGACATACT GCAGGCCTAC GACCTGGCCT TTGCCACCGA CCCGACGTCG GCCTTTGGCG GCATCATCGC CTTCAACCGT GAACTGGATG CGGCGACCGC CAAGGCCATT GTGGATCGCC AGTTCGTGGA AGTAATTATC GCCCCGAGCG TTGCGCCGGA AGCGGTCGAG ATTGTTGCCG CCAAGAAGAA CGTACGCCTG CTGGCCTGCG GCGAATTCGA CGGTGAACGC GCCCAGACCA TGGACTACAA GCGCGTCACC GGCGGCCTGC TGGTTCAGGA TCGCGACCTG GGCATGGTGG CCATGGAAGA CGTGAAAGTG GTCACCGAAC GCCAGCCCAC CGAACAGGAA CTCAACGACC TGCTGTTCGC CTGGGAAGTG GCCAAGTACG TTAAATCCAA CGCTATTGTC TATGCCAAAG CCGGCCGCAC CATCGGCGTC GGCGCCGGCC AGATGAGCCG CGTCTACAGC GCCAAGATTG CAGGCATCAA AGCCGCCGAC GAAAACCTGG AAGTGAAGGG TTCCGTCATG GCGTCGGATG CATTCTTCCC GTTCCGTGAC GGCATCGACG CCGCGGCCGA AGCCGGCATT ACCGCCGTGA TTCAGCCCGG CGGCTCCATG CGCGACCAGG AAGTGATCGA TGCCGCCAAC GAGCATGGCA TTGCCATGGT CTTCACCGGC ATGCGCCATT TCCGTCACTG A
Protein sequence | MANQANTPVR RALISVSDKT GIVDFGRALT ERGVELLSTG GTFRLLKENN VPVTEVSDYT GFPEMMDGRV KTLHPKIHGG ILGRRGTDDA IMAEHGINPI DIVVVNLYPF EDTVANPDCD LATAIENIDI GGPTMVRAAA KNHNDVAIVV NASDYSRVLK ELADNDGELT YSTRFDLAVK AFEHTAGYDC AIANYLGGRT PDNDNADFPR TFNAQFVKVQ DMRYGENPHQ RAAFYAERHP KEACVATAKQ LQGKELSYNN VADTDAALEC VKPFADPACV IVKHANPCGV AIGADILQAY DLAFATDPTS AFGGIIAFNR ELDAATAKAI VDRQFVEVII APSVAPEAVE IVAAKKNVRL LACGEFDGER AQTMDYKRVT GGLLVQDRDL GMVAMEDVKV VTERQPTEQE LNDLLFAWEV AKYVKSNAIV YAKAGRTIGV GAGQMSRVYS AKIAGIKAAD ENLEVKGSVM ASDAFFPFRD GIDAAAEAGI TAVIQPGGSM RDQEVIDAAN EHGIAMVFTG MRHFRH
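The sequences above are printed in space-separated blocks of ten characters, so they need cleaning before any computation. The following is a minimal sketch, not the database's own method, of how the gene sequence could be cleaned, its GC fraction computed, and its translation compared with the protein sequence. The codon table is the standard NCBI ordering, which translation table 11 shares for amino acid assignments; table 11 differs mainly in the alternative start codons it allows, which this sketch does not handle (the gene here starts with ATG). All function names are hypothetical.

```python
from itertools import product

# Codons in NCBI order (first base varies slowest, bases ordered T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a (possibly block-formatted) sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate complete codons from position 0, stopping at the first stop."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        residues.append(aa)
    return "".join(residues)


# First two blocks of the gene sequence; the trailing partial codon is ignored.
print(translate("ATGGCAAACC AGGCAAACAC"))  # MANQAN
```

Passing the full Gene sequence string to `translate` and comparing the result with `clean` applied to the Protein sequence string is a quick consistency check, and `gc_fraction` on the full gene sequence can be compared with the GC content field.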