Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Shewmr4_0445 |
Symbol | purH |
ID | 4251569 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shewanella sp. MR-4 |
Kingdom | Bacteria |
Replicon accession | NC_008321 |
Strand | - |
Start bp | 507529 |
End bp | 509127 |
Gene Length | 1599 bp |
Protein Length | 532 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 638117004 |
Product | bifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase |
Protein accession | YP_732582 |
Protein GI | 113968789 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful) |
TIGRFAM ID | [TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 33 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 43 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACTGTTG CAAATAATGC CAGACCCATT CGTCGCGCGC TGTTAAGCGT TTCAGATAAA ACCGGAATTC TCGAATTCGC CAAAGCATTA CACGCCCAAG GCGTTGAACT GCTGTCAACT GGCGGCACCG CTCGCCTGTT AGCGGATAAC GGCGTGCCTG TTATCGAAGT ATCTGACTAT ACAGGACACC CTGAGATCAT GGATGGTCGC GTTAAAACCC TGCACCCGAA AGTGCATGGC GGCATTTTGG CGCGTCGCGG TCTTGATGAA AATGTCATGG CTGCCAACAA CATCAATGCA ATCGATCTGG TTGCGGTTAA CCTCTACCCT TTTGCCGATA CTGTTGCTAA AGCCGGTTGC ACCTTAGAAG ATGCGATTGA AAACATCGAC ATCGGTGGCC CGACTATGGT GCGCGCTGCG GCGAAAAACC ATAAAGATGT GACTATCGTT GTTAATGCCG CCGATTATGA TCGCGTATTA GCCGAAATGG CCGCCAACAA TGGCAGCACG ACTCACGCGA CCCGTTTCGA TTTAGCGATT GCCGCCTTCG AACACACTGC CGGTTACGAT GGCATGATCG CCAACTATTT CGGCACTATG GTTCCTGCGC ATAGCACTGA TGAGTGCTTC GAAGATTCTA AGTTCCCACG CACCTTCAAC ACTCAATTAG TGAAGAAGCA AGATCTGCGT TACGGTGAAA ACAGCCACCA AACTGCAGCC TTCTATGTTG ACACTAAGAT CGACGAAGCC TCAGTCGCAA CTGCAGTTCA ACTGCAAGGT AAGGCACTGT CTTACAACAA CATCGCCGAT ACCGATGCCG CCCTTGAGTG CGTAAAAGAG TTCAGCGAAC CCGCTTGCGT TATCGTTAAA CACGCTAACC CATGTGGTGT TGCACTGGGT AAAGATCTGC TCGATGCCTA TAACCGCGCC TATCAAACTG ACCCAACGTC AGCCTTCGGT GGCATTATCG CCTTCAACGG CGAGTTAGAT GCAGCAACCG CTAGCGCTAT TGTTGAGCGT CAATTCGTTG AAGTGATTAT TGCGCCAGTC GTGAGCCAAG GTGCCCGCGA TGTAGTGGCC AAGAAAACCA ACGTGCGTCT GTTAGAGTGT GGTCAATGGG ATACTAAGAC CAAGACCTTA GACTATAAGC GCGTGAACGG TGGTCTGCTG GTACAAGACC GCGACCAAGG CATGGTTGGC TTAGATGACA TTAAAGTCGT GACTAAGCGT CAACCGACCG AGAGCGAGCT GAAGGACTTA ATGTTCTGCT GGAAAGTGGC TAAGTTCGTT AAATCTAACG CCATTGTTTA CGCTAAAGAC GGTATGACCA TCGGTGTCGG CGCAGGCCAA ATGAGCCGCG TCTACAGCGC TAAAATTGCG GGTATCAAGG CGGCCGATGA AGGGTTAGAA GTGGTTAACT CTGTGATGGC GTCCGATGCC TTCTTCCCAT TCCGCGACGG TATCGATGCC GCAGCGGCGG CGGGCATCAG CTGCATCATC CAGCCAGGTG GCTCAATGCG CGATGCTGAA ATCATCGCCG CAGCCGACGA GCACGGCATG GCCATGGTAA TGACGGGCAT GCGCCACTTC CGTCACTAA
|
Protein sequence | MTVANNARPI RRALLSVSDK TGILEFAKAL HAQGVELLST GGTARLLADN GVPVIEVSDY TGHPEIMDGR VKTLHPKVHG GILARRGLDE NVMAANNINA IDLVAVNLYP FADTVAKAGC TLEDAIENID IGGPTMVRAA AKNHKDVTIV VNAADYDRVL AEMAANNGST THATRFDLAI AAFEHTAGYD GMIANYFGTM VPAHSTDECF EDSKFPRTFN TQLVKKQDLR YGENSHQTAA FYVDTKIDEA SVATAVQLQG KALSYNNIAD TDAALECVKE FSEPACVIVK HANPCGVALG KDLLDAYNRA YQTDPTSAFG GIIAFNGELD AATASAIVER QFVEVIIAPV VSQGARDVVA KKTNVRLLEC GQWDTKTKTL DYKRVNGGLL VQDRDQGMVG LDDIKVVTKR QPTESELKDL MFCWKVAKFV KSNAIVYAKD GMTIGVGAGQ MSRVYSAKIA GIKAADEGLE VVNSVMASDA FFPFRDGIDA AAAAGISCII QPGGSMRDAE IIAAADEHGM AMVMTGMRHF RH
|
| |