Gene Shewmr4_0445 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagShewmr4_0445 
SymbolpurH 
ID4251569 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella sp. MR-4 
KingdomBacteria 
Replicon accessionNC_008321 
Strand
Start bp507529 
End bp509127 
Gene Length1599 bp 
Protein Length532 aa 
Translation table11 
GC content52% 
IMG OID638117004 
Productbifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase 
Protein accessionYP_732582 
Protein GI113968789 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful) 
TIGRFAM ID[TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase 


Plasmid Coverage information

Num covering plasmid clones33 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones43 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTGTTG CAAATAATGC CAGACCCATT CGTCGCGCGC TGTTAAGCGT TTCAGATAAA 
ACCGGAATTC TCGAATTCGC CAAAGCATTA CACGCCCAAG GCGTTGAACT GCTGTCAACT
GGCGGCACCG CTCGCCTGTT AGCGGATAAC GGCGTGCCTG TTATCGAAGT ATCTGACTAT
ACAGGACACC CTGAGATCAT GGATGGTCGC GTTAAAACCC TGCACCCGAA AGTGCATGGC
GGCATTTTGG CGCGTCGCGG TCTTGATGAA AATGTCATGG CTGCCAACAA CATCAATGCA
ATCGATCTGG TTGCGGTTAA CCTCTACCCT TTTGCCGATA CTGTTGCTAA AGCCGGTTGC
ACCTTAGAAG ATGCGATTGA AAACATCGAC ATCGGTGGCC CGACTATGGT GCGCGCTGCG
GCGAAAAACC ATAAAGATGT GACTATCGTT GTTAATGCCG CCGATTATGA TCGCGTATTA
GCCGAAATGG CCGCCAACAA TGGCAGCACG ACTCACGCGA CCCGTTTCGA TTTAGCGATT
GCCGCCTTCG AACACACTGC CGGTTACGAT GGCATGATCG CCAACTATTT CGGCACTATG
GTTCCTGCGC ATAGCACTGA TGAGTGCTTC GAAGATTCTA AGTTCCCACG CACCTTCAAC
ACTCAATTAG TGAAGAAGCA AGATCTGCGT TACGGTGAAA ACAGCCACCA AACTGCAGCC
TTCTATGTTG ACACTAAGAT CGACGAAGCC TCAGTCGCAA CTGCAGTTCA ACTGCAAGGT
AAGGCACTGT CTTACAACAA CATCGCCGAT ACCGATGCCG CCCTTGAGTG CGTAAAAGAG
TTCAGCGAAC CCGCTTGCGT TATCGTTAAA CACGCTAACC CATGTGGTGT TGCACTGGGT
AAAGATCTGC TCGATGCCTA TAACCGCGCC TATCAAACTG ACCCAACGTC AGCCTTCGGT
GGCATTATCG CCTTCAACGG CGAGTTAGAT GCAGCAACCG CTAGCGCTAT TGTTGAGCGT
CAATTCGTTG AAGTGATTAT TGCGCCAGTC GTGAGCCAAG GTGCCCGCGA TGTAGTGGCC
AAGAAAACCA ACGTGCGTCT GTTAGAGTGT GGTCAATGGG ATACTAAGAC CAAGACCTTA
GACTATAAGC GCGTGAACGG TGGTCTGCTG GTACAAGACC GCGACCAAGG CATGGTTGGC
TTAGATGACA TTAAAGTCGT GACTAAGCGT CAACCGACCG AGAGCGAGCT GAAGGACTTA
ATGTTCTGCT GGAAAGTGGC TAAGTTCGTT AAATCTAACG CCATTGTTTA CGCTAAAGAC
GGTATGACCA TCGGTGTCGG CGCAGGCCAA ATGAGCCGCG TCTACAGCGC TAAAATTGCG
GGTATCAAGG CGGCCGATGA AGGGTTAGAA GTGGTTAACT CTGTGATGGC GTCCGATGCC
TTCTTCCCAT TCCGCGACGG TATCGATGCC GCAGCGGCGG CGGGCATCAG CTGCATCATC
CAGCCAGGTG GCTCAATGCG CGATGCTGAA ATCATCGCCG CAGCCGACGA GCACGGCATG
GCCATGGTAA TGACGGGCAT GCGCCACTTC CGTCACTAA
 
Protein sequence
MTVANNARPI RRALLSVSDK TGILEFAKAL HAQGVELLST GGTARLLADN GVPVIEVSDY 
TGHPEIMDGR VKTLHPKVHG GILARRGLDE NVMAANNINA IDLVAVNLYP FADTVAKAGC
TLEDAIENID IGGPTMVRAA AKNHKDVTIV VNAADYDRVL AEMAANNGST THATRFDLAI
AAFEHTAGYD GMIANYFGTM VPAHSTDECF EDSKFPRTFN TQLVKKQDLR YGENSHQTAA
FYVDTKIDEA SVATAVQLQG KALSYNNIAD TDAALECVKE FSEPACVIVK HANPCGVALG
KDLLDAYNRA YQTDPTSAFG GIIAFNGELD AATASAIVER QFVEVIIAPV VSQGARDVVA
KKTNVRLLEC GQWDTKTKTL DYKRVNGGLL VQDRDQGMVG LDDIKVVTKR QPTESELKDL
MFCWKVAKFV KSNAIVYAKD GMTIGVGAGQ MSRVYSAKIA GIKAADEGLE VVNSVMASDA
FFPFRDGIDA AAAAGISCII QPGGSMRDAE IIAAADEHGM AMVMTGMRHF RH