Gene Shewmr7_3584 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagShewmr7_3584 
SymbolpurH 
ID4257969 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella sp. MR-7 
KingdomBacteria 
Replicon accessionNC_008322 
Strand
Start bp4248682 
End bp4250280 
Gene Length1599 bp 
Protein Length532 aa 
Translation table11 
GC content52% 
IMG OID638124268 
Productbifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase 
Protein accessionYP_739621 
Protein GI114049071 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful) 
TIGRFAM ID[TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTGTTG CAAATAATGC CAGACCCATT CGTCGCGCGC TGTTAAGCGT TTCAGATAAA 
ACCGGAATTC TCGAATTCGC CAAAGCATTA CACGCCCAAG GCGTTGAACT GCTGTCAACG
GGCGGCACCG CTCGCCTGTT AGCGGATAAC GGCGTGCCTG TTATCGAAGT ATCTGACTAT
ACAGGACACC CTGAGATCAT GGATGGTCGC GTTAAAACCC TGCACCCGAA AGTGCATGGC
GGCATTTTGG CGCGTCGCGG TCTTGATGAA AATGTCATGG CTGCCAACAA CATCAATGCA
ATCGATCTGG TTGCGGTTAA CCTCTACCCT TTTGCCGATA CTGTTGCTAA AGCCGGGTGC
ACCTTAGAAG ATGCAATTGA AAACATCGAC ATCGGTGGCC CGACTATGGT GCGCGCTGCG
GCGAAAAACC ATAAAGATGT GACTATCGTT GTTAATGCCG CCGATTATGA TCGCGTATTA
GCCGAAATGG CCGCCAACAA TGGCAGCACG ACTCACGCGA CTCGTTTCGA TTTAGCGATT
GCCGCCTTCG AACACACTGC AGGTTACGAT GGCATGATCG CCAACTATTT CGGCACTATG
GTTCCTGCGC ATAGCACTGA TGAGTGCTTC GAAGATTCTA AGTTCCCACG CACCTTCAAC
ACCCAATTAG TGAAGAAGCA AGATCTGCGT TACGGTGAAA ACAGCCACCA AACTGCTGCC
TTCTACGTTG ACACTAAGAT CGACGAAGCC TCGGTCGCAA CGGCAGTTCA ACTGCAAGGT
AAAGCACTGT CTTACAACAA CATCGCCGAT ACCGATGCCG CCCTTGAGTG CGTGAAAGAG
TTCAGCGAAC CCGCTTGCGT TATCGTTAAA CACGCTAACC CATGCGGCGT TGCACTAGGT
AAAGACTTAC TCGATGCCTA TAACCGCGCC TATCAAACTG ATCCAACGTC AGCCTTCGGT
GGCATTATCG CCTTCAACGG CGAGTTAGAT GCTGCAACCG CTAGCGCTAT CGTTGAGCGT
CAATTCGTTG AAGTGATTAT TGCGCCAGTC GTGAGCCAAG GTGCCCGCGA TGTAGTGGCC
AAGAAAACCA ACGTGCGTCT GTTAGAGTGT GGTCAATGGG ATACTAAGAC CAAGACCTTA
GACTACAAGC GCGTGAACGG TGGTCTACTG GTACAAGACC GCGACCAAGG CATGGTTGGT
TTAGATGACA TAAAAGTCGT GACTAAACGT CAACCGACCG AGAGCGAGCT GAAGGACTTA
ATGTTCTGCT GGAAAGTGGC TAAGTTCGTT AAATCTAACG CGATTGTTTA CGCTAAAGAC
GGCATGACCA TCGGTGTCGG CGCCGGCCAA ATGAGCCGCG TCTACAGCGC TAAGATTGCG
GGTATCAAGG CGGCCGATGA AGGCTTAGAA GTGGTTAACT CTGTGATGGC GTCCGATGCC
TTCTTCCCAT TCCGCGACGG TATCGATGCC GCAGCGGCGG CGGGCATCAG CTGCATCATC
CAGCCAGGTG GCTCAATGCG CGATGCTGAA ATCATCGCCG CGGCCGACGA GCACGGCATG
GCCATGGTAA TGACGGGCAT GCGCCACTTC CGTCACTAA
 
Protein sequence
MTVANNARPI RRALLSVSDK TGILEFAKAL HAQGVELLST GGTARLLADN GVPVIEVSDY 
TGHPEIMDGR VKTLHPKVHG GILARRGLDE NVMAANNINA IDLVAVNLYP FADTVAKAGC
TLEDAIENID IGGPTMVRAA AKNHKDVTIV VNAADYDRVL AEMAANNGST THATRFDLAI
AAFEHTAGYD GMIANYFGTM VPAHSTDECF EDSKFPRTFN TQLVKKQDLR YGENSHQTAA
FYVDTKIDEA SVATAVQLQG KALSYNNIAD TDAALECVKE FSEPACVIVK HANPCGVALG
KDLLDAYNRA YQTDPTSAFG GIIAFNGELD AATASAIVER QFVEVIIAPV VSQGARDVVA
KKTNVRLLEC GQWDTKTKTL DYKRVNGGLL VQDRDQGMVG LDDIKVVTKR QPTESELKDL
MFCWKVAKFV KSNAIVYAKD GMTIGVGAGQ MSRVYSAKIA GIKAADEGLE VVNSVMASDA
FFPFRDGIDA AAAAGISCII QPGGSMRDAE IIAAADEHGM AMVMTGMRHF RH