Gene SO_0442 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSO_0442 
SymbolpurH 
ID1168318 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella oneidensis MR-1 
KingdomBacteria 
Replicon accessionNC_004347 
Strand
Start bp469791 
End bp471422 
Gene Length1632 bp 
Protein Length543 aa 
Translation table11 
GC content51% 
IMG OID637342442 
Productbifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase 
Protein accessionNP_716079 
Protein GI24372037 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful) 
TIGRFAM ID[TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACTGTTG CAAATCATGC CAGACCCATT CGTCGCGCGC TGTTAAGCGT TTCAGATAAA 
ACCGGAATTC TCGAGTTCGC CAAAGCATTA CACGCCCAAG GCGTTGAACT GCTATCAACT
GGCGGCACCG CTCGCCTGTT AGCGGATAAC GGTGTACCTG TTATCGAAGT ATCTGACTAT
ACAGGACACC CTGAGATCAT GGATGGTCGC GTTAAAACTC TGCATCCTAA AGTGCATGGT
GGTATTCTGG CTCGTCGTGG TCTTGATGAA AATGTCATGG CTGCCAACAA TATCAATGCA
ATTGATCTGG TTGCAGTTAA CCTCTACCCC TTTGCCGATA CTGTCGCAAA AGCCGGTTGC
ACCTTAGAAG ATGCAATTGA AAACATCGAT ATCGGTGGTC CAACAATGGT TCGCGCTGCG
GCGAAAAACC ATAAAGATGT GACTATCGTG GTGAATGCTG CTGATTACAA TCGCGTCTTA
GCTGAAATGG CCGTCAACAA TGGCAGTACA ACTCATGCAA CCCGTTTTGA TTTAGCGATT
GCCGCCTTCG AACACACTGC CGGTTACGAT GGTATGATCG CCAACTACTT CGGCACTATG
GTTCCAATGC ATAGGGTTCC AGCGCATAGC ACTGATGAAT GCTTCCAAGA TTCCCTATCC
GTTGAAGGCT CAAAGTTCCC ACGCACCTTC AACACCCAAT TAGTGAAGAA GCAAGATCTG
CGTTACGGTG AAAACAGCCA TCAAGCGGCG GCTTTCTATG TCGACACTAA GATCGATGAA
GCTTCAGTAG CCACTGCCGT TCAGCTGCAA GGTAAGGCAT TGTCTTACAA CAATATCGCC
GATACCGATG CCGCCCTTGA GTGCGTTAAA GAGTTCAGTG AGCCAGCTTG CGTTATCGTT
AAACACGCTA ACCCATGTGG CGTAGCACTG GGTAAAGACT TACTCGATGC CTACAACCGC
GCCTATCAAA CTGACCCAAC CTCAGCCTTT GGCGGCATTA TCGCCTTCAA CGGCGAATTA
GATGCGGCCA CCGCCAGCGC TATCGTTGAG CGTCAATTCG TTGAAGTGAT CATCGCGCCA
GTTGTGAGCC AAGGTGCCCG CGATGTGGTG GCTAAGAAAA CTAACGTGCG TCTATTAGAG
TGTGGCCAGT GGAATACTAA GACCCAAACC TTAGACTACA AGCGCGTAAA CGGTGGTCTG
CTGGTGCAAG ATCGCGACCA AGGCATGGTT GGTTTAGACG ACATTAAAGT GGTGACTAAG
CGTCAACCCA CAGAAAGCGA ACTCAAAGAT TTAATGTTCT GCTGGAAAGT GGCTAAATTC
GTTAAATCTA ACGCCATTGT TTACGCTAAA GACGGCATGA CAATCGGTGT CGGCGCTGGC
CAAATGAGCC GTGTCTACAG CGCCAAGATC GCCGGTATCA AGGCTGCCGA TGAAGGTTTA
GAAGTGGTGA ACTCTGTGAT GGCCTCCGAT GCGTTTTTCC CCTTCCGCGA TGGTATTGAT
GCCGCAGCGG CAGCAGGCAT CAGCTGCATC ATCCAGCCAG GTGGCTCAAT GCGCGATGCG
GAAATCATCG CAGCGGCCGA CGAGCACGGC ATGGCCATGG TGATGACTGG CATGCGCCAC
TTCCGTCACT GA
 
Protein sequence
MTVANHARPI RRALLSVSDK TGILEFAKAL HAQGVELLST GGTARLLADN GVPVIEVSDY 
TGHPEIMDGR VKTLHPKVHG GILARRGLDE NVMAANNINA IDLVAVNLYP FADTVAKAGC
TLEDAIENID IGGPTMVRAA AKNHKDVTIV VNAADYNRVL AEMAVNNGST THATRFDLAI
AAFEHTAGYD GMIANYFGTM VPMHRVPAHS TDECFQDSLS VEGSKFPRTF NTQLVKKQDL
RYGENSHQAA AFYVDTKIDE ASVATAVQLQ GKALSYNNIA DTDAALECVK EFSEPACVIV
KHANPCGVAL GKDLLDAYNR AYQTDPTSAF GGIIAFNGEL DAATASAIVE RQFVEVIIAP
VVSQGARDVV AKKTNVRLLE CGQWNTKTQT LDYKRVNGGL LVQDRDQGMV GLDDIKVVTK
RQPTESELKD LMFCWKVAKF VKSNAIVYAK DGMTIGVGAG QMSRVYSAKI AGIKAADEGL
EVVNSVMASD AFFPFRDGID AAAAAGISCI IQPGGSMRDA EIIAAADEHG MAMVMTGMRH
FRH