Gene Sputcn32_3401 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSputcn32_3401 
SymbolpurH 
ID5078023 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella putrefaciens CN-32 
KingdomBacteria 
Replicon accessionNC_009438 
Strand
Start bp3958589 
End bp3960220 
Gene Length1632 bp 
Protein Length543 aa 
Translation table11 
GC content52% 
IMG OID640500601 
Productbifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase 
Protein accessionYP_001184911 
Protein GI146294487 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful) 
TIGRFAM ID[TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase 


Plasmid Coverage information

Num covering plasmid clones32 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACTGCTG CAAATAATGC CAGACCCATT CGTCGCGCGC TGTTAAGCGT TTCAGATAAA 
ACCGGAATTC TCGAGTTCGC CAAAGCACTT CACGCCCAAG GTGTAGAATT GTTATCGACT
GGCGGCACCG CTCGCCTGTT AGCGGATAAC GGCGTGCCTG TTATCGAAGT ATCTGATTAC
ACAGGACACC CTGAGATCAT GGACGGTCGC GTTAAGACGC TGCACCCTAA AGTGCACGGC
GGCATTTTAG CGCGCCGCGG TCTTGATGAA AGCGTGATGG CCGACAACAA TATCAATGCC
ATCGATCTGG TTGCGGTTAA CCTTTATCCC TTCGCTGAAA CGGTCGCTAA AGCTGGTTGT
ACCTTAGAAG ACGCTATCGA AAATATCGAT ATTGGCGGTC CAACTATGGT GCGCGCGGCG
GCGAAAAACC ATAAAGATGT GACTATCGTG GTGAATGCGG CCGATTACTC ACGCGTTCTG
GCTGAAATGA CGGCTAACAA TGGCAGCACC ACCCATGCGA CGCGTTTTGA TTTAGCGATT
GCCGCCTTTG AGCACACTGC GGGTTACGAT GGTATGATCG CCAACTACTT CGGCACTATG
GTTCCAATGC ATAGGGTTCC AGCCCACAGC ACGGACGAAT GTTTCGAAGA CTCCCTATCC
GTTGATGGCT CAAAGTTCCC ACGCACCTTC AACACTCAAT TAGTGAAGAA ACAAGATTTA
CGCTATGGCG AAAACAGCCA TCAAAAAGCG GCTTTCTATG TTGACACTAA AATTGATGAA
GCTTCTGTGG CTACCGCGAT TCAGTTGCAA GGCAAAGCCT TGTCTTACAA CAACATTGCC
GATACTGATG CGGCCCTTGA GTGTGTAAAA GAGTTCAGTG AGCCCGCCTG CGTTATCGTT
AAACACGCTA ACCCTTGTGG TGTCGCACTC GGTAAAGACC TGCTGGACGC CTACAACCGC
GCCTATCAAA CTGACCCAAC ATCTGCTTTT GGCGGCATTA TCGCCTTCAA CGGCGAATTA
GATGCCGAGA CGGCCAGCGC TATCGTTGAG CGTCAATTCG TTGAAGTGAT TATCGCCCCA
AGCGTCAGCC AAGCGGCGCG CGATGTGATT GCGAAAAAGA CCAACGTGCG TTTATTGGAA
TGTGGTCAGT GGAACACTAA GACCCAAACC TTAGACTACA AACGCGTTAA CGGCGGCTTG
TTAGTGCAAG ATCGCGACCA AGGCATGGTT GGCTTAGACG ACATTAAAGT CGTGACTAAG
CGTCAACCCA CAGAGAGTGA ACTGAAAGAC TTAATGTTCT GCTGGAAAGT GGCGAAGTTC
GTTAAATCTA ACGCCATCGT TTATGCCAAA GACGGCATGA CTATCGGTGT CGGCGCTGGC
CAAATGAGCC GCGTTTACAG CGCTAAAATC GCCGGCATCA AGGCCGCCGA TGAAGGGCTA
GAAGTGGTCA ACTCAGTGAT GGCCTCCGAT GCCTTCTTCC CCTTCCGCGA CGGTATCGAT
GCCGCTGCGG CGGCAGGCAT TAGCTGCATC ATTCAACCGG GTGGCTCAAT GCGCGATGCT
GAAATCATCG CTGCTGCAGA CGAGCACGGC ATGGCTATGG TGATGACAGG CATGCGCCAC
TTCCGTCATT GA
 
Protein sequence
MTAANNARPI RRALLSVSDK TGILEFAKAL HAQGVELLST GGTARLLADN GVPVIEVSDY 
TGHPEIMDGR VKTLHPKVHG GILARRGLDE SVMADNNINA IDLVAVNLYP FAETVAKAGC
TLEDAIENID IGGPTMVRAA AKNHKDVTIV VNAADYSRVL AEMTANNGST THATRFDLAI
AAFEHTAGYD GMIANYFGTM VPMHRVPAHS TDECFEDSLS VDGSKFPRTF NTQLVKKQDL
RYGENSHQKA AFYVDTKIDE ASVATAIQLQ GKALSYNNIA DTDAALECVK EFSEPACVIV
KHANPCGVAL GKDLLDAYNR AYQTDPTSAF GGIIAFNGEL DAETASAIVE RQFVEVIIAP
SVSQAARDVI AKKTNVRLLE CGQWNTKTQT LDYKRVNGGL LVQDRDQGMV GLDDIKVVTK
RQPTESELKD LMFCWKVAKF VKSNAIVYAK DGMTIGVGAG QMSRVYSAKI AGIKAADEGL
EVVNSVMASD AFFPFRDGID AAAAAGISCI IQPGGSMRDA EIIAAADEHG MAMVMTGMRH
FRH