Gene Sbal223_3840 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSbal223_3840 
SymbolpurH 
ID7088875 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella baltica OS223 
KingdomBacteria 
Replicon accessionNC_011663 
Strand
Start bp4549580 
End bp4551178 
Gene Length1599 bp 
Protein Length532 aa 
Translation table11 
GC content52% 
IMG OID643462719 
Productbifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase 
Protein accessionYP_002359740 
Protein GI217974989 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful) 
TIGRFAM ID[TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase 


Plasmid Coverage information

Num covering plasmid clones35 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.28825 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTGCTG CAAATAATGC CAGACCCATT CGTCGCGCGC TGTTAAGCGT TTCAGATAAA 
ACCGGAATTC TCGAGTTCGC CAAAGCACTT CACGCCCAAG GTGTGGAGCT GTTGTCAACT
GGCGGCACCG CTCGCCTGTT AGCGGATAAC GGCGTGCCTG TTATCGAAGT ATCTGATTAC
ACAGGACACC CTGAGATCAT GGACGGTCGC GTTAAGACGC TGCACCCTAA AGTGCACGGC
GGCATTTTGG CGCGCCGCGG TCTTGATGAA AGCGTTATGG CCGACAACAA TATCAATGCC
ATCGATCTGG TTGCGGTTAA CCTTTATCCT TTCGCTGAAA CTGTGGCTAA AGCCGGTTGT
ACCTTAGAAG ACGCTATCGA AAATATCGAT ATTGGCGGCC CAACTATGGT GCGCGCAGCG
GCAAAAAACC ACAAAGACGT CACCATTGTC GTTAATGCGG CCGATTACTC GCGCGTACTA
GCAGAAATGA CGGCTAACAA TGGCAGCACG ACCCATGCGA CGCGTTTCGA CTTAGCGATT
GCGGCCTTTG AGCACACTGC GGGTTACGAT GGCATGATCG CCAACTACTT CGGCACTATG
ATTCCTGCGC ATAGCACGGA CGAATGCTTT GCTGATTCTA AGTTCCCACG CACGTTCAAC
ACCCAATTAG TGAAGAAGCA AGACTTACGC TATGGCGAAA ACAGCCATCA AGCGGCGGCC
TTCTATGTCG ATACGAAAAT TAATGAAGCC TCTGTGGCGA CGGCAATTCA GTTGCAAGGC
AAAGCCTTGT CTTACAACAA CATTGCCGAT ACCGACGCCG CTCTTGAGTG CGTAAAAGAA
TTCTTGGAAC CCGCCTGCGT TATCGTTAAA CACGCTAACC CATGTGGTGT GGCCTTAGGT
AAAGACTTGC TCGATGCCTA TAACCGCGCT TATCAAACTG ACCCAACCTC AGCCTTCGGT
GGCATTATTG CTTTCAACGG CGAGTTAGAT GCCGCGACGG CGAGTGCTAT CGTTGAGCGT
CAATTCGTTG AAGTGATTAT CGCCCCAAGC GTCAGCCAAG CGGCGCGCGA TGTGGTGGCG
AAAAAGACCA ACGTGCGTTT ATTGGAATGT GGTCAGTGGA ACACTAAGAC CCAAACCTTA
GACTACAAAC GCGTTAACGG CGGCTTGTTA GTACAAGATC GCGACCAAGG CATGGTCGGC
TTAGAAGACA TCAAAGTGGT TTCTAAACGT CAACCAACTG CAAGCGAACT GAAAGACTTA
ATGTTCTGCT GGAAAGTAGC GAAATTCGTT AAATCTAACG CCATCGTTTA TGCCAAAGAC
GGCATGACTA TCGGTGTCGG CGCAGGCCAA ATGAGCCGCG TTTACAGCGC TAAAATCGCT
GGCATCAAGG CCGCCGATGA AGGCTTAGAA GTAGTGAACT CTGTGATGGC ATCCGATGCT
TTCTTCCCCT TCCGTGACGG TATCGATGCC GCTGCGGCTG CGGGCATTAG CTGCATCATC
CAACCGGGTG GCTCAATGCG CGATGCTGAA ATCATCGCCG CAGCAGACGA GCACGGCATG
GCCATGGTGA TGACGGGCAT GCGCCACTTC CGTCACTAA
 
Protein sequence
MTAANNARPI RRALLSVSDK TGILEFAKAL HAQGVELLST GGTARLLADN GVPVIEVSDY 
TGHPEIMDGR VKTLHPKVHG GILARRGLDE SVMADNNINA IDLVAVNLYP FAETVAKAGC
TLEDAIENID IGGPTMVRAA AKNHKDVTIV VNAADYSRVL AEMTANNGST THATRFDLAI
AAFEHTAGYD GMIANYFGTM IPAHSTDECF ADSKFPRTFN TQLVKKQDLR YGENSHQAAA
FYVDTKINEA SVATAIQLQG KALSYNNIAD TDAALECVKE FLEPACVIVK HANPCGVALG
KDLLDAYNRA YQTDPTSAFG GIIAFNGELD AATASAIVER QFVEVIIAPS VSQAARDVVA
KKTNVRLLEC GQWNTKTQTL DYKRVNGGLL VQDRDQGMVG LEDIKVVSKR QPTASELKDL
MFCWKVAKFV KSNAIVYAKD GMTIGVGAGQ MSRVYSAKIA GIKAADEGLE VVNSVMASDA
FFPFRDGIDA AAAAGISCII QPGGSMRDAE IIAAADEHGM AMVMTGMRHF RH