Gene Shew185_3913 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagShew185_3913 
SymbolpurH 
ID5371186 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella baltica OS185 
KingdomBacteria 
Replicon accessionNC_009665 
Strand
Start bp4640094 
End bp4641692 
Gene Length1599 bp 
Protein Length532 aa 
Translation table11 
GC content52% 
IMG OID640832174 
Productbifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase 
Protein accessionYP_001368100 
Protein GI153002419 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful) 
TIGRFAM ID[TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase 


Plasmid Coverage information

Num covering plasmid clones40 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACTGCTG CAAATAATGC CAGACCCATT CGTCGCGCGC TGTTAAGCGT TTCAGATAAA 
ACCGGAATTC TCGAGTTCGC CAAAGCACTT CACGCCCAAG GTGTGGAGCT GTTGTCAACT
GGCGGCACCG CTCGCCTGTT AGCGGATAAC GGCGTGCCTG TTATCGAAGT ATCTGATTAC
ACAGGACACC CTGAGATCAT GGACGGTCGC GTTAAGACGC TGCACCCTAA AGTGCACGGC
GGCATTTTGG CGCGCCGCGG TCTTGATGAA AGCGTGATGG CCGACAACAA TATCAATGCC
ATCGATCTGG TTGCGGTTAA CCTTTATCCT TTCGCTGAAA CTGTGGCTAA AGCCGGTTGT
ACCTTAGAAG ACGCTATCGA AAATATCGAT ATTGGCGGCC CAACTATGGT GCGCGCAGCG
GCAAAAAACC ACAAAGACGT CACCATAGTC GTTAATGCCG CCGATTACTC ACGCGTACTG
GCAGAAATGA CGGCTAACAA TGGCAGCACG ACCCATGCGA CGCGTTTCGA CTTAGCGATT
GCGGCCTTTG AGCACACTGC GGGTTACGAT GGCATGATCG CCAACTACTT CGGCACTATG
GTTCCTGCGC ATAGCACGGA CGAATGCTTT GCTGATTCTA AGTTCCCACG CACGTTCAAC
ACCCAATTAG TGAAGAAGCA AGACTTACGC TATGGCGAAA ACAGCCATCA AGCGGCGGCC
TTCTATGTCG ATACGAAAAT TGATGAAGCC TCTGTGGCGA CGGCAATTCA GTTGCAAGGC
AAAGCCTTGT CTTACAACAA CATTGCCGAT ACCGACGCCG CTCTTGAGTG CGTAAAAGAA
TTCTTGGAAC CCGCCTGCGT TATCGTTAAA CACGCTAACC CATGTGGTGT GGCCTTAGGT
AAAGACTTGC TCGATGCCTA TAACCGCGCT TATCAAACTG ACCCAACCTC AGCCTTCGGT
GGCATTATTG CTTTCAACGG CGAGTTAGAT GCCGCGACGG CGAGTGCTAT CGTTGAGCGT
CAATTCGTTG AAGTGATTAT CGCCCCAAGC GTCAGCCAAG CGGCGCGCGA TGTGGTGGCG
AAAAAGACCA ACGTGCGTTT ATTGGAATGT GGTCAGTGGA ACACTAAGAC CCAAACCTTA
GACTACAAAC GCGTTAACGG CGGCTTGTTA GTACAAGATC GCGACCAAGG CATGGTCGGC
TTAGAAGACA TCAAAGTGGT TTCTAAACGT CAACCAACTG CAAGCGAACT GAAAGACTTA
ATGTTCTGCT GGAAAGTGGC GAAATTCGTT AAATCTAACG CCATCGTTTA TGCCAAAGAC
GGCATGACTA TCGGTGTCGG CGCAGGCCAA ATGAGCCGCG TTTACAGCGC TAAAATCGCT
GGCATCAAGG CCGCCGATGA AGGCTTAGAA GTAGTGAACT CTGTGATGGC ATCCGATGCT
TTCTTCCCCT TCCGTGACGG TATCGATGCC GCAGCGGCTG CGGGCATTAG CTGCATCATC
CAACCGGGTG GCTCAATGCG CGATGCTGAA ATCATCGCTG CAGCAGACGA GCACGGCATG
GCCATGGTGA TGACGGGCAT GCGCCACTTC CGTCACTAA
 
Protein sequence
MTAANNARPI RRALLSVSDK TGILEFAKAL HAQGVELLST GGTARLLADN GVPVIEVSDY 
TGHPEIMDGR VKTLHPKVHG GILARRGLDE SVMADNNINA IDLVAVNLYP FAETVAKAGC
TLEDAIENID IGGPTMVRAA AKNHKDVTIV VNAADYSRVL AEMTANNGST THATRFDLAI
AAFEHTAGYD GMIANYFGTM VPAHSTDECF ADSKFPRTFN TQLVKKQDLR YGENSHQAAA
FYVDTKIDEA SVATAIQLQG KALSYNNIAD TDAALECVKE FLEPACVIVK HANPCGVALG
KDLLDAYNRA YQTDPTSAFG GIIAFNGELD AATASAIVER QFVEVIIAPS VSQAARDVVA
KKTNVRLLEC GQWNTKTQTL DYKRVNGGLL VQDRDQGMVG LEDIKVVSKR QPTASELKDL
MFCWKVAKFV KSNAIVYAKD GMTIGVGAGQ MSRVYSAKIA GIKAADEGLE VVNSVMASDA
FFPFRDGIDA AAAAGISCII QPGGSMRDAE IIAAADEHGM AMVMTGMRHF RH