Gene Sbal_0420 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSbal_0420 
SymbolpurH 
ID4844024 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella baltica OS155 
KingdomBacteria 
Replicon accessionNC_009052 
Strand
Start bp478607 
End bp480205 
Gene Length1599 bp 
Protein Length532 aa 
Translation table11 
GC content51% 
IMG OID640117642 
Productbifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase 
Protein accessionYP_001048821 
Protein GI126172672 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful) 
TIGRFAM ID[TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACTGCTG TAAATAATGC CAGACCCATT CGTCGCGCGC TGTTAAGCGT TTCAGATAAA 
ACCGGAATTC TCGAGTTCGC CAAAGCACTT CACGCCCAAG GTGTAGAATT GTTATCAACT
GGCGGCACCG CTCGCTTGTT AGCGGATAAC GGCGTGCCTG TTATCGAAGT ATCTGATTAC
ACAGGACACC CTGAGATCAT GGACGGTCGC GTTAAGACGC TGCACCCTAA AGTGCACGGC
GGCATTTTGG CGCGCCGCGG TCTTGATGAA AGCGTTATGG CCGACAACAA TATCAATGCC
ATCGATCTGG TTGCGGTTAA CCTTTATCCT TTCGCTGAAA CTGTTGCTAA AGCCGGTTGT
ACCTTAGAGG ACGCTATCGA AAATATCGAT ATTGGCGGCC CAACTATGGT GCGCGCAGCG
GCAAAAAACC ACAAAGACGT CACCATAGTC GTTAATGCGG CCGATTACTC ACGCGTACTG
GCAGAAATGA CGGCTAACAA TGGCAGCACG ACTCATGCGA CGCGTTTCGA CTTAGCGATT
GCAGCCTTTG AGCACACTGC GGGTTACGAT GGCATGATCG CCAACTACTT CGGCACTATG
GTTCCTGCGC ATAGCACGGA CGAATGCTTT GCTGATTCTA AGTTCCCACG CACGTTCAAC
ACCCAATTAG TGAAGAAGCA AGACTTACGC TATGGCGAAA ACAGCCATCA AGCGGCGGCT
TTCTATGTTG ACACTAAAAT TGATGAAGCC TCTGTGGCGA CGGCAATTCA GTTGCAAGGC
AAAGCCTTGT CTTACAACAA CATTGCCGAT ACCGACGCCG CCCTTGAGTG CGTAAAAGAA
TTCTTGGAAC CCGCCTGCGT TATCGTTAAA CACGCTAACC CATGTGGTGT GGCCTTAGGT
AAAGACTTGC TCGATGCCTA TAACCGCGCT TATCAAACTG ACCCAACGTC AGCCTTCGGT
GGCATTATTG CTTTCAACGG CGAGTTAGAT GCAGCGACGG CGAGTGCTAT CGTTGAGCGT
CAATTCGTTG AAGTGATTAT CGCCCCAAGC GTCAGCCAAG CGGCACGCGA TGTGGTGGCG
AAAAAGACCA ACGTGCGTTT ATTGGAATGT GGTCAGTGGA ACACTAAGAC CCAAACCTTA
GACTACAAAC GCGTTAATGG CGGCTTGTTA GTACAAGATC GCGACCAAGG CATGGTCGGT
TTAGAAGACA TCAAAGTGGT TTCTAAACGT CAACCAACTG CAAGCGAACT GAAAGACTTA
ATGTTCTGCT GGAAAGTGGC GAAATTCGTT AAATCTAACG CCATCGTTTA TGCCAAAGAC
GGCATGACTA TCGGTGTCGG CGCAGGCCAA ATGAGCCGCG TTTACAGCGC TAAAATCGCT
GGCATCAAGG CCGCCGATGA AGGCTTAGAA GTAGTGAACT CTGTGATGGC ATCCGATGCT
TTCTTCCCCT TCCGTGACGG TATCGATGCC GCTGCGGCTG CGGGCATTAG CTGCATCATC
CAACCGGGTG GCTCAATGCG CGATGCTGAA ATCATCGCCG CAGCAGACGA GCACGGCATG
GCCATGGTAA TGACGGGCAT GCGCCACTTC CGTCACTAA
 
Protein sequence
MTAVNNARPI RRALLSVSDK TGILEFAKAL HAQGVELLST GGTARLLADN GVPVIEVSDY 
TGHPEIMDGR VKTLHPKVHG GILARRGLDE SVMADNNINA IDLVAVNLYP FAETVAKAGC
TLEDAIENID IGGPTMVRAA AKNHKDVTIV VNAADYSRVL AEMTANNGST THATRFDLAI
AAFEHTAGYD GMIANYFGTM VPAHSTDECF ADSKFPRTFN TQLVKKQDLR YGENSHQAAA
FYVDTKIDEA SVATAIQLQG KALSYNNIAD TDAALECVKE FLEPACVIVK HANPCGVALG
KDLLDAYNRA YQTDPTSAFG GIIAFNGELD AATASAIVER QFVEVIIAPS VSQAARDVVA
KKTNVRLLEC GQWNTKTQTL DYKRVNGGLL VQDRDQGMVG LEDIKVVSKR QPTASELKDL
MFCWKVAKFV KSNAIVYAKD GMTIGVGAGQ MSRVYSAKIA GIKAADEGLE VVNSVMASDA
FFPFRDGIDA AAAAGISCII QPGGSMRDAE IIAAADEHGM AMVMTGMRHF RH