Gene Sbal195_4036 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSbal195_4036 
SymbolpurH 
ID5755855 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella baltica OS195 
KingdomBacteria 
Replicon accessionNC_009997 
Strand
Start bp4747653 
End bp4749251 
Gene Length1599 bp 
Protein Length532 aa 
Translation table11 
GC content52% 
IMG OID641290382 
Productbifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase 
Protein accessionYP_001556456 
Protein GI160877140 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful) 
TIGRFAM ID[TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTGCTG CAAATAATGC CAGACCCATT CGTCGCGCGC TGTTAAGCGT TTCAGATAAA 
ACCGGAATTC TCGAGTTCGC CAAAGCACTT CACGCCCAAG GTGTGGAACT GTTATCAACT
GGCGGCACCG CTCGCCTGTT AGCGGATAAC GGCGTGCCTG TTATCGAAGT ATCTGATTAC
ACAGGACACC CTGAGATCAT GGACGGTCGC GTTAAGACGC TGCACCCTAA AGTGCACGGC
GGCATTTTGG CGCGCCGCGG TCTTGATGAA AGCGTTATGG CCGACAACAA TATCAACGCC
ATCGATCTGG TTGCGGTTAA CCTTTATCCT TTCGCTGAAA CCGTAGCTAA AGCCGGTTGT
ACCTTAGAGG ACGCTATCGA AAATATCGAT ATTGGCGGCC CAACTATGGT GCGCGCAGCG
GCAAAAAACC ACAAAGACGT CACCATAGTC GTTAATGCCG CCGATTACTC ACGCGTACTG
GCAGAAATGA CGGCTAACAA TGGCAGCACG ACCCATGCGA CGCGTTTCGA CTTAGCGATT
GCAGCCTTTG AGCACACTGC GGGTTACGAT GGCATGATCG CCAACTACTT CGGCACTATG
GTTCCTGCAC ACAGCACGGA CGAATGCTTT GCTGATTCTA AGTTCCCACG CACGTTCAAC
ACCCAATTAG TGAAGAAGCA AGACTTACGC TATGGCGAAA ACAGCCATCA AGCGGCGGCC
TTCTATGTCG ACACTAAAAT TGATGAAGCC TCTGTGGCGA CGGCAATTCA GTTGCAAGGC
AAAGCTTTGT CTTACAACAA CATTGCCGAT ACAGACGCCG CTCTTGAGTG CGTAAAAGAA
TTCTTGGAAC CTGCCTGCGT TATCGTTAAA CACGCTAACC CATGTGGTGT GGCCTTAGGT
AAAGACTTGC TCGATGCCTA TAACCGCGCT TATCAAACAG ACCCAACGTC AGCCTTCGGT
GGCATTATTG CTTTCAACGG CGAGTTAGAT GCCGCGACGG CGAGTGCTAT CGTTGAGCGT
CAATTCGTTG AAGTGATTAT CGCCCCAAGC GTCAGCCAAG CGGCGCGCGA TGTGGTGGCG
AAAAAGACCA ACGTACGTTT ATTGGAATGT GGTCAGTGGA ACACTAAGAC CCAAACCTTA
GACTTCAAAC GCGTTAACGG CGGCTTGTTA GTACAAGATC GCGACCAAGG CATGGTCGGC
TTAGAAGACA TCAAAGTGGT TTCTAAACGT CAACCAACTG CAAGCGAACT GAAAGACTTA
ATGTTCTGCT GGAAAGTAGC GAAATTCGTT AAATCTAACG CCATCGTTTA TGCAAAAGAC
GGCATGACTA TCGGTGTCGG CGCAGGCCAA ATGAGCCGCG TTTACAGCGC TAAAATCGCT
GGCATCAAGG CCGCCGACGA AGGTTTAGAA GTAGTGAACT CTGTGATGGC ATCCGATGCT
TTCTTCCCCT TCCGTGACGG TATCGATGCC GCAGCGGCTG CGGGCATTAG CTGCATCATC
CAACCGGGTG GCTCAATGCG CGATGCAGAA ATCATCGCCG CAGCAGACGA GCACGGCATG
GCCATGGTGA TGACGGGCAT GCGCCACTTC CGTCACTAA
 
Protein sequence
MTAANNARPI RRALLSVSDK TGILEFAKAL HAQGVELLST GGTARLLADN GVPVIEVSDY 
TGHPEIMDGR VKTLHPKVHG GILARRGLDE SVMADNNINA IDLVAVNLYP FAETVAKAGC
TLEDAIENID IGGPTMVRAA AKNHKDVTIV VNAADYSRVL AEMTANNGST THATRFDLAI
AAFEHTAGYD GMIANYFGTM VPAHSTDECF ADSKFPRTFN TQLVKKQDLR YGENSHQAAA
FYVDTKIDEA SVATAIQLQG KALSYNNIAD TDAALECVKE FLEPACVIVK HANPCGVALG
KDLLDAYNRA YQTDPTSAFG GIIAFNGELD AATASAIVER QFVEVIIAPS VSQAARDVVA
KKTNVRLLEC GQWNTKTQTL DFKRVNGGLL VQDRDQGMVG LEDIKVVSKR QPTASELKDL
MFCWKVAKFV KSNAIVYAKD GMTIGVGAGQ MSRVYSAKIA GIKAADEGLE VVNSVMASDA
FFPFRDGIDA AAAAGISCII QPGGSMRDAE IIAAADEHGM AMVMTGMRHF RH