Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sbal195_4036 |
Symbol | purH |
ID | 5755855 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shewanella baltica OS195 |
Kingdom | Bacteria |
Replicon accession | NC_009997 |
Strand | + |
Start bp | 4747653 |
End bp | 4749251 |
Gene Length | 1599 bp |
Protein Length | 532 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641290382 |
Product | bifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase |
Protein accession | YP_001556456 |
Protein GI | 160877140 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful) |
TIGRFAM ID | [TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACTGCTG CAAATAATGC CAGACCCATT CGTCGCGCGC TGTTAAGCGT TTCAGATAAA ACCGGAATTC TCGAGTTCGC CAAAGCACTT CACGCCCAAG GTGTGGAACT GTTATCAACT GGCGGCACCG CTCGCCTGTT AGCGGATAAC GGCGTGCCTG TTATCGAAGT ATCTGATTAC ACAGGACACC CTGAGATCAT GGACGGTCGC GTTAAGACGC TGCACCCTAA AGTGCACGGC GGCATTTTGG CGCGCCGCGG TCTTGATGAA AGCGTTATGG CCGACAACAA TATCAACGCC ATCGATCTGG TTGCGGTTAA CCTTTATCCT TTCGCTGAAA CCGTAGCTAA AGCCGGTTGT ACCTTAGAGG ACGCTATCGA AAATATCGAT ATTGGCGGCC CAACTATGGT GCGCGCAGCG GCAAAAAACC ACAAAGACGT CACCATAGTC GTTAATGCCG CCGATTACTC ACGCGTACTG GCAGAAATGA CGGCTAACAA TGGCAGCACG ACCCATGCGA CGCGTTTCGA CTTAGCGATT GCAGCCTTTG AGCACACTGC GGGTTACGAT GGCATGATCG CCAACTACTT CGGCACTATG GTTCCTGCAC ACAGCACGGA CGAATGCTTT GCTGATTCTA AGTTCCCACG CACGTTCAAC ACCCAATTAG TGAAGAAGCA AGACTTACGC TATGGCGAAA ACAGCCATCA AGCGGCGGCC TTCTATGTCG ACACTAAAAT TGATGAAGCC TCTGTGGCGA CGGCAATTCA GTTGCAAGGC AAAGCTTTGT CTTACAACAA CATTGCCGAT ACAGACGCCG CTCTTGAGTG CGTAAAAGAA TTCTTGGAAC CTGCCTGCGT TATCGTTAAA CACGCTAACC CATGTGGTGT GGCCTTAGGT AAAGACTTGC TCGATGCCTA TAACCGCGCT TATCAAACAG ACCCAACGTC AGCCTTCGGT GGCATTATTG CTTTCAACGG CGAGTTAGAT GCCGCGACGG CGAGTGCTAT CGTTGAGCGT CAATTCGTTG AAGTGATTAT CGCCCCAAGC GTCAGCCAAG CGGCGCGCGA TGTGGTGGCG AAAAAGACCA ACGTACGTTT ATTGGAATGT GGTCAGTGGA ACACTAAGAC CCAAACCTTA GACTTCAAAC GCGTTAACGG CGGCTTGTTA GTACAAGATC GCGACCAAGG CATGGTCGGC TTAGAAGACA TCAAAGTGGT TTCTAAACGT CAACCAACTG CAAGCGAACT GAAAGACTTA ATGTTCTGCT GGAAAGTAGC GAAATTCGTT AAATCTAACG CCATCGTTTA TGCAAAAGAC GGCATGACTA TCGGTGTCGG CGCAGGCCAA ATGAGCCGCG TTTACAGCGC TAAAATCGCT GGCATCAAGG CCGCCGACGA AGGTTTAGAA GTAGTGAACT CTGTGATGGC ATCCGATGCT TTCTTCCCCT TCCGTGACGG TATCGATGCC GCAGCGGCTG CGGGCATTAG CTGCATCATC CAACCGGGTG GCTCAATGCG CGATGCAGAA ATCATCGCCG CAGCAGACGA GCACGGCATG GCCATGGTGA TGACGGGCAT GCGCCACTTC CGTCACTAA
|
Protein sequence | MTAANNARPI RRALLSVSDK TGILEFAKAL HAQGVELLST GGTARLLADN GVPVIEVSDY TGHPEIMDGR VKTLHPKVHG GILARRGLDE SVMADNNINA IDLVAVNLYP FAETVAKAGC TLEDAIENID IGGPTMVRAA AKNHKDVTIV VNAADYSRVL AEMTANNGST THATRFDLAI AAFEHTAGYD GMIANYFGTM VPAHSTDECF ADSKFPRTFN TQLVKKQDLR YGENSHQAAA FYVDTKIDEA SVATAIQLQG KALSYNNIAD TDAALECVKE FLEPACVIVK HANPCGVALG KDLLDAYNRA YQTDPTSAFG GIIAFNGELD AATASAIVER QFVEVIIAPS VSQAARDVVA KKTNVRLLEC GQWNTKTQTL DFKRVNGGLL VQDRDQGMVG LEDIKVVSKR QPTASELKDL MFCWKVAKFV KSNAIVYAKD GMTIGVGAGQ MSRVYSAKIA GIKAADEGLE VVNSVMASDA FFPFRDGIDA AAAAGISCII QPGGSMRDAE IIAAADEHGM AMVMTGMRHF RH
|
| |