Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Syncc9605_0243 |
Symbol | purH |
ID | 3736171 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Synechococcus sp. CC9605 |
Kingdom | Bacteria |
Replicon accession | NC_007516 |
Strand | + |
Start bp | 250916 |
End bp | 252478 |
Gene Length | 1563 bp |
Protein Length | 520 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 637774824 |
Product | bifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase |
Protein accession | YP_380574 |
Protein GI | 78211795 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful) |
TIGRFAM ID | [TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 0.853465 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTCCTG TCGCTCTGCT GAGTGTTTCC GATAAGTCCG GGCTGGTGCC CCTGGCGGAG GCCCTGCATC GGACCCATGG CTATCAGCTG CTCTCCAGTG GGGGCACGGC CAAGGTGCTC GAGCAGGCCG GCCTACCGGT GACCCGTGTG TCGGACCACA CCGGGGCTCC AGAAATTCTT GGCGGTCGTG TGAAAACGCT CCATCCAAGG GTGCATGGCG GGATTTTGGC CAAGCGGGGT GACGCGTCTC ACCAGGCCGA TCTTGAGCAG CAGAACATCG CCCCCATCGA TATGGTGGTG GTCAACCTCT ATCCCTTCCG CGAAACGATT GCGCGGCCTG ACGTCACCTG GGATCAGGCG ATCGAGAACA TCGACATCGG TGGCCCTGCC ATGGTGCGGG CGGCCGCCAA GAATCACGCC GATGTGGCTG TCCTCACCAG TCCTGACCAA TACGACCGTC TGTTGACCGC CATGGCGGAG TCGGGCGGGA GCGTGCCTTC GGCGCTGCGG CGCCAACTGG CCCTTGAAGC GTTCAATCAC ACCGCGTCGT ACGACACCGC CATCGGCCGC TGGATGGCCG AGCAAGCCAC CGCAAAAGGC TGCCCCTGGT TGGAGGCGGT GCCGCTGCGG CAGACCCTGC GGTATGGCGA AAATCCCCAC CAGAAAGCGC GCTGGTTCAG CCATCCCAAA CAGGGTTGGG GTGGTGCCAT TCAGCTGCAG GGCAAGGAGC TGAGCACTAA CAACCTTCTG GATCTCGAGG CGGCCCTCGC CACGGTGCGG GAGTTCGGCT ACGGAGCCGA CGGCTCCGCA CCGGCGTCGC AACCCGCGGC CGTGGTCGTC AAGCACACCA ATCCCTGTGG CGTGGCCGTC GGAGCTTCGA TGCCTGCAGC ACTGACGCGG GCCCTGGATG CCGATCGGGT GAGTGCCTTC GGCGGCATCA TCGCCATGAA CGATGTGGTG GAAGCAACGG CGGCCCGTGA GCTCACCAGC CTGTTCCTGG AATGCGTCGT GGCACCAGGT TTCACGCCCG AAGCGCGGGA GGTGCTGGCG GCCAAAGCCA ATCTGCGCTT GTTGGAACTG GCTCCGCAGG CCATTGATGT GGCTGGCCCC GATCACGTGC GGAGCATTCT GGGTGGTCTC CTGGTTCAGG ATCTCGATGA CCAGGCGATC ACGCCGACCG ACTGGACCGT GGCCAGCCAG CGGCCGCCCA CACCCCAGGA AAAGCTGGAC CTTGAATTTG CCTGGCGTTT GGTGCGTCAC GTGCGCTCCA ACGCCATCGT TGTTGCCAAG GATGGGCAGA GCCTTGGCGT GGGTGCCGGG CAGATGAATC GCGTGGGCTC CGCGCGGATT GCCCTGGAAG CTGCAGGTGA GAAAGCGCAG GGAGCCGTTC TGGCAAGTGA TGGCTTCTTC CCGTTTGACG ACACAGTGCG TCTGGCTGCC AGCCAGGGCA TCACCGCAGT GATTCATCCC GGCGGGAGCA TGCGCGATGG CGATTCGATC AAAGCTTGCG ATGAGCTCGG CCTGGCGATG CAGCTCACGG GGCGCCGTCA TTTCCTGCAT TGA
|
Protein sequence | MAPVALLSVS DKSGLVPLAE ALHRTHGYQL LSSGGTAKVL EQAGLPVTRV SDHTGAPEIL GGRVKTLHPR VHGGILAKRG DASHQADLEQ QNIAPIDMVV VNLYPFRETI ARPDVTWDQA IENIDIGGPA MVRAAAKNHA DVAVLTSPDQ YDRLLTAMAE SGGSVPSALR RQLALEAFNH TASYDTAIGR WMAEQATAKG CPWLEAVPLR QTLRYGENPH QKARWFSHPK QGWGGAIQLQ GKELSTNNLL DLEAALATVR EFGYGADGSA PASQPAAVVV KHTNPCGVAV GASMPAALTR ALDADRVSAF GGIIAMNDVV EATAARELTS LFLECVVAPG FTPEAREVLA AKANLRLLEL APQAIDVAGP DHVRSILGGL LVQDLDDQAI TPTDWTVASQ RPPTPQEKLD LEFAWRLVRH VRSNAIVVAK DGQSLGVGAG QMNRVGSARI ALEAAGEKAQ GAVLASDGFF PFDDTVRLAA SQGITAVIHP GGSMRDGDSI KACDELGLAM QLTGRRHFLH
|
| |