Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Syncc9902_0272 |
Symbol | purH |
ID | 3742317 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Synechococcus sp. CC9902 |
Kingdom | Bacteria |
Replicon accession | NC_007513 |
Strand | + |
Start bp | 284925 |
End bp | 286487 |
Gene Length | 1563 bp |
Protein Length | 520 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 637770439 |
Product | bifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase |
Protein accession | YP_376288 |
Protein GI | 78183854 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful) |
TIGRFAM ID | [TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCTCCTG TCGCTCTGCT GAGTGTGTCC GACAAATCTG GGCTTTTGCC GTTGGCCGAG GCTCTGCATC GAATCCATGG CTATCAATTG CTCTCCAGTG GTGGCACCGC CAAGGTGCTT GAGCAGGCTG GCCTTCCGGT AACCCGTGTT TCCGAGTACA CCGGGGCCCC TGAGATTTTG GGTGGCCGCG TCAAAACGCT GCATCCCCGT GTTCACGGTG GAATTTTGGC CAAGCGTGGT GATGCGGCCC ATCAGAACGA CCTTGAACAA CAGAACATCA ACTTCATTGA TGTGGTGGTC GTGAATCTGT ATCCCTTCCG GGAAACAGTT GCCAAGGCTG ATGTCACTTG GGATCAAGCG ATTGAAAACA TTGATATTGG TGGGCCCACC ATGGTGCGCT CTGCCGCGAA AAATCATGCC GACGTTGCTG TTCTGACCAG TCCAGATCAG TACGACCGTT TGCTTGAAGC GATGGCTCAA GCCGGTGGTG AGGTGCCGGC GGCATTACGC CGTCAGCTTG CTCTTGAAGC CTTCCAGCAC ACTGCGGCCT ACGACACCGC CATTAGCCGC TGGATGGACC AGGCGGTGGC CGCAGATGGA TCCCCTTGGC TTGAGGCGGT TCCTCTGCGT CAAACCTTGC GCTACGGCGA GAACCCTCAT CAGAAAGCCC GTTGGTATAG CCATGCCCAG CAGGGATGGG GCGGTGCGGT TCAACTGCAA GGCAAGGAAC TGAGTACGAA CAATCTGTTG GATCTCGAAG CTGCTCTCGC CATGGTTCGG GAGTTTGGCT ACGGCTCTGA TGGCGCTGAG CCGGCTGTTC AGCCAGCCGC GGTGGTGGTG AAACACACCA ATCCCTGTGG TGTTGCCATC GGATCGGATG TGTCAACTGC ACTCACGAGG GCCTTGGATG CTGATCGAGT CAGTGCCTTT GGGGGAATCG TCGCCATCAA TGGCGTGGTG AGCGCCGCAG CGGCAGGGGA ACTGAAAAGC TTGTTTTTGG AATGCGTCGT GGCGCCAAGC TTTTCTCCAG AAGCCAGAGA GATTCTTGCG GCCAAAGCGA ATCTGCGTTT GCTGGAGCTC CAGCCTGCCG CGATCGATGC GGCGGGCCCC GACCACGTCC GCAGCATTCT TGGTGGATTG TTGGTTCAAG ACCTAGACGA TCAAGCGATC ACACCAAGCG AGTGGACAGT GGCAAGTCAG CGGCCTCCCT CATCCCAGGA ACAGCAGGAT TTGGAGTTCG CTTGGCGATT GGTGCGTCAC GTGCGTTCCA ACGCCATCGT GGTCGCCTCC AAGGGGCAGA GCTTGGGCAT AGGGGCCGGT CAAATGAACC GGGTTGGCTC GGCTCGCCTC GCGCTTGATG CGGCTGGGGA TCAAGCCACA GGGGCTGTGC TGGCCAGTGA TGGATTTTTC CCGTTTGACG ACACCGTGCG TCTTGCGGCG AGCCACGGAA TTACAGCTGT AATTCATCCA GGTGGCAGCT TGCGCGATGC GGATTCGATC AAGGCCTGTG ACGAACTGGG GCTCGCAATG CTGTTAACAG GCCGTCGACA CTTCCTTCAT TGA
|
Protein sequence | MAPVALLSVS DKSGLLPLAE ALHRIHGYQL LSSGGTAKVL EQAGLPVTRV SEYTGAPEIL GGRVKTLHPR VHGGILAKRG DAAHQNDLEQ QNINFIDVVV VNLYPFRETV AKADVTWDQA IENIDIGGPT MVRSAAKNHA DVAVLTSPDQ YDRLLEAMAQ AGGEVPAALR RQLALEAFQH TAAYDTAISR WMDQAVAADG SPWLEAVPLR QTLRYGENPH QKARWYSHAQ QGWGGAVQLQ GKELSTNNLL DLEAALAMVR EFGYGSDGAE PAVQPAAVVV KHTNPCGVAI GSDVSTALTR ALDADRVSAF GGIVAINGVV SAAAAGELKS LFLECVVAPS FSPEAREILA AKANLRLLEL QPAAIDAAGP DHVRSILGGL LVQDLDDQAI TPSEWTVASQ RPPSSQEQQD LEFAWRLVRH VRSNAIVVAS KGQSLGIGAG QMNRVGSARL ALDAAGDQAT GAVLASDGFF PFDDTVRLAA SHGITAVIHP GGSLRDADSI KACDELGLAM LLTGRRHFLH
|
| |