Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ava_3818 |
Symbol | purH |
ID | 3678750 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anabaena variabilis ATCC 29413 |
Kingdom | Bacteria |
Replicon accession | NC_007413 |
Strand | + |
Start bp | 4750459 |
End bp | 4751979 |
Gene Length | 1521 bp |
Protein Length | 506 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 637719170 |
Product | bifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase |
Protein accession | YP_324318 |
Protein GI | 75910022 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful) |
TIGRFAM ID | [TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.00000025454 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGCGTC TAGCACTGCT GAGTGTATCT AACAAGACTG GTTTAATTGA CTTAGCTCGT CGCTTGGTAG AAGAATTTGA GTTTGATTTA ATCAGCAGTG GGGGGACAGC CCAAGCCCTC AAGGATGCGG GTTTACCTGT GACGAAGGTT GCAGATTACA CGGGTTCGCC AGAGATTTTG GGTGGACGGG TGAAAACTCT CCATCCCCGG ATTCATGGGG GGATTTTGGC TAGGCGGGAT GTACCTAGTG ATTTGACGGA TTTGGAAAAT AACCAAATTC GCCCGATTGA TTTGGTGGTG GTGAATCTTT ACCCGTTTGA GGAGACTATT GCTAAACCAG GGGTGACGTT GGCGGAAGCT GTGGAACAAA TTGATATTGG CGGCCCGGCG ATGTTACGGG CATCATCAAA GAATTTTGCC CATCTAACAG TCTTATGTGA TCCGGCGCAG TATGATGAAT ATCTGCAAGA ATTACGACAA AATAACGGAG TAGCTTCCTT AGAATTTCGG CAAAAGGCAG CTTTGAAGGG GTTTTTGCAT ACGGCGAGTT ATGATAGTGC CATTGCCTCT TACCTCTCAG GTACACAACA GCATACCCTC AACGGTACAG AATTACAATC TCTGCGTTAC GGTGAGAATC CCCATCAGCC CGCAGCTTGG TATCAAACTG GAACTACGCC AACAGGGTGG ACGGCAGCCA AGAAACTGCA AGGCAAGGAA CTCAGCTACA ATAATTTGGT TGACTTAGAA GCCGCCCGCC GCATTATTGC AGAGTTCACT GATACGCCAG CCGCCACGAT TATTAAACAT ACTAATCCCT GCGGTACGGC ATTGGCAGAT ACCATCGTGG AAGCTTATCA AAAAGCTTTT AATGCTGACG CTACTTCGGC ATTTGGGGGG ATTGTCGCCC TGAACCGCCC TATTGATGCA GCGACAGCCA GCGAGTTAAC CAAGACGTTT TTAGAATGTG TAGTTGCGCC TGATTGCGAT GCAGAAGCGC AAAAAATTCT GGCGAAGAAA TCTAATGTGC GGGTGTTGAC TTTAGCAGAT TTGAGTACAG GCCCCAAAAC TCTGGTAAAA CAAATTGCTG GCGGTTTCCT GGTGCAGGCT GCGGATGATA TTGCTGCTGA CACAATTCAA TGGCAAGTAG TTACAGAACG CCAACCTACT GCTGATGAAT TAGCAGAATT GTTATTTGCA TGGAAAGTCT GCAAACACGT TAAATCTAAT GCTATTGTTG TGACAAGCGA TCGCACTACT CTTGGTGTAG GTGCAGGACA AATGAACCGC ATTGGTTCAA CGAAAATTGC CCTAGAACAA GCAGGGGACA AAGCCAAAGG TGCAATCCTC GCCAGCGATG GATTTTTCCC CTTTGATGAT ACCGTGAGAA CCGCCGCCGC CGCCGGTATT AGCGCCATTG TCCAGCCAGG GGGAAGCCTG CGCGATCAAG ATTCTGTCAA GGCTGCCAAT GAACTCGGTT TGTTAATGGT GCTGACTGGG GTGCGGCATT TTTTACATTA G
|
Protein sequence | MARLALLSVS NKTGLIDLAR RLVEEFEFDL ISSGGTAQAL KDAGLPVTKV ADYTGSPEIL GGRVKTLHPR IHGGILARRD VPSDLTDLEN NQIRPIDLVV VNLYPFEETI AKPGVTLAEA VEQIDIGGPA MLRASSKNFA HLTVLCDPAQ YDEYLQELRQ NNGVASLEFR QKAALKGFLH TASYDSAIAS YLSGTQQHTL NGTELQSLRY GENPHQPAAW YQTGTTPTGW TAAKKLQGKE LSYNNLVDLE AARRIIAEFT DTPAATIIKH TNPCGTALAD TIVEAYQKAF NADATSAFGG IVALNRPIDA ATASELTKTF LECVVAPDCD AEAQKILAKK SNVRVLTLAD LSTGPKTLVK QIAGGFLVQA ADDIAADTIQ WQVVTERQPT ADELAELLFA WKVCKHVKSN AIVVTSDRTT LGVGAGQMNR IGSTKIALEQ AGDKAKGAIL ASDGFFPFDD TVRTAAAAGI SAIVQPGGSL RDQDSVKAAN ELGLLMVLTG VRHFLH
|
| |