Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_2746 |
Symbol | |
ID | 5734627 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 3500668 |
End bp | 3502188 |
Gene Length | 1521 bp |
Protein Length | 506 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641279889 |
Product | phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase |
Protein accession | YP_001545512 |
Protein GI | 159899265 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful) |
TIGRFAM ID | [TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.221378 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGAGCTA TTGTCAGTGT TTCAGACAAA TCGGGGTTGG CGGATTTTGC TCAGGGCTTG AGCGATTTGG GCATTGAATT ATTTTCAACT GGCGGTACCA AAGCCGCGTT GGTTGGGGCT GGTGTGCCAG TCCGCAGTGT CAGCGATTTG ACGGGCTTTC CTGAAATTTT GGAAGGTCGG GTCAAAACCT TGCATCCTGG TGTGCATGCT GGCATTTTGG CCCGCCGTGA TAAGCCAGCC CATCTGGCCC AATTGGCTGA GCACCACATC GGCCAGATCG ATTTGGTAGT TGTCAATTTG TATCCGTTTG CTCAAACCAT CGCCAAACCT GATGTTACTT TAGAAGAAGC AGTTGAACAG ATTGATATTG GTGGCCCAAC GATGGTGCGA GCTTCGGCCA AAAATCATGC CCATGTCTTG ATTGTGATCG ATCCTGCTGA TTATCCACGG GTTTTGGAAG CCTTGCGGAC TGAGCAAATT ACGCCTGAAT TGCGCCGCCA ACTTGCTGCC AAAGCCTTTG CCCATACCGC AGCCTACGAT AGCGCGATTG CCGCCTACTT AACCGACGAA ACGTTCCCGC AGCAATTGCC CTTGGCCTGG GAATTAGCCC AATCGCTGCG TTATGGCGAG AATCCGCATC AAGCAGCGGC GTTTTATCGC GCTCCCAATG CTGCCGCCAA CACCTTGGCA AAAGCAGTGC AACATCAAGG CAAAGAGCTT TCCTACAACA ACTTGCTTGA TGCTGATGCT ACCTTGCAAG TAATTCAAAA TTTTGATCAG CCAACCGTGG CGATTATCAA GCACACTAAT CCTTGTGGCC TGGCTTCGGC TGAGGATTTG GTGGCGGCCC ACAAAGCGGC GCGGGCGGGT GATCCGCTTT CGGCGTTTGG TGGCATCGTC GGGGTCAATC GCCCCGTCGA TCGGGCTTTA GCCAATGTGC TGAAAAAATA TTTTTACGAA GTGATTATTG CCCCATCCTT TAGCCCTGAA GCCTTGACAA TTTTGGCCGA AAAGCCCAAT TTACGCTTGT TGAGCGTCGA TACCAGCCGC TCAAGCAGCA ACGATTGGGA ATATCGCAGT ATTGGCGGCG GCATTTTGGC CCAACATGTT GATCGCGTTG GTAATGATCG CTGGGATGCT TGGCAGGTTG TTACTGAAAC TGTGCCCAGC GATGAGCAAC TGGCAGCCTT GCAATTTGCT TGGAAAGCTT GTGCCAGCGT GAAATCGAAT GCGATTGTCT TGGTGCAAGG CGAGGAATTG GTGGGGATGG GCGCTGGTCA GCCTTCACGG GTTGATTCGG TATTGACGGC GATTCGCAAG GCGGGCGAAC GGGCCAAGGG TAGCGTGCTA GCTTCCGATG CCTTCTTCCC CAAAGCCGAT GGAATTCAGG CCGCGATTGA AGCTGGCGTG AGCGCAATTG TTCAGCCTGG TGGCTCGCAA GGTGATGATG AAGTGATTGC TGCTGCTAAC GCCGCAGGCA TCGCGATGAT CTTCACTGCT ACTCGCCACT TCAAACACTA A
|
Protein sequence | MRAIVSVSDK SGLADFAQGL SDLGIELFST GGTKAALVGA GVPVRSVSDL TGFPEILEGR VKTLHPGVHA GILARRDKPA HLAQLAEHHI GQIDLVVVNL YPFAQTIAKP DVTLEEAVEQ IDIGGPTMVR ASAKNHAHVL IVIDPADYPR VLEALRTEQI TPELRRQLAA KAFAHTAAYD SAIAAYLTDE TFPQQLPLAW ELAQSLRYGE NPHQAAAFYR APNAAANTLA KAVQHQGKEL SYNNLLDADA TLQVIQNFDQ PTVAIIKHTN PCGLASAEDL VAAHKAARAG DPLSAFGGIV GVNRPVDRAL ANVLKKYFYE VIIAPSFSPE ALTILAEKPN LRLLSVDTSR SSSNDWEYRS IGGGILAQHV DRVGNDRWDA WQVVTETVPS DEQLAALQFA WKACASVKSN AIVLVQGEEL VGMGAGQPSR VDSVLTAIRK AGERAKGSVL ASDAFFPKAD GIQAAIEAGV SAIVQPGGSQ GDDEVIAAAN AAGIAMIFTA TRHFKH
|
| |