Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_3772 |
Symbol | |
ID | 5735636 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 4740394 |
End bp | 4743273 |
Gene Length | 2880 bp |
Protein Length | 959 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 641280924 |
Product | phosphoribosylformylglycinamidine synthase II |
Protein accession | YP_001546536 |
Protein GI | 159900289 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0046] Phosphoribosylformylglycinamidine (FGAM) synthase, synthetase domain |
TIGRFAM ID | [TIGR01736] phosphoribosylformylglycinamidine synthase II |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.384894 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGTCCGCAT ATTTGGTCAC TGTTTGCCCA CGCGAGGCCG ACAACCACGA ACGGCTGTAC CTTCTCGCCG GCGATCTTTC CTCAGAAGAT GTCCAACGCC TGACCCTCGA ATTACTGCAT GATCCCGTTG CCCATACTGC TACCTGGCAA GCGCTTGACG CTGAGCTAGC CACGCCCAAA GCTGGAGCAT TGGTGGAGAT TGCCTTTCGT CCAGGCGTGA CCGATAACGA AGCTGAGACG ATTTTAGTTG GCGCACGCCA TATTGGGATT AACGGCTTGA AACAGGCCAA AACCCTGCGT CGCGTCTATG TGTCCGATGT GCAAGACGAA GCTGTTTTGC GTCAATTTGC TGGTGAACAT CTTTTAAACG ATTTGATTGA AACTGCTTAT TCTACCCTTG AGGCTCGTAG CGCTGAACGT TTGCAGTTTT ATCAACACTT ATTACAACTT CCAGCTCCTC ACACGCCCAC GATCACCCGC GTGGCTTTGC GCGGAGTCAG CGATAGCGAA CTCGAACGGA TTAGCCGTGA AGGCATTTTG GCCTTGAGTT TGGCTGAAAT GCAGGCAGTT CGCGATTATT TTGAGGATTT AGGCCGCGAC CCAACCGATG GTGAGCTGGA AACCCTTGCT CAAACATGGT CGGAACATTG CCGCCATAAA ACCTTCCGCG CCACGATCAG CTATCAACAA GTTGCCGCAG ATCAGGGAAT TGATGCGGCG TTGCACCCAG CCTTGGCTGA ACTCAATGCC GCCAACGGGG CAACGATCAA TGGTTTGCTC AATCACTATT TGCGTAGCGC TACCAACGCT GTTAGCAACG AGGCCTTGCT CTCGGCATTT GTCGATAATG CTGGGATTGT GGCCTTCGAT GAGCAGTATG AAATTTCCTT CAAGGTCGAA ACCCACAATC ACCCTTCAGC ACTAGAGCCA TTTGGCGGTG CAAATACTGG GGTTGGTGGG GTTGTGCGCG ACGTGTTGGG GGTTTCAGCT AAGCCAATCG CCGTGACCGA TGTGTTGTGT TTTGGCTACC CTGATTTGCC AGAAAGCGAG CTTTCGCAAG GTGTGCTGCA CCCACGGCGG ATTCGCGAAG GTGTCGTGGC TGGGGTACGC GATTATGGCA ACAAACTAGG AATTCCCAAT GTCAACGGGG CGGTTTGGTA TGACCATGGC TACACCGCCA ATCCATTGGT ATTCTGTGGC ACGCTGGGCA TTGCACCACG CGGCAGCCAC CCACGCGGCG TTCAAGCAGG CGACGCAATT GTCGTAATCG GCGGACGCAC TGGCCGCGAT GGCATTCACG GTGCAACCTT CTCGTCGGTT GAATTAACTC ACGACACTGC TGAAACGGTT GGGGCGGCGG TGCAAATCGG CGATCCCGTC ACCGAAAAAA CCGTGATCGA CGTGTTGTTG CAAGCCCGCG ATTTAGGTTT GTACAGCGCA ATTACCGATT GTGGCGCGGG CGGGCTTTCC TCGGCGGTTG GCGAGATGGG CGAAGAAACT GGCGCAGTTG TTGAATTACG CGATGTGCCG CTCAAATATG CTGGCTTGCA ACCATGGGAA ATTTGGATCT CCGAAGCCCA AGAGCGCATG GTCGTTTCCG TGCCGCCGCA AAATGTCCAA ACCTTGCTTG ATCTTTGCCG TGGCGAAGAT GTTGAGGCAA CCGTGATTGG TCACTTCACT GCTGATGGTG TGCTGACAGT CAAGCACAAC CAATTAACCG TGGTTGAGCT GGATATGGCC TTTTTGCATA GCGGCGGGGT GCAATTTAAG CTGAATGCTG ATTGGCAGCC AAGCCCAGCA CCAGCCAGCC AACCAGCCAC TATCGATCAC ACGGCGCTAC TCAAGGCAAC GTTGGGACAG CCAATCGTCG CCAGCAACGA AAATATTGTG CGCACCTACG ACCATGAAGT GCAGGCGGCC ACCGTGCTCA AGCCCTTGGT TGGCGTGAAC GAAGATGGCC CAGGCGATGC TGGGGTATTA CAACCACGGG TCGATTCAAA CCGTGGCGTG GTGCTTGGCT GTGGCCTGAA TCCGTTGTAT GGCAAAATCG ATCCGTATTG GATGGCCTTA GCAGCGGTTG ATGAAGCCTT GCGCAACATC GTCGCAGCTG GCGGCGACCC CGAACAAACC TGGATTTTGG ATAACTTCTG TTGGGGCGAC CCCAAATTGC CTGACCGCTT GGCAGGCTTG GTGCGAGCTT CGGCTGGTTG TCACGATGCC GCCTTGGCGT ATCGCACGCC CTTTATTTCG GGCAAAGATT CGCTCAACAA CGAATATCGC GATGCAGAAG GCAAGCGCGT GGCGATTCCA CCAACCTTGC TGATTTCGGC CATGGCCTTA GTACCCGATG TGTTGCAAAC AATCTCGATG GACGCGAAAG CCGCTGGCAA TGCGATCTAC TTGGTTGGTT TGACTCACAA CGAACGCGGC GGGGCAGTCA GTGCTTTGGT TGGTGGCATC GATAATGGCA ATCTGCCCAA GGTTAATCTG GCAACCGCGC CAAGCGTGCA TAAAGCGCTA CATGCGGCAA TTCGCGCCAA TAGCGTGCGA GCCTGCCACG ATTTGAGTGA AGGTGGCTTG GCGGTCGCCG CTGCCGAAAT GGCCTTTGCT GGCGGGTTTG GCTTGAGCTT GGAATTGAGC GCTATGCCAA CATCTGGCAG TTTGAGCGCT GATGCCTTGT TGTGGAGCGA ATCGACCACC CGTTTCTTGG TCGAGGTTGC CCCAGAGCAA GCCGCCAATT TCGAAGCTCA GTTGAGCAAC ATCGCCTACG CCAAAATTGG CCAAGTGCTG GCTGAACCAC GCCTGATCAT CAACGATTTG GCTGGCCAGC CGATTATCGA CAGCGATTTG GCAAGCCTCA AGGCTGCATG GCAAGCTTAA
|
Protein sequence | MSAYLVTVCP READNHERLY LLAGDLSSED VQRLTLELLH DPVAHTATWQ ALDAELATPK AGALVEIAFR PGVTDNEAET ILVGARHIGI NGLKQAKTLR RVYVSDVQDE AVLRQFAGEH LLNDLIETAY STLEARSAER LQFYQHLLQL PAPHTPTITR VALRGVSDSE LERISREGIL ALSLAEMQAV RDYFEDLGRD PTDGELETLA QTWSEHCRHK TFRATISYQQ VAADQGIDAA LHPALAELNA ANGATINGLL NHYLRSATNA VSNEALLSAF VDNAGIVAFD EQYEISFKVE THNHPSALEP FGGANTGVGG VVRDVLGVSA KPIAVTDVLC FGYPDLPESE LSQGVLHPRR IREGVVAGVR DYGNKLGIPN VNGAVWYDHG YTANPLVFCG TLGIAPRGSH PRGVQAGDAI VVIGGRTGRD GIHGATFSSV ELTHDTAETV GAAVQIGDPV TEKTVIDVLL QARDLGLYSA ITDCGAGGLS SAVGEMGEET GAVVELRDVP LKYAGLQPWE IWISEAQERM VVSVPPQNVQ TLLDLCRGED VEATVIGHFT ADGVLTVKHN QLTVVELDMA FLHSGGVQFK LNADWQPSPA PASQPATIDH TALLKATLGQ PIVASNENIV RTYDHEVQAA TVLKPLVGVN EDGPGDAGVL QPRVDSNRGV VLGCGLNPLY GKIDPYWMAL AAVDEALRNI VAAGGDPEQT WILDNFCWGD PKLPDRLAGL VRASAGCHDA ALAYRTPFIS GKDSLNNEYR DAEGKRVAIP PTLLISAMAL VPDVLQTISM DAKAAGNAIY LVGLTHNERG GAVSALVGGI DNGNLPKVNL ATAPSVHKAL HAAIRANSVR ACHDLSEGGL AVAAAEMAFA GGFGLSLELS AMPTSGSLSA DALLWSESTT RFLVEVAPEQ AANFEAQLSN IAYAKIGQVL AEPRLIINDL AGQPIIDSDL ASLKAAWQA
|
| |