Gene Haur_3772 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_3772 
Symbol 
ID5735636 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp4740394 
End bp4743273 
Gene Length2880 bp 
Protein Length959 aa 
Translation table11 
GC content55% 
IMG OID641280924 
Productphosphoribosylformylglycinamidine synthase II 
Protein accessionYP_001546536 
Protein GI159900289 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0046] Phosphoribosylformylglycinamidine (FGAM) synthase, synthetase domain 
TIGRFAM ID[TIGR01736] phosphoribosylformylglycinamidine synthase II 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.384894 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGTCCGCAT ATTTGGTCAC TGTTTGCCCA CGCGAGGCCG ACAACCACGA ACGGCTGTAC 
CTTCTCGCCG GCGATCTTTC CTCAGAAGAT GTCCAACGCC TGACCCTCGA ATTACTGCAT
GATCCCGTTG CCCATACTGC TACCTGGCAA GCGCTTGACG CTGAGCTAGC CACGCCCAAA
GCTGGAGCAT TGGTGGAGAT TGCCTTTCGT CCAGGCGTGA CCGATAACGA AGCTGAGACG
ATTTTAGTTG GCGCACGCCA TATTGGGATT AACGGCTTGA AACAGGCCAA AACCCTGCGT
CGCGTCTATG TGTCCGATGT GCAAGACGAA GCTGTTTTGC GTCAATTTGC TGGTGAACAT
CTTTTAAACG ATTTGATTGA AACTGCTTAT TCTACCCTTG AGGCTCGTAG CGCTGAACGT
TTGCAGTTTT ATCAACACTT ATTACAACTT CCAGCTCCTC ACACGCCCAC GATCACCCGC
GTGGCTTTGC GCGGAGTCAG CGATAGCGAA CTCGAACGGA TTAGCCGTGA AGGCATTTTG
GCCTTGAGTT TGGCTGAAAT GCAGGCAGTT CGCGATTATT TTGAGGATTT AGGCCGCGAC
CCAACCGATG GTGAGCTGGA AACCCTTGCT CAAACATGGT CGGAACATTG CCGCCATAAA
ACCTTCCGCG CCACGATCAG CTATCAACAA GTTGCCGCAG ATCAGGGAAT TGATGCGGCG
TTGCACCCAG CCTTGGCTGA ACTCAATGCC GCCAACGGGG CAACGATCAA TGGTTTGCTC
AATCACTATT TGCGTAGCGC TACCAACGCT GTTAGCAACG AGGCCTTGCT CTCGGCATTT
GTCGATAATG CTGGGATTGT GGCCTTCGAT GAGCAGTATG AAATTTCCTT CAAGGTCGAA
ACCCACAATC ACCCTTCAGC ACTAGAGCCA TTTGGCGGTG CAAATACTGG GGTTGGTGGG
GTTGTGCGCG ACGTGTTGGG GGTTTCAGCT AAGCCAATCG CCGTGACCGA TGTGTTGTGT
TTTGGCTACC CTGATTTGCC AGAAAGCGAG CTTTCGCAAG GTGTGCTGCA CCCACGGCGG
ATTCGCGAAG GTGTCGTGGC TGGGGTACGC GATTATGGCA ACAAACTAGG AATTCCCAAT
GTCAACGGGG CGGTTTGGTA TGACCATGGC TACACCGCCA ATCCATTGGT ATTCTGTGGC
ACGCTGGGCA TTGCACCACG CGGCAGCCAC CCACGCGGCG TTCAAGCAGG CGACGCAATT
GTCGTAATCG GCGGACGCAC TGGCCGCGAT GGCATTCACG GTGCAACCTT CTCGTCGGTT
GAATTAACTC ACGACACTGC TGAAACGGTT GGGGCGGCGG TGCAAATCGG CGATCCCGTC
ACCGAAAAAA CCGTGATCGA CGTGTTGTTG CAAGCCCGCG ATTTAGGTTT GTACAGCGCA
ATTACCGATT GTGGCGCGGG CGGGCTTTCC TCGGCGGTTG GCGAGATGGG CGAAGAAACT
GGCGCAGTTG TTGAATTACG CGATGTGCCG CTCAAATATG CTGGCTTGCA ACCATGGGAA
ATTTGGATCT CCGAAGCCCA AGAGCGCATG GTCGTTTCCG TGCCGCCGCA AAATGTCCAA
ACCTTGCTTG ATCTTTGCCG TGGCGAAGAT GTTGAGGCAA CCGTGATTGG TCACTTCACT
GCTGATGGTG TGCTGACAGT CAAGCACAAC CAATTAACCG TGGTTGAGCT GGATATGGCC
TTTTTGCATA GCGGCGGGGT GCAATTTAAG CTGAATGCTG ATTGGCAGCC AAGCCCAGCA
CCAGCCAGCC AACCAGCCAC TATCGATCAC ACGGCGCTAC TCAAGGCAAC GTTGGGACAG
CCAATCGTCG CCAGCAACGA AAATATTGTG CGCACCTACG ACCATGAAGT GCAGGCGGCC
ACCGTGCTCA AGCCCTTGGT TGGCGTGAAC GAAGATGGCC CAGGCGATGC TGGGGTATTA
CAACCACGGG TCGATTCAAA CCGTGGCGTG GTGCTTGGCT GTGGCCTGAA TCCGTTGTAT
GGCAAAATCG ATCCGTATTG GATGGCCTTA GCAGCGGTTG ATGAAGCCTT GCGCAACATC
GTCGCAGCTG GCGGCGACCC CGAACAAACC TGGATTTTGG ATAACTTCTG TTGGGGCGAC
CCCAAATTGC CTGACCGCTT GGCAGGCTTG GTGCGAGCTT CGGCTGGTTG TCACGATGCC
GCCTTGGCGT ATCGCACGCC CTTTATTTCG GGCAAAGATT CGCTCAACAA CGAATATCGC
GATGCAGAAG GCAAGCGCGT GGCGATTCCA CCAACCTTGC TGATTTCGGC CATGGCCTTA
GTACCCGATG TGTTGCAAAC AATCTCGATG GACGCGAAAG CCGCTGGCAA TGCGATCTAC
TTGGTTGGTT TGACTCACAA CGAACGCGGC GGGGCAGTCA GTGCTTTGGT TGGTGGCATC
GATAATGGCA ATCTGCCCAA GGTTAATCTG GCAACCGCGC CAAGCGTGCA TAAAGCGCTA
CATGCGGCAA TTCGCGCCAA TAGCGTGCGA GCCTGCCACG ATTTGAGTGA AGGTGGCTTG
GCGGTCGCCG CTGCCGAAAT GGCCTTTGCT GGCGGGTTTG GCTTGAGCTT GGAATTGAGC
GCTATGCCAA CATCTGGCAG TTTGAGCGCT GATGCCTTGT TGTGGAGCGA ATCGACCACC
CGTTTCTTGG TCGAGGTTGC CCCAGAGCAA GCCGCCAATT TCGAAGCTCA GTTGAGCAAC
ATCGCCTACG CCAAAATTGG CCAAGTGCTG GCTGAACCAC GCCTGATCAT CAACGATTTG
GCTGGCCAGC CGATTATCGA CAGCGATTTG GCAAGCCTCA AGGCTGCATG GCAAGCTTAA
 
Protein sequence
MSAYLVTVCP READNHERLY LLAGDLSSED VQRLTLELLH DPVAHTATWQ ALDAELATPK 
AGALVEIAFR PGVTDNEAET ILVGARHIGI NGLKQAKTLR RVYVSDVQDE AVLRQFAGEH
LLNDLIETAY STLEARSAER LQFYQHLLQL PAPHTPTITR VALRGVSDSE LERISREGIL
ALSLAEMQAV RDYFEDLGRD PTDGELETLA QTWSEHCRHK TFRATISYQQ VAADQGIDAA
LHPALAELNA ANGATINGLL NHYLRSATNA VSNEALLSAF VDNAGIVAFD EQYEISFKVE
THNHPSALEP FGGANTGVGG VVRDVLGVSA KPIAVTDVLC FGYPDLPESE LSQGVLHPRR
IREGVVAGVR DYGNKLGIPN VNGAVWYDHG YTANPLVFCG TLGIAPRGSH PRGVQAGDAI
VVIGGRTGRD GIHGATFSSV ELTHDTAETV GAAVQIGDPV TEKTVIDVLL QARDLGLYSA
ITDCGAGGLS SAVGEMGEET GAVVELRDVP LKYAGLQPWE IWISEAQERM VVSVPPQNVQ
TLLDLCRGED VEATVIGHFT ADGVLTVKHN QLTVVELDMA FLHSGGVQFK LNADWQPSPA
PASQPATIDH TALLKATLGQ PIVASNENIV RTYDHEVQAA TVLKPLVGVN EDGPGDAGVL
QPRVDSNRGV VLGCGLNPLY GKIDPYWMAL AAVDEALRNI VAAGGDPEQT WILDNFCWGD
PKLPDRLAGL VRASAGCHDA ALAYRTPFIS GKDSLNNEYR DAEGKRVAIP PTLLISAMAL
VPDVLQTISM DAKAAGNAIY LVGLTHNERG GAVSALVGGI DNGNLPKVNL ATAPSVHKAL
HAAIRANSVR ACHDLSEGGL AVAAAEMAFA GGFGLSLELS AMPTSGSLSA DALLWSESTT
RFLVEVAPEQ AANFEAQLSN IAYAKIGQVL AEPRLIINDL AGQPIIDSDL ASLKAAWQA