Gene Ava_3818 details


Gene Information

Locus tag: Ava_3818
Symbol: purH
ID: 3678750
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anabaena variabilis ATCC 29413
Kingdom: Bacteria
Replicon accession: NC_007413
Strand:
Start bp: 4750459
End bp: 4751979
Gene Length: 1521 bp
Protein Length: 506 aa
Translation table: 11
GC content: 48%
IMG OID: 637719170
Product: bifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase
Protein accession: YP_324318
Protein GI: 75910022
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful)
TIGRFAM ID: [TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase
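
Note: the gene and protein lengths listed above follow from the coordinates. The sketch below reproduces that arithmetic. It assumes the start and end positions are 1-based and inclusive (consistent with the stated 1521 bp) and that the final codon is a stop codon that is not counted as a residue. The function names are illustrative only and are not part of any database API.

```python
# Coordinates copied from the Gene Information fields.
# Assumption: 1-based, inclusive positions on the replicon.
START_BP = 4750459
END_BP = 4751979


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1  # drop the untranslated stop codon


if __name__ == "__main__":
    n = gene_length(START_BP, END_BP)
    print(n, protein_length(n))  # 1521 506
```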


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00000025454
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGCGTC TAGCACTGCT GAGTGTATCT AACAAGACTG GTTTAATTGA CTTAGCTCGT 
CGCTTGGTAG AAGAATTTGA GTTTGATTTA ATCAGCAGTG GGGGGACAGC CCAAGCCCTC
AAGGATGCGG GTTTACCTGT GACGAAGGTT GCAGATTACA CGGGTTCGCC AGAGATTTTG
GGTGGACGGG TGAAAACTCT CCATCCCCGG ATTCATGGGG GGATTTTGGC TAGGCGGGAT
GTACCTAGTG ATTTGACGGA TTTGGAAAAT AACCAAATTC GCCCGATTGA TTTGGTGGTG
GTGAATCTTT ACCCGTTTGA GGAGACTATT GCTAAACCAG GGGTGACGTT GGCGGAAGCT
GTGGAACAAA TTGATATTGG CGGCCCGGCG ATGTTACGGG CATCATCAAA GAATTTTGCC
CATCTAACAG TCTTATGTGA TCCGGCGCAG TATGATGAAT ATCTGCAAGA ATTACGACAA
AATAACGGAG TAGCTTCCTT AGAATTTCGG CAAAAGGCAG CTTTGAAGGG GTTTTTGCAT
ACGGCGAGTT ATGATAGTGC CATTGCCTCT TACCTCTCAG GTACACAACA GCATACCCTC
AACGGTACAG AATTACAATC TCTGCGTTAC GGTGAGAATC CCCATCAGCC CGCAGCTTGG
TATCAAACTG GAACTACGCC AACAGGGTGG ACGGCAGCCA AGAAACTGCA AGGCAAGGAA
CTCAGCTACA ATAATTTGGT TGACTTAGAA GCCGCCCGCC GCATTATTGC AGAGTTCACT
GATACGCCAG CCGCCACGAT TATTAAACAT ACTAATCCCT GCGGTACGGC ATTGGCAGAT
ACCATCGTGG AAGCTTATCA AAAAGCTTTT AATGCTGACG CTACTTCGGC ATTTGGGGGG
ATTGTCGCCC TGAACCGCCC TATTGATGCA GCGACAGCCA GCGAGTTAAC CAAGACGTTT
TTAGAATGTG TAGTTGCGCC TGATTGCGAT GCAGAAGCGC AAAAAATTCT GGCGAAGAAA
TCTAATGTGC GGGTGTTGAC TTTAGCAGAT TTGAGTACAG GCCCCAAAAC TCTGGTAAAA
CAAATTGCTG GCGGTTTCCT GGTGCAGGCT GCGGATGATA TTGCTGCTGA CACAATTCAA
TGGCAAGTAG TTACAGAACG CCAACCTACT GCTGATGAAT TAGCAGAATT GTTATTTGCA
TGGAAAGTCT GCAAACACGT TAAATCTAAT GCTATTGTTG TGACAAGCGA TCGCACTACT
CTTGGTGTAG GTGCAGGACA AATGAACCGC ATTGGTTCAA CGAAAATTGC CCTAGAACAA
GCAGGGGACA AAGCCAAAGG TGCAATCCTC GCCAGCGATG GATTTTTCCC CTTTGATGAT
ACCGTGAGAA CCGCCGCCGC CGCCGGTATT AGCGCCATTG TCCAGCCAGG GGGAAGCCTG
CGCGATCAAG ATTCTGTCAA GGCTGCCAAT GAACTCGGTT TGTTAATGGT GCTGACTGGG
GTGCGGCATT TTTTACATTA G
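
Note: the GC content field can be recomputed from the gene sequence as printed here. The following is a minimal sketch, assuming the sequence is pasted as a string with the 10-base spacing and line breaks left in. `gc_content` is a hypothetical helper name, and the result is a fraction rather than a rounded percentage.

```python
def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence.

    Whitespace (the 10-base grouping and line breaks used in this
    record) is removed before counting.
    """
    cleaned = "".join(seq.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    gc = sum(1 for base in cleaned if base in "GC")
    return gc / len(cleaned)


if __name__ == "__main__":
    # First 10-base group of the gene sequence, used only as a small example.
    print(gc_content("ATGGCGCGTC"))
```

Passing the full gene sequence to `gc_content` and multiplying by 100 gives a percentage that can be compared with the value reported in the record.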
 
Protein sequence
MARLALLSVS NKTGLIDLAR RLVEEFEFDL ISSGGTAQAL KDAGLPVTKV ADYTGSPEIL 
GGRVKTLHPR IHGGILARRD VPSDLTDLEN NQIRPIDLVV VNLYPFEETI AKPGVTLAEA
VEQIDIGGPA MLRASSKNFA HLTVLCDPAQ YDEYLQELRQ NNGVASLEFR QKAALKGFLH
TASYDSAIAS YLSGTQQHTL NGTELQSLRY GENPHQPAAW YQTGTTPTGW TAAKKLQGKE
LSYNNLVDLE AARRIIAEFT DTPAATIIKH TNPCGTALAD TIVEAYQKAF NADATSAFGG
IVALNRPIDA ATASELTKTF LECVVAPDCD AEAQKILAKK SNVRVLTLAD LSTGPKTLVK
QIAGGFLVQA ADDIAADTIQ WQVVTERQPT ADELAELLFA WKVCKHVKSN AIVVTSDRTT
LGVGAGQMNR IGSTKIALEQ AGDKAKGAIL ASDGFFPFDD TVRTAAAAGI SAIVQPGGSL
RDQDSVKAAN ELGLLMVLTG VRHFLH
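
Note: the protein sequence corresponds to a translation of the gene sequence under translation table 11. The sketch below is one way to check that, assuming the standard codon assignments (which table 11 shares with the standard code; the two differ only in which codons may act as start codons). It does not special-case alternative start codons, which is sufficient here because the gene begins with ATG. `translate` and `CODON_TABLE` are illustrative names, not a library API.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(seq: str) -> str:
    """Translate a coding sequence in frame, stopping at the first stop codon.

    Whitespace in the input (10-base groups, line breaks) is ignored.
    Assumption: no alternative-start handling; every codon is read
    with its ordinary amino-acid assignment.
    """
    cleaned = "".join(seq.split()).upper()
    residues = []
    for i in range(0, len(cleaned) - 2, 3):
        aa = CODON_TABLE[cleaned[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    # First line of the gene sequence from this record.
    first_line = ("ATGGCGCGTC TAGCACTGCT GAGTGTATCT "
                  "AACAAGACTG GTTTAATTGA CTTAGCTCGT")
    print(translate(first_line))  # MARLALLSVSNKTGLIDLAR
```

The first 60 bases yield the first 20 residues of the protein sequence, and the last 21 bases (`GTGCGGCATT TTTTACATTA G`) yield the final six residues followed by the stop codon.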