Gene BBta_0323 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBBta_0323 
SymbolpurH 
ID5153405 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBradyrhizobium sp. BTAi1 
KingdomBacteria 
Replicon accessionNC_009485 
Strand
Start bp327673 
End bp329265 
Gene Length1593 bp 
Protein Length530 aa 
Translation table11 
GC content67% 
IMG OID640555347 
Productbifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase 
Protein accessionYP_001236521 
Protein GI148251936 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful) 
TIGRFAM ID[TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value0.234113 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACAAACC AGCTTCGTCC CGTTCATCGT GCTCTGCTCT CCGTGTCCGA CAAGACCGGC 
CTGGTCGAGT TCGCCCGCTC GCTCGCCGCA CGCGGCATCG AGCTGATCTC GACCGGCGGC
ACCGCCAAGG CGATTGCCGA TGCCGGCCTG AAGGTGAAGG ACGTCTCCGA CCTCACCGGC
TTTCCCGAGA TGATGGACGG CCGCGTCAAG ACCCTGCATC CGAAGGTTCA CGGCGGACTG
CTCGCGATCC GCGGCAATGA CGAACACGCC GAGGCGATGA AGACCCACGG CATCGCGCCG
ATCGACCTGC TGGTCGTCAA TCTCTACCCG TTCGAGGCCA CGGTCGAGCG CAGCGCGCCG
TTCAGCGACT GCATCGAGAA TATCGACATC GGCGGCCCGG CGATGATTCG CGCGGCGTCG
AAGAACCATG AGGATGTCGC CGTCGTCGTC GACGTCAATG ATTACGACGC CGTGCTGGAA
GACCTCGCCC GGCATGAGGG CTCGACCACG CTGCTGTTGC GCCGCCGCCT TGCGGCCAAG
GCCTATGCCC GCACCGCTGC CTATGATGCC GCGATCTCCA ACTGGTTCGC CGCCACCATT
CAGAACGACG CGCCGGATTA CCGCGCCTTC GGCGGCAGGC TGATCCAGTC GCTGCGGTAT
GGCGAGAACC CGCACCAGCA CGCCGCGTTC TACGCACTGC CCGGCCGCCG TCCCGGCGTC
GCCACGGCGC GCCAGGTGCA GGGCAAGGAG CTCTCCTACA ACAACATCAA CGACACCGAC
GCCGCCTATG AATGCATCGC CGAATTCGAC CCGGCGCGCA CCGCGGCCTG CGTGATCGTC
AAGCACGCCA ATCCGTGCGG CGTCGCCGAG GGCCCGGATC TGATCACCGC CTATCAGAAG
GCGCTGGCCT GCGATTCGAC CTCGGCATTC GGCGGCATCA TCGCGATGAA CCGCAAGCTC
GACGCCGCCA CCGCGCGCGC GATCACCGGC ATCTTCACCG AGGTGATCAT CGCGCCCGAC
GCGACCGAGG AGGCGATCGC GGTGATCGCG GCGCGCAAGA CTCTGCGGCT ACTGCTGGCC
GGCGCCCTGC CCGATCCGCG CGAGGCCGGC CTGACGGCAA AGACGGTTGC CGGCGGGCTC
CTGGTGCAGA GCCGCGACAA TGCCGTCGTC GATGACATGA CGCTGAAGGT CGTGACCAAG
CGGGCGCCGA CCGAGGCGGA GTTGCGCGAC CTGCGCTTCG CCTTCCGCGT CGCCAAACAC
GTGAAGTCGA ACACCATCAT CTATGCCAAG GATTCGGCCA CTGTCGGCAT CGGTGCGGGC
CAGATGAGCC GGGTCGATTC CGCCCGCATC GCGGCGCGCA AGGCGCTGGA TGCCGCGAGC
GAGCTGAAGC TCGCCGAGCC CCTGACCAAG GGATCGGTGG TCGCTTCCGA CGCCTTCTTC
CCGTTCGCCG ACGGCATGCT GGCCTGCATC GAGGCCGGCG CCACCGCAGT GATTCAGCCC
GGCGGATCGG TGCGCGACGA CGAGGTCATC AAGGCCGCCG ACGAGCACGG CATCGCCATG
GTGCTGACCG GCGTCCGGCA TTTCCGGCAC TGA
 
Protein sequence
MTNQLRPVHR ALLSVSDKTG LVEFARSLAA RGIELISTGG TAKAIADAGL KVKDVSDLTG 
FPEMMDGRVK TLHPKVHGGL LAIRGNDEHA EAMKTHGIAP IDLLVVNLYP FEATVERSAP
FSDCIENIDI GGPAMIRAAS KNHEDVAVVV DVNDYDAVLE DLARHEGSTT LLLRRRLAAK
AYARTAAYDA AISNWFAATI QNDAPDYRAF GGRLIQSLRY GENPHQHAAF YALPGRRPGV
ATARQVQGKE LSYNNINDTD AAYECIAEFD PARTAACVIV KHANPCGVAE GPDLITAYQK
ALACDSTSAF GGIIAMNRKL DAATARAITG IFTEVIIAPD ATEEAIAVIA ARKTLRLLLA
GALPDPREAG LTAKTVAGGL LVQSRDNAVV DDMTLKVVTK RAPTEAELRD LRFAFRVAKH
VKSNTIIYAK DSATVGIGAG QMSRVDSARI AARKALDAAS ELKLAEPLTK GSVVASDAFF
PFADGMLACI EAGATAVIQP GGSVRDDEVI KAADEHGIAM VLTGVRHFRH