Gene Plut_0432 details

Gene Information

Locus tag: Plut_0432
Symbol: purH
ID: 3745358
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium luteolum DSM 273
Kingdom: Bacteria
Replicon accession: NC_007512
Strand:
Start bp: 507793
End bp: 509367
Gene Length: 1575 bp
Protein Length: 524 aa
Translation table: 11
GC content: 60%
IMG OID: 637768472
Product: bifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase
Protein accession: YP_374363
Protein GI: 78186320
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful)
TIGRFAM ID: [TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase
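
The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 507793
end_bp = 509367

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and encodes no amino acid.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1575
print(protein_length)  # 524
```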


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCTGATC CGCTTATCAA AAGGGCGCTG GTGTCGGTTT CCGACAAAAC CGGCATCGTC 
GACTTCTGCC GCGAGCTTTC CTCCCTCGGC GTTGAGATTT TTTCAACCGG GGGTACCCTC
AAAGCCCTGC AGGATTCCGG AGTCAAAGCG GCCTCGATCT CCCTCATCAC AGGGTTCCCC
GAAATCATGG ACGGACGGGT CAAGACCCTC CATCCGAAAA TCCACGGCGG CCTGCTTGCA
GTCAGGGACA ATGCGGATCA TGTGGCACAG GCAGTCGAGA ACGGCATCGG CTTCATCGAC
ATGGTTGTCG TCAACCTCTA TCCTTTTGAA GCGACGGTCG CCAGGCCCGG CGTTTCCTTC
GAAGATGCCA TCGAGAATAT CGACATCGGA GGGCCCTCGA TGCTGCGCAG CGCAGCCAAG
AACAACGAGT CGGTCACGGT TCTTACCGAC AGTGCCGATT ACCCCTGCGT CCTTGCAGAG
ATGCGTTCCT CCGGCGGCAG GACCACTCGT GCCACCAGGC TTCGCCTGGC CCGCCAGGTG
TTCCAGCTGA CCTCCCGCTA CGATGGTGCA ATCGCCCGGT ACCTCACCGG GGCTGAAGGT
GCCGCTCCCG CTGCAGCCGA GACCATGACG GTGAAGCTTG AGCGCGAACT CGACATGCGC
TACGGCGAGA ACCCCCACCA GAGCGCGGGA TTCTACACCC TCACCGACGG CGAGGGAACC
CGCTCGTTCG GAGACTATTT CGAGAAGCTC CACGGCAAGG AGCTCTCCTA CAACAACATG
CTCGACATTG CTGCGGCATC GGGACTCGTT GAGGAGTTCC GGGGAGAGGA GCCGTCGGTT
GTCATCATCA AACACACCAA CCCGTGCGGT GTCGCACAGG CCCCGACGCT CGTCGAAGCA
TGGCACAATG CGTTTGCAAC CGACACCCAG GCACCCTTCG GCGGCATCAT CGCCTTCAAC
CGCCCGCTCG ACATGGTGAC GGCAGAGGCG GTCAACGGCA TCTTCACCGA GATCCTCATC
GCTCCTTCCT ATGAAGAGGG CGTGCTCGAT CTGCTCATGA AGAAAAAGGA TCGCCGGCTT
CTCGTGCAGA AGCAGGCACT TCCGAAAGGC GGATGGGAGT TCAAGTCCAC TCCGTTCGGC
ATGCTCGTGC AGGAGCGCGA CAGCAAGATC GTTGCCAGGG AGGACCTCAA CGTGGTGACG
AAGCGCCAGC CGACCGAGGA GGAACTCGGT GACCTCATGT TCGCATGGAA AATCTGCAAG
CACATCAAGA GCAACACCAT CCTCTACGTT AAGAACCGTC GCACGTTCGG CGTGGGAGCC
GGTCAGATGT CGCGAGTTGA CTCCTCGAAG ATCGCACGAT GGAAGGCTTC GGAGGTCAAT
TTAGACCTCC ATGGATCGGT TGTGGCTTCC GATGCCTTCT TCCCGTTCGC CGACGGCCTG
CTTGCCGCAG CCGAAGCCGG TGTCACTGCG GTAATCCAGC CGGGTGGCTC CATCCGTGAC
AACGAGGTGA TCGAGGCGGC TGATGCCAAC AACCTCGCCA TGGTCTTCAC CGGCATGCGC
CACTTCAAGC ACTGA
 
Protein sequence
MSDPLIKRAL VSVSDKTGIV DFCRELSSLG VEIFSTGGTL KALQDSGVKA ASISLITGFP 
EIMDGRVKTL HPKIHGGLLA VRDNADHVAQ AVENGIGFID MVVVNLYPFE ATVARPGVSF
EDAIENIDIG GPSMLRSAAK NNESVTVLTD SADYPCVLAE MRSSGGRTTR ATRLRLARQV
FQLTSRYDGA IARYLTGAEG AAPAAAETMT VKLERELDMR YGENPHQSAG FYTLTDGEGT
RSFGDYFEKL HGKELSYNNM LDIAAASGLV EEFRGEEPSV VIIKHTNPCG VAQAPTLVEA
WHNAFATDTQ APFGGIIAFN RPLDMVTAEA VNGIFTEILI APSYEEGVLD LLMKKKDRRL
LVQKQALPKG GWEFKSTPFG MLVQERDSKI VAREDLNVVT KRQPTEEELG DLMFAWKICK
HIKSNTILYV KNRRTFGVGA GQMSRVDSSK IARWKASEVN LDLHGSVVAS DAFFPFADGL
LAAAEAGVTA VIQPGGSIRD NEVIEAADAN NLAMVFTGMR HFKH
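
The protein sequence corresponds to the gene sequence read codon by codon under translation table 11. A minimal sketch of that mapping follows. It relies on the fact that table 11 assigns the same amino acids to codons as the standard genetic code, and it uses only the first three 10-base blocks and the final line of the gene sequence above. The helper name `translate` and the constants are hypothetical and not part of the record.

```python
# Codon table laid out in the conventional T, C, A, G base order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna):
    """Translate in-frame DNA to protein, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":  # stop codon
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases and the final line of the gene sequence in this record.
first_blocks = "ATGTCTGATC CGCTTATCAA AAGGGCGCTG"
last_line = "CACTTCAAGC ACTGA"

print(translate(first_blocks))  # MSDPLIKRAL
print(translate(last_line))     # HFKH
```

The two outputs match the first ten and last four residues of the protein sequence above. The final line can be translated on its own because it begins at base 1561, which is in frame.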