Gene Francci3_0657 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_0657 
SymbolpurH 
ID3902991 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp751457 
End bp753103 
Gene Length1647 bp 
Protein Length548 aa 
Translation table11 
GC content71% 
IMG OID637877990 
Productbifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase 
Protein accessionYP_479770 
Protein GI86739370 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful) 
TIGRFAM ID[TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0944398 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGTCAT CGGGCGAAGG TGTGGCCGGC GAGGGCGGGG CTGTCCGCGG ATCCGAACCC 
GGCGAAGTGG TCCCCACCGG GCGGCGGCCG CTGCGACGGG CGCTTGTCAG CGTCTACGAC
AAGAGCGGGC TCGACGTCCT CGCGGAGGCG TTCCTTGCGG CCGACGTCGA GGTGGTCTCG
ACCGGTTCGA CCGCCGACGT CCTGGCCCGT CACGGGCTGG CGGTCACACC GGTGAGCACC
GTGACCGGAT TTCCGGAGGT GCTGGGCGGT CGCGTCAAGA CGCTGCACCC CCACGTGCAC
GCCGGTCTGT TGGCCGATCT GCGTAACGCC GAGCACGCCG CGGTGTTGGC CGAACTCGAC
ATCGCTCCGT TCGACCTGGT CGTCGTCAAT CTGTACCCGT TCGCCGCGAC AGTCGCCGCC
GGTGCGAGCG AGGACGAGGC GATCGAGCAG ATCGACATCG GCGGTCCGGC CATGATCCGC
GCGGCGGCGA AGAACCATGC GTCGGTCGCC GTTGTCGTCG CACCCGGCGA CTACGCCGAG
CTGGCCGCCG CAGTCCGCGG ATCCGGATAT GATCTTCCCG CTCGCCGCCG GCTCGCCGCG
AAGGCGTTCG CCCACACCGC GGCGTACGAC ATCGCCGTGT CCTCGTGGTT CGCCGGCGTC
GTCGCGCCGG ACGAGGTGGC GCGGGAGAGC GGATGGCCCG ACGTGCTGTC CGCGCAGTGG
CACCGTACGG AGGTCCTGCG TTACGGCGAG AACCCCCATC AGCGCGCCGC GCTCTACGTG
GAGAGCGACG CCGAGGGTCG GCCCGGCCTC GCCTCGGCCC GTCAGCTGCA CGGCAAGCAG
ATGTCCTACA ACAACTACAC CGACACCGAC GCCGCCCGCC GAGCGGTGTT CGACTTCACC
GAGCCCGCCG TGGCCGTGAT CAAGCACGCC AACCCCTGCG GCATCGCGAT CGGCGCCACC
ATCGCCGAGG CCCACCGCAA GGCGCATGCC TGCGACCCGG TCTCCGCCTT CGGCGGGGTG
ATCGCGACCA ACCGTCCGGT CTCGGGCGAG CTCGCCGAAC AGATCGCGGA GATCTTCACC
GAGGTCGTCG TCGCACCGGC CTACGAACCC GCCGCGGTGG AGATCCTCTC TCGTAAGCCG
TCGATCCGGC TGCTGGAGTG CCCACCGCCG CCGCACCAGC GCGGGATCGA ACTGCGCCAG
ATCAGCGGAG GCCTGCTCCT GCAGTCGCGG GACGCCGTCG ATGCGCCGGG CGACGAGCCG
TCCGGATGGA CGCTGGAGGC GGGATCGCCT GCGGACGAGG CCCTGCTGGC CGAGTTGCGG
TTCGCCTGGC GGGCGGTGCG CTCCGTGAAG TCGAACGCCA TCCTCCTCGC GTCCGGCGGT
GCCACCGTCG GAGTCGGGAT GGGCCAGGTG AACCGGGTGG ACGCTGCCCG GCTCGCGGTG
ACCCGGGCAG GGGACCGGGC GAAGGGGGCT GTCGCCGCGA GCGACGCCTA TTTCCCGTTC
CCCGACGGTT TCGAGGTGCT CGCCGAGGCG GGGGTGCGGG CCGTGGTCGA ACCGGGCGGG
TCGGTGCGCG ACGAGCTCGT CATCACGGCT GCCCGGGAGG CCGGCGTCAC GCTCTACTTC
AGCGGTGTCC GCCACTTCGC GCACTGA
 
Protein sequence
MTSSGEGVAG EGGAVRGSEP GEVVPTGRRP LRRALVSVYD KSGLDVLAEA FLAADVEVVS 
TGSTADVLAR HGLAVTPVST VTGFPEVLGG RVKTLHPHVH AGLLADLRNA EHAAVLAELD
IAPFDLVVVN LYPFAATVAA GASEDEAIEQ IDIGGPAMIR AAAKNHASVA VVVAPGDYAE
LAAAVRGSGY DLPARRRLAA KAFAHTAAYD IAVSSWFAGV VAPDEVARES GWPDVLSAQW
HRTEVLRYGE NPHQRAALYV ESDAEGRPGL ASARQLHGKQ MSYNNYTDTD AARRAVFDFT
EPAVAVIKHA NPCGIAIGAT IAEAHRKAHA CDPVSAFGGV IATNRPVSGE LAEQIAEIFT
EVVVAPAYEP AAVEILSRKP SIRLLECPPP PHQRGIELRQ ISGGLLLQSR DAVDAPGDEP
SGWTLEAGSP ADEALLAELR FAWRAVRSVK SNAILLASGG ATVGVGMGQV NRVDAARLAV
TRAGDRAKGA VAASDAYFPF PDGFEVLAEA GVRAVVEPGG SVRDELVITA AREAGVTLYF
SGVRHFAH