Gene Synpcc7942_0396 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSynpcc7942_0396 
SymbolpurH 
ID3774917 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSynechococcus elongatus PCC 7942 
KingdomBacteria 
Replicon accessionNC_007604 
Strand
Start bp389953 
End bp391494 
Gene Length1542 bp 
Protein Length513 aa 
Translation table11 
GC content57% 
IMG OID637798802 
Productbifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase 
Protein accessionYP_399415 
Protein GI81299207 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful) 
TIGRFAM ID[TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase 


Plasmid Coverage information

Num covering plasmid clones45 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones40 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGCGAT TTGCCCTGCT CAGTGTTTCC GATAAAACTG GCCTCGTTGA TTTTGCACGT 
CAACTGGTCG ATCGCTTTCA GTTTCAGATT GTCAGTAGCG GTGGCACCGC CAAGCAACTC
CTAGAAGCAG GTATCCCCGT TACGAAAGTT GCAGAACACA CGGGCTCACC CGAAATCCTC
GGTGGCCGAG TCAAAACCTT GCATCCCCGC ATTCATGGCG GCATTTTGGC GCGCCGCGAT
CGCGAAGAGG ATCAAGCAGA TTTAGCGGCC AACAACATTC AGCCGTTTGA CTTAGTCGTC
GTCAATCTCT ATCCCTTCGA GGCCACGATC GCCCGTCCTG AAGTGACGCT GGCAGATGCG
ATCGAGCAAA TTGACATCGG CGGGCCAGCA ATGGTGCGGG CCTCGGCCAA GAACCATGCT
CACCTAACGA TTCTGACTAA CCCCAGCCAG TACGAGCCCT ATCTGACGGC CCTGGCCGAT
GGAGAGGGAC AAATTCCCCT CGCGTTCCGC CAGCAGTGCG CTCTAGCGGC TTTCCAGCAC
ACCGCTGCTT ATGATGCGGC GATCGCGACT TATCTCGCTG AGCAATTTGA AGCGACCAGC
GATCGCTTGC AACTGAGTGC TCAGCCGGTG CAAGTCCTGC GCTACGGCGA AAACCCCCAC
CAGGCGGCGA CTTGGTATCA AACCGGTGCT ACGGCCAGCG GTTGGGCAGC GGCGCAGCAA
CTACAGGGCA AAGAGCTGAG CTACAACAAC CTAGTTGACC TCGAGGCAGC ACGCCAAATT
ATTGCGGAGT TCCCGGCGGA TGGCCCCGCT GCTGCCGCGA TTCTCAAACA CAATAATCCC
TGCGGAGTCG CCACAGCTGA GGCATTGAGT GATGCCTATC AAAAAGCGTT TGACGCGGAC
TCCGTCTCTG CTTTCGGGGG CATTGTGGCC CTGAATCGGG CGATCGATGC GGCAACTGCA
ACGGCGATGA CCGGCACCTT CCTCGAATGT ATTGTTGCCC CATCGGTTGA GCCAGCAGCG
GCTGAGATTC TCGCAGCTAA GAAAAACCTA CGGGTACTCA CCCTGGCAGA TTTCAATAGC
GGTCCGCAGC AAACTGTGCG ATCGATCGCC GGCGGATTCT TGGTGCAGGA CAGCGATGAT
CAGCTGGAAA CCGTTGATGC TTGGCAGGTC GTCACGGAAC AGCAGCCTAG CGAAGCCGAT
TGGCAGGAGT TGCTGTTTGC TTGGAAGGTG GTCAAACATG TCAAATCCAA TGCGATCGCG
GTGACGGCGA ATGGCGTCAC GCTCGGGATT GGTGCCGGTC AAATGAATCG GGTCGGTTCC
GTCAAAATTG CCTTGGAACA AGCGGGCGAT CGCGCTCAAG GTGCCATTCT TGCCAGCGAT
GGTTTCTTTC CCTTTGATGA CAGTGTTCGT ACCGCAGCGG CAGCTGGAAT TCGGGCGATC
GTGCAACCCG GCGGGAGTCT GCGTGATGCT GATTCAATTG CAGCGGCGAA CGAACTTGGC
CTTGTGATGG TCTTTACTGG CACGCGCCAC TTCCTCCACT AA
 
Protein sequence
MPRFALLSVS DKTGLVDFAR QLVDRFQFQI VSSGGTAKQL LEAGIPVTKV AEHTGSPEIL 
GGRVKTLHPR IHGGILARRD REEDQADLAA NNIQPFDLVV VNLYPFEATI ARPEVTLADA
IEQIDIGGPA MVRASAKNHA HLTILTNPSQ YEPYLTALAD GEGQIPLAFR QQCALAAFQH
TAAYDAAIAT YLAEQFEATS DRLQLSAQPV QVLRYGENPH QAATWYQTGA TASGWAAAQQ
LQGKELSYNN LVDLEAARQI IAEFPADGPA AAAILKHNNP CGVATAEALS DAYQKAFDAD
SVSAFGGIVA LNRAIDAATA TAMTGTFLEC IVAPSVEPAA AEILAAKKNL RVLTLADFNS
GPQQTVRSIA GGFLVQDSDD QLETVDAWQV VTEQQPSEAD WQELLFAWKV VKHVKSNAIA
VTANGVTLGI GAGQMNRVGS VKIALEQAGD RAQGAILASD GFFPFDDSVR TAAAAGIRAI
VQPGGSLRDA DSIAAANELG LVMVFTGTRH FLH