Gene Syncc9605_0243 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSyncc9605_0243 
SymbolpurH 
ID3736171 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSynechococcus sp. CC9605 
KingdomBacteria 
Replicon accessionNC_007516 
Strand
Start bp250916 
End bp252478 
Gene Length1563 bp 
Protein Length520 aa 
Translation table11 
GC content64% 
IMG OID637774824 
Productbifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase 
Protein accessionYP_380574 
Protein GI78211795 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful) 
TIGRFAM ID[TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value0.853465 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTCCTG TCGCTCTGCT GAGTGTTTCC GATAAGTCCG GGCTGGTGCC CCTGGCGGAG 
GCCCTGCATC GGACCCATGG CTATCAGCTG CTCTCCAGTG GGGGCACGGC CAAGGTGCTC
GAGCAGGCCG GCCTACCGGT GACCCGTGTG TCGGACCACA CCGGGGCTCC AGAAATTCTT
GGCGGTCGTG TGAAAACGCT CCATCCAAGG GTGCATGGCG GGATTTTGGC CAAGCGGGGT
GACGCGTCTC ACCAGGCCGA TCTTGAGCAG CAGAACATCG CCCCCATCGA TATGGTGGTG
GTCAACCTCT ATCCCTTCCG CGAAACGATT GCGCGGCCTG ACGTCACCTG GGATCAGGCG
ATCGAGAACA TCGACATCGG TGGCCCTGCC ATGGTGCGGG CGGCCGCCAA GAATCACGCC
GATGTGGCTG TCCTCACCAG TCCTGACCAA TACGACCGTC TGTTGACCGC CATGGCGGAG
TCGGGCGGGA GCGTGCCTTC GGCGCTGCGG CGCCAACTGG CCCTTGAAGC GTTCAATCAC
ACCGCGTCGT ACGACACCGC CATCGGCCGC TGGATGGCCG AGCAAGCCAC CGCAAAAGGC
TGCCCCTGGT TGGAGGCGGT GCCGCTGCGG CAGACCCTGC GGTATGGCGA AAATCCCCAC
CAGAAAGCGC GCTGGTTCAG CCATCCCAAA CAGGGTTGGG GTGGTGCCAT TCAGCTGCAG
GGCAAGGAGC TGAGCACTAA CAACCTTCTG GATCTCGAGG CGGCCCTCGC CACGGTGCGG
GAGTTCGGCT ACGGAGCCGA CGGCTCCGCA CCGGCGTCGC AACCCGCGGC CGTGGTCGTC
AAGCACACCA ATCCCTGTGG CGTGGCCGTC GGAGCTTCGA TGCCTGCAGC ACTGACGCGG
GCCCTGGATG CCGATCGGGT GAGTGCCTTC GGCGGCATCA TCGCCATGAA CGATGTGGTG
GAAGCAACGG CGGCCCGTGA GCTCACCAGC CTGTTCCTGG AATGCGTCGT GGCACCAGGT
TTCACGCCCG AAGCGCGGGA GGTGCTGGCG GCCAAAGCCA ATCTGCGCTT GTTGGAACTG
GCTCCGCAGG CCATTGATGT GGCTGGCCCC GATCACGTGC GGAGCATTCT GGGTGGTCTC
CTGGTTCAGG ATCTCGATGA CCAGGCGATC ACGCCGACCG ACTGGACCGT GGCCAGCCAG
CGGCCGCCCA CACCCCAGGA AAAGCTGGAC CTTGAATTTG CCTGGCGTTT GGTGCGTCAC
GTGCGCTCCA ACGCCATCGT TGTTGCCAAG GATGGGCAGA GCCTTGGCGT GGGTGCCGGG
CAGATGAATC GCGTGGGCTC CGCGCGGATT GCCCTGGAAG CTGCAGGTGA GAAAGCGCAG
GGAGCCGTTC TGGCAAGTGA TGGCTTCTTC CCGTTTGACG ACACAGTGCG TCTGGCTGCC
AGCCAGGGCA TCACCGCAGT GATTCATCCC GGCGGGAGCA TGCGCGATGG CGATTCGATC
AAAGCTTGCG ATGAGCTCGG CCTGGCGATG CAGCTCACGG GGCGCCGTCA TTTCCTGCAT
TGA
 
Protein sequence
MAPVALLSVS DKSGLVPLAE ALHRTHGYQL LSSGGTAKVL EQAGLPVTRV SDHTGAPEIL 
GGRVKTLHPR VHGGILAKRG DASHQADLEQ QNIAPIDMVV VNLYPFRETI ARPDVTWDQA
IENIDIGGPA MVRAAAKNHA DVAVLTSPDQ YDRLLTAMAE SGGSVPSALR RQLALEAFNH
TASYDTAIGR WMAEQATAKG CPWLEAVPLR QTLRYGENPH QKARWFSHPK QGWGGAIQLQ
GKELSTNNLL DLEAALATVR EFGYGADGSA PASQPAAVVV KHTNPCGVAV GASMPAALTR
ALDADRVSAF GGIIAMNDVV EATAARELTS LFLECVVAPG FTPEAREVLA AKANLRLLEL
APQAIDVAGP DHVRSILGGL LVQDLDDQAI TPTDWTVASQ RPPTPQEKLD LEFAWRLVRH
VRSNAIVVAK DGQSLGVGAG QMNRVGSARI ALEAAGEKAQ GAVLASDGFF PFDDTVRLAA
SQGITAVIHP GGSMRDGDSI KACDELGLAM QLTGRRHFLH