Gene Syncc9902_0272 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSyncc9902_0272 
SymbolpurH 
ID3742317 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSynechococcus sp. CC9902 
KingdomBacteria 
Replicon accessionNC_007513 
Strand
Start bp284925 
End bp286487 
Gene Length1563 bp 
Protein Length520 aa 
Translation table11 
GC content58% 
IMG OID637770439 
Productbifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase 
Protein accessionYP_376288 
Protein GI78183854 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful) 
TIGRFAM ID[TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCTCCTG TCGCTCTGCT GAGTGTGTCC GACAAATCTG GGCTTTTGCC GTTGGCCGAG 
GCTCTGCATC GAATCCATGG CTATCAATTG CTCTCCAGTG GTGGCACCGC CAAGGTGCTT
GAGCAGGCTG GCCTTCCGGT AACCCGTGTT TCCGAGTACA CCGGGGCCCC TGAGATTTTG
GGTGGCCGCG TCAAAACGCT GCATCCCCGT GTTCACGGTG GAATTTTGGC CAAGCGTGGT
GATGCGGCCC ATCAGAACGA CCTTGAACAA CAGAACATCA ACTTCATTGA TGTGGTGGTC
GTGAATCTGT ATCCCTTCCG GGAAACAGTT GCCAAGGCTG ATGTCACTTG GGATCAAGCG
ATTGAAAACA TTGATATTGG TGGGCCCACC ATGGTGCGCT CTGCCGCGAA AAATCATGCC
GACGTTGCTG TTCTGACCAG TCCAGATCAG TACGACCGTT TGCTTGAAGC GATGGCTCAA
GCCGGTGGTG AGGTGCCGGC GGCATTACGC CGTCAGCTTG CTCTTGAAGC CTTCCAGCAC
ACTGCGGCCT ACGACACCGC CATTAGCCGC TGGATGGACC AGGCGGTGGC CGCAGATGGA
TCCCCTTGGC TTGAGGCGGT TCCTCTGCGT CAAACCTTGC GCTACGGCGA GAACCCTCAT
CAGAAAGCCC GTTGGTATAG CCATGCCCAG CAGGGATGGG GCGGTGCGGT TCAACTGCAA
GGCAAGGAAC TGAGTACGAA CAATCTGTTG GATCTCGAAG CTGCTCTCGC CATGGTTCGG
GAGTTTGGCT ACGGCTCTGA TGGCGCTGAG CCGGCTGTTC AGCCAGCCGC GGTGGTGGTG
AAACACACCA ATCCCTGTGG TGTTGCCATC GGATCGGATG TGTCAACTGC ACTCACGAGG
GCCTTGGATG CTGATCGAGT CAGTGCCTTT GGGGGAATCG TCGCCATCAA TGGCGTGGTG
AGCGCCGCAG CGGCAGGGGA ACTGAAAAGC TTGTTTTTGG AATGCGTCGT GGCGCCAAGC
TTTTCTCCAG AAGCCAGAGA GATTCTTGCG GCCAAAGCGA ATCTGCGTTT GCTGGAGCTC
CAGCCTGCCG CGATCGATGC GGCGGGCCCC GACCACGTCC GCAGCATTCT TGGTGGATTG
TTGGTTCAAG ACCTAGACGA TCAAGCGATC ACACCAAGCG AGTGGACAGT GGCAAGTCAG
CGGCCTCCCT CATCCCAGGA ACAGCAGGAT TTGGAGTTCG CTTGGCGATT GGTGCGTCAC
GTGCGTTCCA ACGCCATCGT GGTCGCCTCC AAGGGGCAGA GCTTGGGCAT AGGGGCCGGT
CAAATGAACC GGGTTGGCTC GGCTCGCCTC GCGCTTGATG CGGCTGGGGA TCAAGCCACA
GGGGCTGTGC TGGCCAGTGA TGGATTTTTC CCGTTTGACG ACACCGTGCG TCTTGCGGCG
AGCCACGGAA TTACAGCTGT AATTCATCCA GGTGGCAGCT TGCGCGATGC GGATTCGATC
AAGGCCTGTG ACGAACTGGG GCTCGCAATG CTGTTAACAG GCCGTCGACA CTTCCTTCAT
TGA
 
Protein sequence
MAPVALLSVS DKSGLLPLAE ALHRIHGYQL LSSGGTAKVL EQAGLPVTRV SEYTGAPEIL 
GGRVKTLHPR VHGGILAKRG DAAHQNDLEQ QNINFIDVVV VNLYPFRETV AKADVTWDQA
IENIDIGGPT MVRSAAKNHA DVAVLTSPDQ YDRLLEAMAQ AGGEVPAALR RQLALEAFQH
TAAYDTAISR WMDQAVAADG SPWLEAVPLR QTLRYGENPH QKARWYSHAQ QGWGGAVQLQ
GKELSTNNLL DLEAALAMVR EFGYGSDGAE PAVQPAAVVV KHTNPCGVAI GSDVSTALTR
ALDADRVSAF GGIVAINGVV SAAAAGELKS LFLECVVAPS FSPEAREILA AKANLRLLEL
QPAAIDAAGP DHVRSILGGL LVQDLDDQAI TPSEWTVASQ RPPSSQEQQD LEFAWRLVRH
VRSNAIVVAS KGQSLGIGAG QMNRVGSARL ALDAAGDQAT GAVLASDGFF PFDDTVRLAA
SHGITAVIHP GGSLRDADSI KACDELGLAM LLTGRRHFLH