Gene Paes_1700 details


Gene Information

Locus tag: Paes_1700
Symbol: purH
ID: 6459862
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prosthecochloris aestuarii DSM 271
Kingdom: Bacteria
Replicon accession: NC_011059
Strand:
Start bp: 1855553
End bp: 1857130
Gene Length: 1578 bp
Protein Length: 525 aa
Translation table: 11
GC content: 51%
IMG OID: 642725688
Product: bifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase
Protein accession: YP_002016365
Protein GI: 194334505
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful)
TIGRFAM ID: [TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase
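
The start, end, gene length and protein length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 1855553
END_BP = 1857130
GENE_LENGTH_BP = 1578
PROTEIN_LENGTH_AA = 525


def span_length(start, end):
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Residues encoded by a CDS whose last codon is a stop codon (assumption)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks print True for this record under the stated assumptions.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions the 1578 bp span gives 526 codons, one of which is the stop codon, which agrees with the listed 525 aa.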


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCTGATC CTGTTATCAA GCGAGCGTTG GTTTCTGTTT CTGATAAAAC CGGCATCGTC 
GATTTTTGCC GTGAGCTGGG GGCTATGGGT GTCGAGATTT TTTCAACCGG AGGCACCCTG
CGTATTCTTC AGGAGTCCGG TATCGAAGCT GCTTCAATTT CGACGATAAC CGGTTTTCCT
GAGATTATGG ATGGCCGCGT GAAAACGCTT CACCCGAAAA TTCACGGCGG CCTGCTTGCT
GTTCGTGACA ACGAGGATCA TGTCGGCCAG GCAAAGGCGA ATGGTATTGA GTTTATCGAT
ATGGTTGTTG TTAATCTCTA CCCTTTCGAG GCTACTGTTG CCAAGCCTGA TGTGACGTTT
GAAGAAGCTA TCGAAAATAT TGACATCGGC GGTCCGTCGA TGCTTCGCAG CGCAGCGAAG
AATAACGAGT CTGTCACGGT GGTGACCGAC AGCGCTGATT ATGCAACGGT TCTTGATGAG
ATGCGCTCCA ATAATGGTGC GACCCGTCGT GAGACTCGCC TGACCCTTGC AAGAAAGGTG
TTTGAACTTA CCTCGCGCTA TGACCGTGCG ATTGCTGATT ACCTGATCGG TGCCGAGGAG
AGTGGCGAAA CGGAGGCTCC GGCCGCTATT TCCGTGAAGC TTGAAAAAGA GCTCGATATG
CGTTATGGCG AGAATCCTCA TCAGAGCGCC GGTTTCTATC GTCTTGTGGA TGGTCAGGGA
TCACGCTGTT TTGATGATTT CTTCGACAAG TTGCACGGTA AGGAGTTGTC GTACAACAAT
ATGCTTGATA TTGCCGCTGC GACTGGACTT GTCGAGGAGT TTCGCGGTGA GGATCCTGCA
GTGGTTATCA TCAAACACAC CAACCCGTGC GGGGTTGCTC AGGCAGGAAC TCTTGTTGAC
GCGTATCGCA AGGCGTTTTC GACCGATACA CAGTCACCGT TTGGCGGTAT CATCGCTTTT
AACGTTCCGC TCGATATGGA GACAGCACTT GCCGTCGATG AGATTTTTAC CGAGATTCTG
ATTGCTCCGG CATACGAGGA TGGGGTGCTG GATATGCTGA TGAAGAAGAA AAACCGTCGT
CTTGTTCTTC AGAAAAAGGC GCTTCTCCAG GAGGTCATGG AATACAAGTC GACACAGTTC
GGCATGCTCG TTCAGGATCG CGACAGCAAG ATTGTTTCTC GTGAGGACCT GAAGGTTGTG
ACGAAACGCC AGCCGGACGA GCAGGAACTT GATGATATGA TGTTTGCATG GAAGATCGCC
AAGCATGTGA AGTCCAATAC GATTGTGTAT GTGAAAAACG GTCAGACGAT TGGTGTGGGA
GCAGGTCAGA TGTCGCGTAT CGATTCGGCA AAAATCGCTC GTTCCAAGGC TGCCGAGGCC
GGTTTGGATA TCAAGGGTTC TGCAGTTGCT TCAGATGCGT TTTTCCCGTT TGCAGATGGT
TTGCTTGCCG CTGCTGAAGC CGGTGCGACA TCGGTCATAC AGCCTGGCGG ATCGATCCGC
GACGATGAGG TTATTGCCGC TGCGGACGAG AACAACCTTG CAATGGTCTT TACGTCGATG
CGCCACTTCA AGCATTGA
 
Protein sequence
MSDPVIKRAL VSVSDKTGIV DFCRELGAMG VEIFSTGGTL RILQESGIEA ASISTITGFP 
EIMDGRVKTL HPKIHGGLLA VRDNEDHVGQ AKANGIEFID MVVVNLYPFE ATVAKPDVTF
EEAIENIDIG GPSMLRSAAK NNESVTVVTD SADYATVLDE MRSNNGATRR ETRLTLARKV
FELTSRYDRA IADYLIGAEE SGETEAPAAI SVKLEKELDM RYGENPHQSA GFYRLVDGQG
SRCFDDFFDK LHGKELSYNN MLDIAAATGL VEEFRGEDPA VVIIKHTNPC GVAQAGTLVD
AYRKAFSTDT QSPFGGIIAF NVPLDMETAL AVDEIFTEIL IAPAYEDGVL DMLMKKKNRR
LVLQKKALLQ EVMEYKSTQF GMLVQDRDSK IVSREDLKVV TKRQPDEQEL DDMMFAWKIA
KHVKSNTIVY VKNGQTIGVG AGQMSRIDSA KIARSKAAEA GLDIKGSAVA SDAFFPFADG
LLAAAEAGAT SVIQPGGSIR DDEVIAAADE NNLAMVFTSM RHFKH
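
The gene and protein sequences above are related by translation table 11, which uses the same codon-to-amino-acid assignments as the standard genetic code and differs only in the set of permitted start codons. The following is a minimal sketch of that translation in plain Python, applied to the first and last blocks of the gene sequence; the helper names (`clean`, `translate`, `gc_fraction`) are hypothetical, and the sketch does not handle alternative start codons, which is sufficient here because this gene starts with ATG.

```python
from itertools import product

# Codon assignments shared by the standard code and table 11,
# listed in TCAG order for each codon position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used to display the sequence in blocks."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna):
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First line of the gene sequence (60 bases, in frame).
first_block = "ATGTCTGATC CTGTTATCAA GCGAGCGTTG GTTTCTGTTT CTGATAAAAC CGGCATCGTC"
# Last line of the gene sequence (18 bases, in frame, ends with the stop codon).
last_block = "CGCCACTTCA AGCATTGA"

print(translate(first_block))  # MSDPVIKRALVSVSDKTGIV
print(translate(last_block))   # RHFKH
```

The two printed fragments match the first 20 and last 5 residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow the same comparison against the complete protein and the GC content field.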