Gene Information |
Locus tag | Cphamn1_1877 |
Symbol | purH |
ID | 6375568 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides BS1 |
Kingdom | Bacteria |
Replicon accession | NC_010831 |
Strand | - |
Start bp | 2037180 |
End bp | 2038757 |
Gene Length | 1578 bp |
Protein Length | 525 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 642684373 |
Product | bifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase |
Protein accession | YP_001960275 |
Protein GI | 189500805 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful) |
TIGRFAM ID | [TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase |
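
The length fields above can be cross-checked against the coordinates. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon; the record does not state either convention, but the numbers are consistent with both. The function names are illustrative only and do not come from any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of an inclusive, 1-based coordinate span."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(2037180, 2038757)
print(length)                     # 1578
print(protein_length_aa(length))  # 525
```

Both values agree with the Gene Length (1578 bp) and Protein Length (525 aa) rows.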
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 4 |
Fosmid unclonability p-value | 0.0128464 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCTGATC CTGTTATCAA GCGTGCATTA GTGTCAGTTT CTGATAAATC CGGCGTTGTT GAATTCTGCC GCGAACTCTC ATCCATGGGG GTTGAAATCT TCTCGACCGG AGGAACTCTG CGGAAACTTC AGGAATCCGG TGTTGCAGCG GCTTCCATTT CAACCATTAC GGGCTTTCCG GAAATTATGG ATGGTCGGGT GAAAACGCTG CATCCGAAAA TCCATGGCGG ACTGCTTGCT GTGCGTGATA ATGCCGATCA TATCGCTCAG GCGCGGGATA ACGGTATCGG TTTTATCGAC ATGGTTGTCG TCAATCTCTA TCCGTTCCAG GAGACAGTCG CGAAACCTGA TGTGACGTTT GAAGAGGCGA TTGAAAATAT CGATATCGGC GGACCTTCGA TGCTTCGCAG CGCGGCAAAG AACCATGAGT CGGTGACCGT TATCACTGAA AGCGCCGATT ACCGGACGGT ACTCGATGAA ATGCGGGAGA ACAACGGCGC GACCACACGT TCGACCCGAC TGAAGCTGGC AGGAAAAGTG TTTACACTGA CATCCCGTTA CGACCGGGCA ATCGCGGATT ACCTGGCTGC ATCTTCAGAG GGAGAGGCAT CTTCGGAAGC TGGATCGATC AGTGTCCGGC TGGAAAAAGA GATCGATATG CGCTATGGTG AGAACCCGCA TCAGAACGCC GGTTTCTATC GTATGGACGA CGGCAGCGGG TCACGCTCGT TTGAGGAGTA TTTCCGGAAA CTTCACGGTA AGGATCTTTC ATACAACAAC ATGCTCGATA CTGCCGCGGC GACCGCTCTG ATTGAAGAGT TCAGGGATGA AGCGCCGGCG GTGGTTATTA TCAAACATAC CAATCCTTGC GGTGTCGCGC AGGCCGATAC GCTTGTCGAG GCCTATCGCA AGGCGTTCTC AACCGATACA CAGTCTCCTT TCGGCGGGAT CATCGCATGC AACAGACCGC TCGATATGGA AACCGCGAAG GCCATTGATG AAATCTTCAC CGAAATCCTT ATTGCTCCGG CCTATGAAGA AGGGGTTCTT GATATGCTGA TGAAGAAGAA GAACCGGCGT CTTCTTCTCC AGAGAAAACC TCTTCTGCAG GAGGTTACGG AATACAAGTC AACCCGGTTC GGCATGCTGG TACAGGAAAG AGACAGCCGG ATTGCTTCCC GGGATGACCT GAAAGTCGTC ACGAAACGTC AGCCTTCAGC GCAGGAGCTC GATGATCTCA TGTTTGCATG GAAGATCTGC AAGCATGTGA AGTCAAACAC GATCGTCTAT GTGAAGAACC GACAGACAGT CGGGGTTGGA GCAGGACAGA TGTCCCGTGT CGATTCAGCG AAAATCGCCC GTTCAAAAGC TGCCGAGGCG GGCCTTGACC TGAACGGATC CGCGGTCGCG TCAGACGCGT TTTTCCCGTT TGCCGACGGA CTGCTCGCAG CGGCAGAAGC GGGAGCTATG GCGGTTATAC AGCCCGGCGG ATCGGTTCGC GATGATGAGG TTATCGCCGC CGCCGACGAG CATGACCTCG CGATGGTGTT CACCTCTATG CGGCACTTCA AGCATTGA
|
Protein sequence | MSDPVIKRAL VSVSDKSGVV EFCRELSSMG VEIFSTGGTL RKLQESGVAA ASISTITGFP EIMDGRVKTL HPKIHGGLLA VRDNADHIAQ ARDNGIGFID MVVVNLYPFQ ETVAKPDVTF EEAIENIDIG GPSMLRSAAK NHESVTVITE SADYRTVLDE MRENNGATTR STRLKLAGKV FTLTSRYDRA IADYLAASSE GEASSEAGSI SVRLEKEIDM RYGENPHQNA GFYRMDDGSG SRSFEEYFRK LHGKDLSYNN MLDTAAATAL IEEFRDEAPA VVIIKHTNPC GVAQADTLVE AYRKAFSTDT QSPFGGIIAC NRPLDMETAK AIDEIFTEIL IAPAYEEGVL DMLMKKKNRR LLLQRKPLLQ EVTEYKSTRF GMLVQERDSR IASRDDLKVV TKRQPSAQEL DDLMFAWKIC KHVKSNTIVY VKNRQTVGVG AGQMSRVDSA KIARSKAAEA GLDLNGSAVA SDAFFPFADG LLAAAEAGAM AVIQPGGSVR DDEVIAAADE HDLAMVFTSM RHFKH
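
The gene and protein sequences above are printed in space-separated blocks of ten, so any check on them has to strip that whitespace first. As a minimal sketch, the helpers below translate a coding sequence and compute its GC percentage. The codon assignments are those of the standard genetic code, which translation table 11 shares for elongation codons (table 11 differs in the start codons it permits; this gene starts with ATG, so that does not matter here). All function names are illustrative, and the gene sequence is assumed to be given in coding orientation, as its leading ATG and trailing TGA suggest.

```python
from itertools import product

# Codon table in NCBI order: bases T, C, A, G, first base varying slowest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing used in the record and upper-case the result."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three blocks of the gene sequence give the first ten residues.
print(translate("ATGTCTGATC CTGTTATCAA GCGTGCATTA"))  # MSDPVIKRAL
```

Passing the full Gene sequence field to `translate` and comparing the result with `clean` applied to the Protein sequence field checks the whole record the same way; `gc_percent` can likewise be compared with the GC content row.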