Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cpha266_0492 |
Symbol | |
ID | 4569262 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides DSM 266 |
Kingdom | Bacteria |
Replicon accession | NC_008639 |
Strand | + |
Start bp | 539510 |
End bp | 540280 |
Gene Length | 771 bp |
Protein Length | 256 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 639765091 |
Product | 1-(5-phosphoribosyl)-5-[(5- phosphoribosylamino)methylideneamino] imidazole-4-carboxamide isomerase |
Protein accession | YP_910973 |
Protein GI | 119356329 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0106] Phosphoribosylformimino-5-aminoimidazole carboxamide ribonucleotide (ProFAR) isomerase |
TIGRFAM ID | [TIGR00007] phosphoribosylformimino-5-aminoimidazole carboxamide ribotide isomerase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTGATAA TTCCTGCTAT AGACATTAAG GACGGAAGAT GTGTGCGGTT GACTCGTGGT GATTTCAACC AGCAGAAAAT CTACCTCGAC AATCCCTGCG ACATGGCCAT CATCTGGAGA AAACAGAATG CCAAGATGCT TCATGTCGTT GATCTGGATG CTGCATTGAC CGGCGAAATG GTCAATTTTC TGAAAATAGA GCAAATCGTA CGCGAACTGG ATATTCCTTT GCAGGTTGGG GGAGGAATCC GTTCTCTCGA TGCGGTTAAG CGATATCTTG ATATCGGTGT CGGCCGGGTG GTGATAGGGT CGGCAGCGGT GACAAATCCC GGGTTGATTG AAGAAGTGCT GAAAATATAT CGGCCCTCGA AAATTGTTGT CGGGATTGAC GCGGAGAATG GCATCCCTAA AATCAAGGGG TGGACAGAAA GCTGTGGTAT GAAAGATTAC GAGCTGGGGC TTGAGATGAA GCAAATGGGT ATTGAACGCG TTGTCTATAC CGATATTTCA AAAGACGGCA TGATGCAGGG GTTTGGTTAC GAAAGCACCA AGCGGTTTGC CGAAATGACC GGCATGAAAG TAACTGCTTC CGGTGGCGTT ACCAGTTCAG ATGACCTCAT GAAGCTTGTT GGCTTGCAGC AGTACGGGGT TGATTCCGTT ATTATCGGAA AAGCCCTGTA CGAGTGTAAT TTTCCGTGCC AGGAGCTTTG GTATAATTTT GAAGAGGATA TCTGTATTGA TCATAACTTT TCCACAGCAA GGAAGAAGTG A
|
Protein sequence | MLIIPAIDIK DGRCVRLTRG DFNQQKIYLD NPCDMAIIWR KQNAKMLHVV DLDAALTGEM VNFLKIEQIV RELDIPLQVG GGIRSLDAVK RYLDIGVGRV VIGSAAVTNP GLIEEVLKIY RPSKIVVGID AENGIPKIKG WTESCGMKDY ELGLEMKQMG IERVVYTDIS KDGMMQGFGY ESTKRFAEMT GMKVTASGGV TSSDDLMKLV GLQQYGVDSV IIGKALYECN FPCQELWYNF EEDICIDHNF STARKK
|
| |