Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_5043 |
Symbol | |
ID | 5902505 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | - |
Start bp | 5444174 |
End bp | 5444899 |
Gene Length | 726 bp |
Protein Length | 241 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641565564 |
Product | 1-(5-phosphoribosyl)-5-[(5- phosphoribosylamino)methylideneamino] imidazole-4-carboxamide isomerase |
Protein accession | YP_001686661 |
Protein GI | 167648998 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0106] Phosphoribosylformimino-5-aminoimidazole carboxamide ribonucleotide (ProFAR) isomerase |
TIGRFAM ID | [TIGR00007] phosphoribosylformimino-5-aminoimidazole carboxamide ribotide isomerase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 47 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.226642 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATCCTCT ATCCCGCCAT CGACCTGAAG GACGGCCAGT GCGTGCGCCT GCTGCACGGC GACATGGAAA AGGCCACGGT CTTCAACAAC AGTCCGGCCG ACCAGGCCGA GCGCTTCGTG CGCGACGGCT TCTCCTGGCT GCACGTGGTC GACCTGAACG GCGCGATCGA GGGCAAGTCG GTCAACACCG CCGCCGTGCA GTCGATCCTG GAGTCGATCT CGATCCCCGT GCAGCTGGGC GGCGGCATCC GGACCCTGGA GGGCGTCGAG GCCTGGATCG AGGCGGGCGT GTCGCGCGTG ATCCTCGGCA CCGTGGCCGT CCACGACCCA GACCTGGTCC GCAAGGCCGC TCGCCTTTGG CCCGAACAGA TCGCCGTGGC CGTCGACGTG CGCGACGGCA AGGTGGCGGT CGACGGCTGG ACCGGCCTGT CGGACCTGGA CGCCATCACC CTGGGCAAGC GCTTCGAGGA CGTGGGCGTG GCCGCGCTGA TCGTCACCGA TATCAGCCGC GACGGCGCCC TGACCGGGGT CAATGTCGAA GGCGTGGGCG AGCTGGCCGA CGCGGTCTCG ATTCCGGTCA TCGCCTCGGG CGGCGTGGCC TCGGTGGCCG ACATCGAGCG GCTGAAGGCC CGGCCCGGCG TCGAGATCGC CGGCGCCATC CTGGGCCGAT CGCTCTATGC CGGCACGATT CGTCCGGCCG AAGCCCTGAC CATAGCGGCC GCCTGA
|
Protein sequence | MILYPAIDLK DGQCVRLLHG DMEKATVFNN SPADQAERFV RDGFSWLHVV DLNGAIEGKS VNTAAVQSIL ESISIPVQLG GGIRTLEGVE AWIEAGVSRV ILGTVAVHDP DLVRKAARLW PEQIAVAVDV RDGKVAVDGW TGLSDLDAIT LGKRFEDVGV AALIVTDISR DGALTGVNVE GVGELADAVS IPVIASGGVA SVADIERLKA RPGVEIAGAI LGRSLYAGTI RPAEALTIAA A
|
| |