Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_0903 |
Symbol | |
ID | 5898358 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | - |
Start bp | 954897 |
End bp | 955991 |
Gene Length | 1095 bp |
Protein Length | 364 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641561386 |
Product | phosphoribosylaminoimidazole carboxylase ATPase subunit |
Protein accession | YP_001682532 |
Protein GI | 167644869 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0026] Phosphoribosylaminoimidazole carboxylase (NCAIR synthetase) |
TIGRFAM ID | [TIGR01161] phosphoribosylaminoimidazole carboxylase, PurK protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 34 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.0668291 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGTAGTC CCCCCGCCTT TCCCCTGGCC CCGGGTTCGA CTATCGGCAT CCTCGGCGGG GGACAGCTGG GCCGCATGCT GGCCCTGGCC GCCGCGCGCC TGGGCTTCGA CGTCGTGATC CTGGATCCGG AGGAGAACAG CCCGGCCGGC CGCGTCGCCG CCCGTCAGAT CGTCGGTCCC TATGACGACC GCTGGTCGCT GCGCCGCCTG GCCGAGGTCG CCGACGTCGT CACCTACGAG TTCGAGAACG TTCCGGCCGA CACCGTCGCC GAGCTGACCG CCCAGGGCGT GCTGGTCGCC CCCGGCGCCA AGGCCCTGGC CGCCGCCCAG GACCGGGTGG TGGAAAAGAG CTTCCTGGCC GAGATCGGCG TCCCGACCGT GGCCTTCGCC CCCGTCGAGA CCGTCGACGA CCTGCCCGCC GCGATCGCCA AGATCGGCGC CCCGTCGCTG CTGAAGACCC GGCGCGAGGG CTATGACGGC AAGGGCCAGG CCTGGGTGCA GCGTCCGGCC GACGCCGCCG CCAGCTTCGA GAAGATCGGC CAGCAGCCCG CCATCCTTGA GGCCCCGGCC GACTTCGTGC GCGAGCTGTC GGTGATCGCC GCCCGCGGCC GCGACGGCGA GATCGCCTGC TACCCGCTAT CGGACAACCA CCACGAGGGC GGCGTCCTGC GCCGCACCAG CGCCCCGGCG AAGGTCTCAC CCGCCACCCG CGACCAGGCC GAGGCCATCG TTGTGCGTAT CCTGACCGCG CTCGACTATG TGGGTGTGAT CGGGGTGGAG CTGTTCGAGA TGGCCGACGG CAAGCTGCTG GTCAACGAGT TCGCCCCCCG GGTGCACAAC ACCGGCCACT GGACCCAGGA CGGCTGCGAG GTCGATCAGT TCGAGCAGCA CATCCGCGCC GTCGCCGGCT GGCCCCTGGG TCCCACCGCC CCCCGCGCCC ATGTCGAGAT GACCAACCTG CTGGGCGCCG AGGTCGAGGC CTGGGCCAAG CTGGCCGCCG AACCCGAGAC CCGCCTCCAC CTCTACGGCA AGGGCGAAGC CCGGCCGGGA CGGAAGATGG GTCACGTGAA CCGGTTGCGG GGGCTGAAGG ACTGA
|
Protein sequence | MRSPPAFPLA PGSTIGILGG GQLGRMLALA AARLGFDVVI LDPEENSPAG RVAARQIVGP YDDRWSLRRL AEVADVVTYE FENVPADTVA ELTAQGVLVA PGAKALAAAQ DRVVEKSFLA EIGVPTVAFA PVETVDDLPA AIAKIGAPSL LKTRREGYDG KGQAWVQRPA DAAASFEKIG QQPAILEAPA DFVRELSVIA ARGRDGEIAC YPLSDNHHEG GVLRRTSAPA KVSPATRDQA EAIVVRILTA LDYVGVIGVE LFEMADGKLL VNEFAPRVHN TGHWTQDGCE VDQFEQHIRA VAGWPLGPTA PRAHVEMTNL LGAEVEAWAK LAAEPETRLH LYGKGEARPG RKMGHVNRLR GLKD
|
| |