Gene Caul_0903 details


Gene Information

Locus tag: Caul_0903
Symbol:
ID: 5898358
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 954897
End bp: 955991
Gene Length: 1095 bp
Protein Length: 364 aa
Translation table: 11
GC content: 72%
IMG OID: 641561386
Product: phosphoribosylaminoimidazole carboxylase ATPase subunit
Protein accession: YP_001682532
Protein GI: 167644869
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0026] Phosphoribosylaminoimidazole carboxylase (NCAIR synthetase)
TIGRFAM ID: [TIGR01161] phosphoribosylaminoimidazole carboxylase, PurK protein
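
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length; the function names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 954897
end_bp = 955991
gene_length_bp = 1095
protein_length_aa = 364


def span_length(start, end):
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Number of codons in the CDS minus one terminating stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_length = span_length(start_bp, end_bp)
computed_protein = expected_protein_length(computed_length)

# Compare the computed values with the values stated in the record.
print(computed_length == gene_length_bp)
print(computed_protein == protein_length_aa)
```

Under these assumptions both comparisons hold: 955991 - 954897 + 1 = 1095 bp, and 1095 / 3 - 1 = 364 aa.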


Plasmid Coverage information

Num covering plasmid clones: 34
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0668291
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGTAGTC CCCCCGCCTT TCCCCTGGCC CCGGGTTCGA CTATCGGCAT CCTCGGCGGG 
GGACAGCTGG GCCGCATGCT GGCCCTGGCC GCCGCGCGCC TGGGCTTCGA CGTCGTGATC
CTGGATCCGG AGGAGAACAG CCCGGCCGGC CGCGTCGCCG CCCGTCAGAT CGTCGGTCCC
TATGACGACC GCTGGTCGCT GCGCCGCCTG GCCGAGGTCG CCGACGTCGT CACCTACGAG
TTCGAGAACG TTCCGGCCGA CACCGTCGCC GAGCTGACCG CCCAGGGCGT GCTGGTCGCC
CCCGGCGCCA AGGCCCTGGC CGCCGCCCAG GACCGGGTGG TGGAAAAGAG CTTCCTGGCC
GAGATCGGCG TCCCGACCGT GGCCTTCGCC CCCGTCGAGA CCGTCGACGA CCTGCCCGCC
GCGATCGCCA AGATCGGCGC CCCGTCGCTG CTGAAGACCC GGCGCGAGGG CTATGACGGC
AAGGGCCAGG CCTGGGTGCA GCGTCCGGCC GACGCCGCCG CCAGCTTCGA GAAGATCGGC
CAGCAGCCCG CCATCCTTGA GGCCCCGGCC GACTTCGTGC GCGAGCTGTC GGTGATCGCC
GCCCGCGGCC GCGACGGCGA GATCGCCTGC TACCCGCTAT CGGACAACCA CCACGAGGGC
GGCGTCCTGC GCCGCACCAG CGCCCCGGCG AAGGTCTCAC CCGCCACCCG CGACCAGGCC
GAGGCCATCG TTGTGCGTAT CCTGACCGCG CTCGACTATG TGGGTGTGAT CGGGGTGGAG
CTGTTCGAGA TGGCCGACGG CAAGCTGCTG GTCAACGAGT TCGCCCCCCG GGTGCACAAC
ACCGGCCACT GGACCCAGGA CGGCTGCGAG GTCGATCAGT TCGAGCAGCA CATCCGCGCC
GTCGCCGGCT GGCCCCTGGG TCCCACCGCC CCCCGCGCCC ATGTCGAGAT GACCAACCTG
CTGGGCGCCG AGGTCGAGGC CTGGGCCAAG CTGGCCGCCG AACCCGAGAC CCGCCTCCAC
CTCTACGGCA AGGGCGAAGC CCGGCCGGGA CGGAAGATGG GTCACGTGAA CCGGTTGCGG
GGGCTGAAGG ACTGA
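
The gene sequence is displayed in blocks of 10 bases separated by spaces and line breaks, so it has to be joined before its length or GC content can be checked against the fields of the record. A minimal sketch of that cleanup follows; `clean_sequence` and `gc_fraction` are hypothetical helper names, and only the last displayed line of the sequence is used as sample input.

```python
def clean_sequence(text):
    """Join a block-formatted sequence by removing all spaces and line breaks."""
    return "".join(text.split()).upper()


def gc_fraction(dna):
    """Fraction of bases in the DNA string that are G or C."""
    if not dna:
        raise ValueError("empty sequence")
    return (dna.count("G") + dna.count("C")) / len(dna)


# Last displayed line of the gene sequence in this record.
tail = clean_sequence("GGGCTGAAGG ACTGA")

print(len(tail))          # number of bases in this fragment
print(tail[-3:])          # final codon of the gene
print(gc_fraction(tail))  # GC fraction of this fragment only
```

Passing the full gene sequence through the same two functions gives the values to compare with the Gene Length and GC content fields.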
 
Protein sequence
MRSPPAFPLA PGSTIGILGG GQLGRMLALA AARLGFDVVI LDPEENSPAG RVAARQIVGP 
YDDRWSLRRL AEVADVVTYE FENVPADTVA ELTAQGVLVA PGAKALAAAQ DRVVEKSFLA
EIGVPTVAFA PVETVDDLPA AIAKIGAPSL LKTRREGYDG KGQAWVQRPA DAAASFEKIG
QQPAILEAPA DFVRELSVIA ARGRDGEIAC YPLSDNHHEG GVLRRTSAPA KVSPATRDQA
EAIVVRILTA LDYVGVIGVE LFEMADGKLL VNEFAPRVHN TGHWTQDGCE VDQFEQHIRA
VAGWPLGPTA PRAHVEMTNL LGAEVEAWAK LAAEPETRLH LYGKGEARPG RKMGHVNRLR
GLKD
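
The relationship between the gene sequence and the protein sequence can be illustrated by translating codons directly. This is a sketch rather than the method used to produce the record: it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two differ in which start codons are permitted), and it does not model alternative start codons. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI codon order (TTT, TTC, TTA, TTG, TCT, ...); "*" marks a stop.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA string codon by codon; spaces and line breaks are ignored."""
    dna = "".join(dna.split()).upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


# First 30 bases and last 15 bases of the gene sequence in this record.
print(translate("ATGCGTAGTC CCCCCGCCTT TCCCCTGGCC"))
print(translate("GGGCTGAAGG ACTGA"))
```

The two fragments translate to MRSPPAFPLA and GLKD followed by a stop, which agree with the first and last residues of the protein sequence listed above.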