Gene Caul_2545 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_2545 
Symbol 
ID5900000 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp2761651 
End bp2762676 
Gene Length1026 bp 
Protein Length341 aa 
Translation table11 
GC content70% 
IMG OID641563036 
Productphosphoribosylaminoimidazole synthetase 
Protein accessionYP_001684170 
Protein GI167646507 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0150] Phosphoribosylaminoimidazole (AIR) synthetase 
TIGRFAM ID[TIGR00878] phosphoribosylaminoimidazole synthetase 


Plasmid Coverage information

Num covering plasmid clones33 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGACC AGCAGAACGG CCTCACCTAC GCCCAGGCTG GTGTCGATAT CGACGCCGGG 
AACGCCCTGG TCGAGGCGAT CAAACCCTTG GCCAAGGCCA CGCGGCGCCC CGGCGCGGAC
GGCGGCCTGG GCGGCTTCGG CGCCCTGTTC GACCTGAAGG CGGCGGGTTA CGACGACCCG
CTTCTGGTCT CCACCACCGA CGGCGTCGGC ACCAAACTGC GCATCGCCAT CGACGCCAAG
ATGCACGCCA CGGTCGGCAT CGACCTGGTG GCCATGTGCG TCAACGATCT GCTGGCCCAG
GGCGCCGAGC CGCTGCTGTT CCTCGACTAT TTCGCCACCG GCAAGCTGGA CCTGGAGGTC
GCCAAGAGCG TGGTGGCCGG CATCGCCGAC GGCTGCAAGC TGGCCGGCGC GGCCTTGGTG
GGCGGCGAGA CGGCGGAAAT GCCGGGCATG TACGGCGACG GCGAATACGA CCTGGCCGGC
TTCTCGGTCG GCGCGGTCGA GCGCGACGGC GTGCTGCCCA AGCTGGACAA GCAGCGGGCC
GGCGATCTGA TCATCGGCGT CGGCTCGTCG GGCCCGCACT CCAACGGCTA CAGCCTGGTG
CGCCGCGTGG TCGAGCGCTC GGGCCTGACC TGGGACGCCC CCTGCCCGTT TGAGGACGGC
AAGACCCTGG CCGAGGCCCT GATGGCCCCG ACCCGCATCT ATGTGAAGTC GATCCTGCCC
CTGCTGCAGT CGGGCCGGGT CAAGGGCGGC GCCCACATCA CCGGCGGCGG CCTGATCGAG
AACCCGCCGC GCTGCATCGC CGATGGTCTC AAGCCCGAAT TCGACTGGAA CGCCTGGCCC
CTGCCGCCGG TCTTCGACTG GCTGCAGCGC GAAGGCGGCA TCACCGACCA CGAACTGCGC
CGCACCTTCA ACTGCGGCGT CGGCTTCATC CTGGTGGTCG CCGCCAGCGA CGCCGAGCCG
GTGCTGGCGG CGCTGCTGAA CGCGGGCGAG GACGCGTTTG TTTGTGGGCA GTTGGTGGCG
GGCTGA
 
Protein sequence
MSDQQNGLTY AQAGVDIDAG NALVEAIKPL AKATRRPGAD GGLGGFGALF DLKAAGYDDP 
LLVSTTDGVG TKLRIAIDAK MHATVGIDLV AMCVNDLLAQ GAEPLLFLDY FATGKLDLEV
AKSVVAGIAD GCKLAGAALV GGETAEMPGM YGDGEYDLAG FSVGAVERDG VLPKLDKQRA
GDLIIGVGSS GPHSNGYSLV RRVVERSGLT WDAPCPFEDG KTLAEALMAP TRIYVKSILP
LLQSGRVKGG AHITGGGLIE NPPRCIADGL KPEFDWNAWP LPPVFDWLQR EGGITDHELR
RTFNCGVGFI LVVAASDAEP VLAALLNAGE DAFVCGQLVA G