Gene Caul_1298 details


Gene Information

Locus tag: Caul_1298
Symbol:
ID: 5898753
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 1370908
End bp: 1372113
Gene Length: 1206 bp
Protein Length: 401 aa
Translation table: 11
GC content: 71%
IMG OID: 641561783
Product: imidazolonepropionase
Protein accession: YP_001682926
Protein GI: 167645263
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG1228] Imidazolonepropionase and related amidohydrolases
TIGRFAM ID: [TIGR01224] imidazolonepropionase
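
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes that the start and end positions are 1-based and inclusive, and that the last codon of the CDS is a stop codon that is not counted in the protein length. The variable names are hypothetical and not part of the database.

```python
# Values copied from the record above.
start_bp = 1370908
end_bp = 1372113
reported_gene_length = 1206    # bp
reported_protein_length = 401  # aa

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: unspliced CDS whose final codon is a stop codon,
# so the protein has one residue fewer than the number of codons.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 1206 0 401
```

Under these assumptions the computed values agree with the reported Gene Length and Protein Length.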


Plasmid Coverage information

Num covering plasmid clones: 30
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.167315
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGCTGTG ATCGGGTGTG GAGCAAGGCC CGCCTGGCGA CTTTCGCGGC TGGCCGACCG 
GGAATCGGCG TGGTCGAGGA CGGCGTCGTG GCCAGCCTGA GCGGTCGCAT CGTCTATGCC
GGTCCCGCCA CCGAAGCGCC CGCATTCGAG GCGCTGGAAA CCCTCGACTG CGAGGGCCGC
TGGATCACGC CGGGCCTGAT CGATCCCCAC ACCCACCTGG TGTTCGGCGG CGACCGCGCC
CGCGAGTTCG AGCTGCGGCT GGCCGGCGCC ACGTACGAGG AGATCGCCCG GGCCGGCGGC
GGCATCGTCT CGACCATGAA GGCCACCCGC GCCGCCTCGG AAGGCGAACT GGTCGCCAGC
GCCCTGCCCC GCCTGGACGC CCTGATCGCC GAGGGCCTGA CGACGATCGA GATCAAGTCC
GGCTATGGCC TGTCGCTGGA CGACGAGCTG AAGAGCCTGC GCGCCGCCCG CGCCCTGGCC
GATGTCCGCA AGGTTTCGGT GACCACCACC TTCCTCGGCG CCCACGCCCT GCCGCCGGAA
TATGAGGGCG ACCCGGACGG CTATATCGAC CACGTCTGCT ACCAGATGAT CCCGGCCGTC
GCCGCCGAGG GCCTGGCCGA CGCGGTCGAC GCCTTCTGCG AGGGCATCGG CTTTTCCCGC
GCCCAGACCC GCCGGGTGTT CCAGGCGGCG CGCGAGCGTG GGCTGCCGGT CAAGCTGCAC
GCCGAGCAAC TTTCCAACCT CGACGGCGCG GCCCTGGCCG CCGAGTTCGG GGCGCTGTCG
GCCGATCATC TGGAGCATCT GGACGGCGCC GGGATCGCCG CCATGGCCCA GGCCGGCACG
ACGGCGGTGC TGCTGCCGGG GGCCTTCTAT TTCGTGCGCG AGACCAGACT TCCGCCGATC
CAGGCCCTGC GCGCCGCCGG CGTGCCCCTG GCCCTGGCCA CCGACTGCAA CCCCGGCACC
TCGCCCCTGA CCAGCCTGCT GCTGACCCTG AACATGGCCG CCACCCTGTT CCGCATGACC
GTCGATGAGT GCCTGGCCGG CGTCACGCGC GAGGCGGCGC GGGCCATCGG CCGCCTCGAC
CACATCGGCA CGCTGGAGGC CGGGAAGTCC TGCGACCTCG CCATCTGGGA CATCGAGCGT
CCCGCGCAAC TTGTCTACCG CATGGGCTTC AACCCGCTCC ATGCACGCGT CTGGAAGGGC
CTGTAA
 
Protein sequence
MRCDRVWSKA RLATFAAGRP GIGVVEDGVV ASLSGRIVYA GPATEAPAFE ALETLDCEGR 
WITPGLIDPH THLVFGGDRA REFELRLAGA TYEEIARAGG GIVSTMKATR AASEGELVAS
ALPRLDALIA EGLTTIEIKS GYGLSLDDEL KSLRAARALA DVRKVSVTTT FLGAHALPPE
YEGDPDGYID HVCYQMIPAV AAEGLADAVD AFCEGIGFSR AQTRRVFQAA RERGLPVKLH
AEQLSNLDGA ALAAEFGALS ADHLEHLDGA GIAAMAQAGT TAVLLPGAFY FVRETRLPPI
QALRAAGVPL ALATDCNPGT SPLTSLLLTL NMAATLFRMT VDECLAGVTR EAARAIGRLD
HIGTLEAGKS CDLAIWDIER PAQLVYRMGF NPLHARVWKG L
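
The relationship between the gene sequence and the protein sequence above can be illustrated with a short translation sketch. It is a minimal example, not the pipeline used to produce this record: the helper names (`clean`, `gc_content`, `translate`) are hypothetical, and the codon table is the standard amino-acid assignment, which translation table 11 shares (table 11 differs in its permitted start codons, which does not matter here because the gene starts with ATG).

```python
from itertools import product

# Standard codon-to-amino-acid assignments in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace used to group the sequence into 10-letter blocks."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence above.
first_blocks = "ATGCGCTGTG ATCGGGTGTG GAGCAAGGCC"
print(translate(first_blocks))  # MRCDRVWSKA
```

The translated prefix matches the first block of the protein sequence. Passing the full gene sequence to `gc_content` gives a value that can be compared with the GC content field.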