Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_1298 |
Symbol | |
ID | 5898753 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | - |
Start bp | 1370908 |
End bp | 1372113 |
Gene Length | 1206 bp |
Protein Length | 401 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641561783 |
Product | imidazolonepropionase |
Protein accession | YP_001682926 |
Protein GI | 167645263 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG1228] Imidazolonepropionase and related amidohydrolases |
TIGRFAM ID | [TIGR01224] imidazolonepropionase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.167315 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCTGTG ATCGGGTGTG GAGCAAGGCC CGCCTGGCGA CTTTCGCGGC TGGCCGACCG GGAATCGGCG TGGTCGAGGA CGGCGTCGTG GCCAGCCTGA GCGGTCGCAT CGTCTATGCC GGTCCCGCCA CCGAAGCGCC CGCATTCGAG GCGCTGGAAA CCCTCGACTG CGAGGGCCGC TGGATCACGC CGGGCCTGAT CGATCCCCAC ACCCACCTGG TGTTCGGCGG CGACCGCGCC CGCGAGTTCG AGCTGCGGCT GGCCGGCGCC ACGTACGAGG AGATCGCCCG GGCCGGCGGC GGCATCGTCT CGACCATGAA GGCCACCCGC GCCGCCTCGG AAGGCGAACT GGTCGCCAGC GCCCTGCCCC GCCTGGACGC CCTGATCGCC GAGGGCCTGA CGACGATCGA GATCAAGTCC GGCTATGGCC TGTCGCTGGA CGACGAGCTG AAGAGCCTGC GCGCCGCCCG CGCCCTGGCC GATGTCCGCA AGGTTTCGGT GACCACCACC TTCCTCGGCG CCCACGCCCT GCCGCCGGAA TATGAGGGCG ACCCGGACGG CTATATCGAC CACGTCTGCT ACCAGATGAT CCCGGCCGTC GCCGCCGAGG GCCTGGCCGA CGCGGTCGAC GCCTTCTGCG AGGGCATCGG CTTTTCCCGC GCCCAGACCC GCCGGGTGTT CCAGGCGGCG CGCGAGCGTG GGCTGCCGGT CAAGCTGCAC GCCGAGCAAC TTTCCAACCT CGACGGCGCG GCCCTGGCCG CCGAGTTCGG GGCGCTGTCG GCCGATCATC TGGAGCATCT GGACGGCGCC GGGATCGCCG CCATGGCCCA GGCCGGCACG ACGGCGGTGC TGCTGCCGGG GGCCTTCTAT TTCGTGCGCG AGACCAGACT TCCGCCGATC CAGGCCCTGC GCGCCGCCGG CGTGCCCCTG GCCCTGGCCA CCGACTGCAA CCCCGGCACC TCGCCCCTGA CCAGCCTGCT GCTGACCCTG AACATGGCCG CCACCCTGTT CCGCATGACC GTCGATGAGT GCCTGGCCGG CGTCACGCGC GAGGCGGCGC GGGCCATCGG CCGCCTCGAC CACATCGGCA CGCTGGAGGC CGGGAAGTCC TGCGACCTCG CCATCTGGGA CATCGAGCGT CCCGCGCAAC TTGTCTACCG CATGGGCTTC AACCCGCTCC ATGCACGCGT CTGGAAGGGC CTGTAA
|
Protein sequence | MRCDRVWSKA RLATFAAGRP GIGVVEDGVV ASLSGRIVYA GPATEAPAFE ALETLDCEGR WITPGLIDPH THLVFGGDRA REFELRLAGA TYEEIARAGG GIVSTMKATR AASEGELVAS ALPRLDALIA EGLTTIEIKS GYGLSLDDEL KSLRAARALA DVRKVSVTTT FLGAHALPPE YEGDPDGYID HVCYQMIPAV AAEGLADAVD AFCEGIGFSR AQTRRVFQAA RERGLPVKLH AEQLSNLDGA ALAAEFGALS ADHLEHLDGA GIAAMAQAGT TAVLLPGAFY FVRETRLPPI QALRAAGVPL ALATDCNPGT SPLTSLLLTL NMAATLFRMT VDECLAGVTR EAARAIGRLD HIGTLEAGKS CDLAIWDIER PAQLVYRMGF NPLHARVWKG L
|
| |