Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_1299 |
Symbol | |
ID | 5898754 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 1372200 |
End bp | 1373675 |
Gene Length | 1476 bp |
Protein Length | 491 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641561784 |
Product | N-formimino-L-glutamate deiminase |
Protein accession | YP_001682927 |
Protein GI | 167645264 |
COG category | [F] Nucleotide transport and metabolism [R] General function prediction only |
COG ID | [COG0402] Cytosine deaminase and related metal-dependent hydrolases |
TIGRFAM ID | [TIGR02022] formiminoglutamate deiminase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 34 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.450827 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGACCGACG CCGAGTTTTC GCCAGGATCC GAACCCGCCG ACGAGCCGAT CTACACGCCG CCTGGCCAGA GCCCGGCCGC GCCGCGAGCC GCCCCTCAAT CGGACGATCC GCGGACCCTT TGGTTCGAGC AGGCCCTGCT GGCCGACGGC TGGGCCAGGG ACGTGCGGTT GACGCTGAGC GACGGCCTGA TCGCCCGGAT CGACACCGAC GTGGCCCGCC AGTCCGGCGA GGCGGCGCAC GGCCCCTGCC TTCCCGGCCT GCCCAACCTG CACAGCCACG CCTTCCAGCG GGCGATGGCC GGCCTGACCG AGGTGCGGGG ACCCACGGGA GACAGCTTCT GGACCTGGCG CGAGCTGATG TACCGCTTCG TCGACCGGAT CGGCCCCGAC GAGGTCCAGG CTGTCGCTGC TCTAGCTTAC ATGGAGATGC TGGAAACCGG TTCGACCCGG GTGGGCGAGT TCCACTACCT GCACCACGAC AAGGACGGAT CGCCCTATGC CGACCCGGCC GAGATGGCGG CGCGGATCGC CGCGGCGGCC GACGAGACCG GCATCGGCCT GACCCTGCTG CCGGTGTTCT ACGCCCATGC CGGCTTTGGC GGCGTCGCGC CGGGGCAGGG CCAGCGGCGG TTCATCCATG ACATCGACGG CTACGGCCGG CTGATCGAGG CCAGCCGCGC GGCGGTGGCG GACTTGCCGG ACGCGGTGGT CGGCATCGCG CCGCACAGCC TGCGGGCGGT GACCGGCGAG GAGCTGACGG CGATCCTGCC GCTTGCCGGG ATCGGGGCGG GCGCAGGTCC CGTCCACATC CACATCGCCG AGCAGACCCA GGAGGTCGAC GACTGCCTGG CCGCCACCGG CGCCCGGCCG GTGCGCTGGC TAATGAACAA CGCGCCGGTC GACAAGCGCT GGTGTCTGGT CCACGCCACC CATCTCAACG CCATGGAGAC CGAGCGCCTG GCCAAGAGCG GCGCGGTCGC CGGCCTGTGC CCGATCACCG AGGCCAATCT CGGCGACGGC GTCTTCCCGG CCCATGATTA TCTGGCGGCC GGCGGAGCGT TCGGCATCGG CTCGGACTCC AACGTGCTGA TCGACGCGGC CGAGGAACTT CGGACGCTGG AATACGCCCA GCGCCTGACG CGCCGGGCTC GAAGCGTGCT GGCCAAGGGG GCCGGCGGCT CGACCGGCGG CGAGCTGTTT CGGTCCGCCG TCGCCGGCGG CGCCCAGGCC CTCGGCGTGG CGACAGGCCT GCGACGCGGG CGCCCGGCCG ATTTCGTGAC CCTGGATCGC ACCCATCCGG CGATGATCGG CCGCGATGGC GACGCCTTGC TGGACAGTTG GGTGTTCGCC GGGCGACACG GGGCCATCGA CGGAGTCTGG CGCCATGGTC GCCAGGTCGT CACCGGCGGC CGTCACAATG GGCGCGAGGC GATTCTCGCC CGCTATCGGA CGGCCTTGGG GAGCGTGTTG GCTTGA
|
Protein sequence | MTDAEFSPGS EPADEPIYTP PGQSPAAPRA APQSDDPRTL WFEQALLADG WARDVRLTLS DGLIARIDTD VARQSGEAAH GPCLPGLPNL HSHAFQRAMA GLTEVRGPTG DSFWTWRELM YRFVDRIGPD EVQAVAALAY MEMLETGSTR VGEFHYLHHD KDGSPYADPA EMAARIAAAA DETGIGLTLL PVFYAHAGFG GVAPGQGQRR FIHDIDGYGR LIEASRAAVA DLPDAVVGIA PHSLRAVTGE ELTAILPLAG IGAGAGPVHI HIAEQTQEVD DCLAATGARP VRWLMNNAPV DKRWCLVHAT HLNAMETERL AKSGAVAGLC PITEANLGDG VFPAHDYLAA GGAFGIGSDS NVLIDAAEEL RTLEYAQRLT RRARSVLAKG AGGSTGGELF RSAVAGGAQA LGVATGLRRG RPADFVTLDR THPAMIGRDG DALLDSWVFA GRHGAIDGVW RHGRQVVTGG RHNGREAILA RYRTALGSVL A
|
| |