Gene Caul_1299 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_1299 
Symbol 
ID5898754 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp1372200 
End bp1373675 
Gene Length1476 bp 
Protein Length491 aa 
Translation table11 
GC content72% 
IMG OID641561784 
ProductN-formimino-L-glutamate deiminase 
Protein accessionYP_001682927 
Protein GI167645264 
COG category[F] Nucleotide transport and metabolism
[R] General function prediction only 
COG ID[COG0402] Cytosine deaminase and related metal-dependent hydrolases 
TIGRFAM ID[TIGR02022] formiminoglutamate deiminase 


Plasmid Coverage information

Num covering plasmid clones34 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.450827 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACCGACG CCGAGTTTTC GCCAGGATCC GAACCCGCCG ACGAGCCGAT CTACACGCCG 
CCTGGCCAGA GCCCGGCCGC GCCGCGAGCC GCCCCTCAAT CGGACGATCC GCGGACCCTT
TGGTTCGAGC AGGCCCTGCT GGCCGACGGC TGGGCCAGGG ACGTGCGGTT GACGCTGAGC
GACGGCCTGA TCGCCCGGAT CGACACCGAC GTGGCCCGCC AGTCCGGCGA GGCGGCGCAC
GGCCCCTGCC TTCCCGGCCT GCCCAACCTG CACAGCCACG CCTTCCAGCG GGCGATGGCC
GGCCTGACCG AGGTGCGGGG ACCCACGGGA GACAGCTTCT GGACCTGGCG CGAGCTGATG
TACCGCTTCG TCGACCGGAT CGGCCCCGAC GAGGTCCAGG CTGTCGCTGC TCTAGCTTAC
ATGGAGATGC TGGAAACCGG TTCGACCCGG GTGGGCGAGT TCCACTACCT GCACCACGAC
AAGGACGGAT CGCCCTATGC CGACCCGGCC GAGATGGCGG CGCGGATCGC CGCGGCGGCC
GACGAGACCG GCATCGGCCT GACCCTGCTG CCGGTGTTCT ACGCCCATGC CGGCTTTGGC
GGCGTCGCGC CGGGGCAGGG CCAGCGGCGG TTCATCCATG ACATCGACGG CTACGGCCGG
CTGATCGAGG CCAGCCGCGC GGCGGTGGCG GACTTGCCGG ACGCGGTGGT CGGCATCGCG
CCGCACAGCC TGCGGGCGGT GACCGGCGAG GAGCTGACGG CGATCCTGCC GCTTGCCGGG
ATCGGGGCGG GCGCAGGTCC CGTCCACATC CACATCGCCG AGCAGACCCA GGAGGTCGAC
GACTGCCTGG CCGCCACCGG CGCCCGGCCG GTGCGCTGGC TAATGAACAA CGCGCCGGTC
GACAAGCGCT GGTGTCTGGT CCACGCCACC CATCTCAACG CCATGGAGAC CGAGCGCCTG
GCCAAGAGCG GCGCGGTCGC CGGCCTGTGC CCGATCACCG AGGCCAATCT CGGCGACGGC
GTCTTCCCGG CCCATGATTA TCTGGCGGCC GGCGGAGCGT TCGGCATCGG CTCGGACTCC
AACGTGCTGA TCGACGCGGC CGAGGAACTT CGGACGCTGG AATACGCCCA GCGCCTGACG
CGCCGGGCTC GAAGCGTGCT GGCCAAGGGG GCCGGCGGCT CGACCGGCGG CGAGCTGTTT
CGGTCCGCCG TCGCCGGCGG CGCCCAGGCC CTCGGCGTGG CGACAGGCCT GCGACGCGGG
CGCCCGGCCG ATTTCGTGAC CCTGGATCGC ACCCATCCGG CGATGATCGG CCGCGATGGC
GACGCCTTGC TGGACAGTTG GGTGTTCGCC GGGCGACACG GGGCCATCGA CGGAGTCTGG
CGCCATGGTC GCCAGGTCGT CACCGGCGGC CGTCACAATG GGCGCGAGGC GATTCTCGCC
CGCTATCGGA CGGCCTTGGG GAGCGTGTTG GCTTGA
 
Protein sequence
MTDAEFSPGS EPADEPIYTP PGQSPAAPRA APQSDDPRTL WFEQALLADG WARDVRLTLS 
DGLIARIDTD VARQSGEAAH GPCLPGLPNL HSHAFQRAMA GLTEVRGPTG DSFWTWRELM
YRFVDRIGPD EVQAVAALAY MEMLETGSTR VGEFHYLHHD KDGSPYADPA EMAARIAAAA
DETGIGLTLL PVFYAHAGFG GVAPGQGQRR FIHDIDGYGR LIEASRAAVA DLPDAVVGIA
PHSLRAVTGE ELTAILPLAG IGAGAGPVHI HIAEQTQEVD DCLAATGARP VRWLMNNAPV
DKRWCLVHAT HLNAMETERL AKSGAVAGLC PITEANLGDG VFPAHDYLAA GGAFGIGSDS
NVLIDAAEEL RTLEYAQRLT RRARSVLAKG AGGSTGGELF RSAVAGGAQA LGVATGLRRG
RPADFVTLDR THPAMIGRDG DALLDSWVFA GRHGAIDGVW RHGRQVVTGG RHNGREAILA
RYRTALGSVL A