Gene Caul_3717 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_3717 
Symbol 
ID5901173 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp4013466 
End bp4014755 
Gene Length1290 bp 
Protein Length429 aa 
Translation table11 
GC content69% 
IMG OID641564228 
Productguanine deaminase 
Protein accessionYP_001685342 
Protein GI167647679 
COG category[F] Nucleotide transport and metabolism
[R] General function prediction only 
COG ID[COG0402] Cytosine deaminase and related metal-dependent hydrolases 
TIGRFAM ID[TIGR02967] guanine deaminase 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGCAAGCCT ATAGGGCGTC GATCCTCCAG CTGGAGGACG ACCCGACCAC CACGGACGGG 
GAGGCGCACG CCTTCCACGA GGACGGCCTG CTGCTGGTCG AGGAGGGGCT CGTGGTCGCT
TGCGGCGACT ACGCCAAGCT GGCCGACGAG CTGGGCGATG TCGCGCCTGA AGATCTGCGC
GGCAAGCTGA TCATCCCCGG CCTGATCGAC ACGCACATTC ATTTTCCGCA GGTCGACGTG
ATCGCCGCCC ATGGCGAGCA ACTGTTAGAC TGGCTGGAGC GCCACACCTT CCCGGCCGAG
GCCGCCTTCG CCGACAGAGG CCACGCCGAG GAGACCGCCG AGTTCTTCGT CGAGGAGCTG
CTGCGCAACG GCACGACCAG CGCCCTGGTG TTCGGCTCGG TGCACAAGGT CTCGGTCGAG
GCGCTGTTCG CCGCCGCCCT GAAGCGCGAC ATGCGGGTGA TCGCCGGCAA GTCGCTGATG
GACCGCAACG CCCCGCCGGG CCTGACCGAC ACGGTCGAGG GCAGCCGGCG CGACATGGAG
AGCCTGATCG CCGACTGGCA CGGCAAGGGC CGGCTGGGCT ACGCCGTGAC CCCGCGCTTC
GCGATCAGTT GCAGCGACGA ACAGCTGGCC ATGGCCGGCG AGGTCCTGGC TGAACACCCG
ACGGTGTGGA TGCAGACCCA CCTGTCGGAG AACATCCGCG AGATCGTCGA CACCGCCAAA
CTGTTCCCGG AGGCCAAGGA CTATCTGGAC GTCTATGATC GCTTCGGTCT GGTGGGGAAA
CGCTCAGTGT TCGCCCACTG CGTCCATCTT CAGGGCGAGG CGTTCCAGCG CCTGGCCAAC
GCCGGCGCGG CGATCGCCTT CTGTCCGACC TCGAACCTGT TTCTGGGCTC TGGCCTGTTC
CCGCTGGAGA CGGCCTGCGC CCACGGGGTG AAGGTCGGGA TCGGCACGGA CGTGGGGGCG
GGGACCACCT TCTCGATCCT CCACACGCTG GGCGAGGCCT ACAAGGTCGG CCAACTGCGC
GGCGAGGCGC TGGATCCGTT CCACGCCCTG TACCTGGCCA CCCTGGGCGG GGCGCGGACC
CTGGGGCTGG AAGGCGAAAT CGGCAGCCTG GAGCACGGCA AGATCGCCGA CTTCCTGGTG
CTGGACCTAG CCGCCACGCC GCTGCTGGCG CGGCGGATGC CGGCGGCGAA GTCGCTGGAG
GACCGGCTGT TCGCGCTGAC GGTGCTGGCG GACGACCGGG TGGTCGAGCG GACCTATGTG
GCCGGGGTGG AGCGGTATCG GCGGGGGTGA
 
Protein sequence
MQAYRASILQ LEDDPTTTDG EAHAFHEDGL LLVEEGLVVA CGDYAKLADE LGDVAPEDLR 
GKLIIPGLID THIHFPQVDV IAAHGEQLLD WLERHTFPAE AAFADRGHAE ETAEFFVEEL
LRNGTTSALV FGSVHKVSVE ALFAAALKRD MRVIAGKSLM DRNAPPGLTD TVEGSRRDME
SLIADWHGKG RLGYAVTPRF AISCSDEQLA MAGEVLAEHP TVWMQTHLSE NIREIVDTAK
LFPEAKDYLD VYDRFGLVGK RSVFAHCVHL QGEAFQRLAN AGAAIAFCPT SNLFLGSGLF
PLETACAHGV KVGIGTDVGA GTTFSILHTL GEAYKVGQLR GEALDPFHAL YLATLGGART
LGLEGEIGSL EHGKIADFLV LDLAATPLLA RRMPAAKSLE DRLFALTVLA DDRVVERTYV
AGVERYRRG