Gene Cwoe_0344 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCwoe_0344 
Symbol 
ID8730772 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameConexibacter woesei DSM 14684 
KingdomBacteria 
Replicon accessionNC_013739 
Strand
Start bp352970 
End bp354211 
Gene Length1242 bp 
Protein Length413 aa 
Translation table11 
GC content76% 
IMG OID646500958 
ProductN-isopropylammelide isopropylaminohydrolase 
Protein accessionYP_003392155 
Protein GI284041815 
COG category[F] Nucleotide transport and metabolism
[R] General function prediction only 
COG ID[COG0402] Cytosine deaminase and related metal-dependent hydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0274445 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.270951 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGACGC CCGGCGTCAC GACGGACGAG CTGCTGCTCG CGAACGCGAC CCTGCCCGAC 
GGCCGCCGCG CCTCGGTGCG TGTCGCCGGC GGGCGGATCG CCGCGATCGA GCTGGCGGAC
GAGGGCGGGC GCGGCGACGC GCCGGCCGCG CCGGCCTCCG TCCCGCCCGC TGCCGCCACA
CGCATCGACC TCGCGGGCGC GCTGCTCGCG CCGGCGTTCG TCGACGGCCA CATCCACCTC
GACAAGGTCT TCATCGGCGT GCCGTGGCGC CCGCACGTGC CGCAGGACTC GCTCGCCGGG
CGGATCGCGG CCGAGCGCGC CGCGCTCGCC GAGATCGACG CCGAGGTCCC GATCGCCGAG
CGCGCCGTCG CACTCGTGAG ACGGGCGGCC GCGTACGGCA CCGGCCACCT GCGCACGCAT
GTCGACGTCG ACACCCGCCA CGGGCTCACG CGGCTGGAGG CGGTACTGGA GGCGCGCGAG
CGCTGCCGCG AGCTGGCCGG CATCCAGATC GTCGCGTTCC CGCAGTCGGG CGTCCTGTCG
GACCCCGGCA CCGCCGAGCT GCTCGACGCC GCCGTCCGCG CGGGCGCCGA CGTCGTCGGC
GGACTCGACC CAGCCGGCTT CGACGGCGAC GTCGAGGGCC AACTGGGCGT CGTCTTCGAC
GTCGCCGAGC GGCACGCCGC GCGCGTCGAC GTCCACCTCC ATGACGCCGG CACGCTCGGC
GCGTTCGAGC TGCGCCGGAT CGCCCACCAC ACCGAGCGGC GCGGACTCCA GGGGCGCGTC
GTCGTCAGCC ACGCGTACTG CCTCGGCGAG ATCGACGCCG ACGACTTCGG CGCGACCGCC
GAGGCGCTCG CGCGCGCAGG CGTCGCGATC CTCACCAACG CGCCCGGCGG CTCGGCGATG
CCGCCGGTGC TGCGGCTGCG CGCGGCCGGT GTCGAGGTGC TCGCCGGCAC CGACAACATC
CGCGACGCCT GGTGGCCGTA CGGCACCGGC GACATGCTCG AACGCGCGTA CATGGTCGGC
TACCGGCAGA GCCTCTTCAC CGACGAGGAG CTGGCGGTCG CGTTCGAGCT GGCGACCGCC
GCCGGCGCCC GCACGCTCGG CGTCGAGGGC TACGGCCTGG AGGTTGGCGC GCGCGCCGAC
CTCGTCGCGA TCGACGCGCC GTCGCTGCCG GAGGCGGTCG CCGCCCCGCC GCGGCGACTG
CTCGTGCTCC ACGACGGCCG GATCGTCGCC GACACGCGCT GA
 
Protein sequence
MTTPGVTTDE LLLANATLPD GRRASVRVAG GRIAAIELAD EGGRGDAPAA PASVPPAAAT 
RIDLAGALLA PAFVDGHIHL DKVFIGVPWR PHVPQDSLAG RIAAERAALA EIDAEVPIAE
RAVALVRRAA AYGTGHLRTH VDVDTRHGLT RLEAVLEARE RCRELAGIQI VAFPQSGVLS
DPGTAELLDA AVRAGADVVG GLDPAGFDGD VEGQLGVVFD VAERHAARVD VHLHDAGTLG
AFELRRIAHH TERRGLQGRV VVSHAYCLGE IDADDFGATA EALARAGVAI LTNAPGGSAM
PPVLRLRAAG VEVLAGTDNI RDAWWPYGTG DMLERAYMVG YRQSLFTDEE LAVAFELATA
AGARTLGVEG YGLEVGARAD LVAIDAPSLP EAVAAPPRRL LVLHDGRIVA DTR