Gene Cwoe_4067 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCwoe_4067 
Symbol 
ID8734529 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameConexibacter woesei DSM 14684 
KingdomBacteria 
Replicon accessionNC_013739 
Strand
Start bp4317043 
End bp4318428 
Gene Length1386 bp 
Protein Length461 aa 
Translation table11 
GC content73% 
IMG OID646504694 
Productamidohydrolase 
Protein accessionYP_003395857 
Protein GI284045517 
COG category[F] Nucleotide transport and metabolism
[R] General function prediction only 
COG ID[COG0402] Cytosine deaminase and related metal-dependent hydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.737174 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.275033 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGCCACG TGGTGATCCG CGGGGCGACG GTGCTCAGCC AGGACGACGA GATCGGCGAG 
CTGGTCGGCG ACGTCGAGGT GCGTGACGGT GAGATCGTCG CGGTCGGCGC CGGGCTCGCG
ACGGCTGGAG CGGAGGAGAT CGATGCGGCT GGGATGGTCG CGATCCCGGG CTTCGTCGAC
ACGCACTGGC ACCTGTGGGG CACGCTGCTG CGCGGCGTGA TCGGCGACGG CAGAGCTGAG
GGCTGGTTCG CGCGCAAGGG CAAGCTGGCG CCGCACATGT CGCCGGAGGA CATGTTCAAC
GGCGTGATGC TGGGCGCGGC CGACGGTCTC GCGACCGGCG TGACGACGAT CCACGACTGG
GCGCACAACG TGCTCAGCCC CGATCACGCG GACGCGAACC TGCGCGCGCA CAAGGAGCTG
GGCACGCGCG TCCACTTCAC GTACGGTGCG CCGAGCGCGC ATCCCTCGCT TTCGCGCGAG
GAGATGGCGG CGCAGGGCGC GCTGCCGCCC GACCAGGCGA TGGACGTCGC CGATGCCGTG
CGCGTGCGCG AGCAGTGGAG CGGCGAGTTC GGCGGGCTGT TGAGCGTCGG CGTCAACGTG
CGCGGGCCGG CACGGTCCGA CGAGGCGGTC TACCGCGAGG AGTGGCGCCA GGCGCGCGAC
GCGGGGTTGC CGATCGCGAT GCACTGCGCC GGCACCGAGG CTGAGGTCAG ACGGATCCGG
CAGGTCAAGC TGCTGGAGGC CGACGGGCTG CTCGGCCCCG ACGTGCTGCT GGCGCACTGC
CTCTTCCTCG ACGAGGAGGA GCGCGGGCTG CTCGCGGCGC GGGGCGTGCC GGTCACGTTC
AGCCCGCTGA GCGAGCTGCG GCTGGCGATG GGCTTCCCGA TCCTCGCGCA GCTGCGCGAG
GACGGCGTCC AGGTCAGCCT CTCGCTCGAC ACGACGGCGA TCGCCGGCGC GGCGGATCCG
TTCGCGGCGA TGCGCGTCGC GTTGGGGATC TCCAACTCCG CGCGCGGTGA CGCGACGGAG
GTGACGCCGC GCGACATGCT GCGCGTCGCG ACGCTGGCCG GTGCTGAGGC GCTCGGCCTC
GGCGACCGCG TCGGTTCGAT CACGCCCGGC AAGCGGGCGG ACATCGTGCT CGTCAGAACG
CGCACGCTGA ACGCGGCGCC GGTCGTCGAC CCCGCGGTGG CGGTCGTCCA CTCTGCGCTG
CCGAGCGACG TCGACACGGT GCTGGTCGAC GGGCGCGTCG TGAAGCGCGA CGGGCGGTTG
ACGACGGTCG AGCCGGAGGC CGTGATCGGC CGGGCGGAGG TGTCGCTGCG CGGCGTCTGC
GGCCGCGCCG GGTTCGAGAT GACGGGGCTT CAGAGCGAAG GGAGCGCAGC ATGGCGGTCG
GCGTGA
 
Protein sequence
MGHVVIRGAT VLSQDDEIGE LVGDVEVRDG EIVAVGAGLA TAGAEEIDAA GMVAIPGFVD 
THWHLWGTLL RGVIGDGRAE GWFARKGKLA PHMSPEDMFN GVMLGAADGL ATGVTTIHDW
AHNVLSPDHA DANLRAHKEL GTRVHFTYGA PSAHPSLSRE EMAAQGALPP DQAMDVADAV
RVREQWSGEF GGLLSVGVNV RGPARSDEAV YREEWRQARD AGLPIAMHCA GTEAEVRRIR
QVKLLEADGL LGPDVLLAHC LFLDEEERGL LAARGVPVTF SPLSELRLAM GFPILAQLRE
DGVQVSLSLD TTAIAGAADP FAAMRVALGI SNSARGDATE VTPRDMLRVA TLAGAEALGL
GDRVGSITPG KRADIVLVRT RTLNAAPVVD PAVAVVHSAL PSDVDTVLVD GRVVKRDGRL
TTVEPEAVIG RAEVSLRGVC GRAGFEMTGL QSEGSAAWRS A