Gene Cwoe_5704 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCwoe_5704 
Symbol 
ID8736180 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameConexibacter woesei DSM 14684 
KingdomBacteria 
Replicon accessionNC_013739 
Strand
Start bp6105922 
End bp6107232 
Gene Length1311 bp 
Protein Length436 aa 
Translation table11 
GC content75% 
IMG OID646506331 
ProductCytosine deaminase 
Protein accessionYP_003397480 
Protein GI284047140 
COG category[F] Nucleotide transport and metabolism
[R] General function prediction only 
COG ID[COG0402] Cytosine deaminase and related metal-dependent hydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.561691 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.731459 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTGAGCG TGTCTGCTGA TGTGACCCCG GCCGATCTGT CGATCGAGGA CGCGCGCGTG 
CTCGGCCGCG AGGGGCGCTT CGACGTCGCG ATCGCCGATG GCGTGATCCG CTCGGTGACG
CCGCGCGACG GCGGCGGTGC GGCGGGAGCG GGCGCGGCGG GTGCGGCTCC GGTCGTTCCG
GCTCCCGCGA TCGAGCGGAT CGACGCGCGC GGCGGGCTCG TCTCGCCGTC GTTCGTCGAG
CCGCACTACC ACCCCGACAA GGCGTTCAGC CGCCGGGGGC GCGTCGAGCA GCCGGACGGC
TTCGAGCTGG CGCGCGAGAT CAAGGCCGGC TTCACCGAGG CGGGCGTCGA GGCGCGCGCG
ACCGAGGCGT TCCGGCTCGC GGTCTCCCAG GGCGTTGGCA GCATGCGCGC GAACGTCGAC
GTCGACTCGA TCGCCGGGCT GACGAACGTC CGCGGCGTGC TCGCCGCGCG CGAGCGCGTG
CGCGACCTGA TCGACGTGCA GGTCGTCGCC TTCCCGCAGG AAGGCCTGCT GCGCGACGCG
CCGGCGCAGG AGCTGCTGGT GCAGGCGATG GCCGAGGGCG CCGACGTCGT CGGCGGCTGG
CCGAACGTCG AGGACGGCGA GGCGGCGCAG CTCGCCCACC TCGACTTCGT CTTCGATCTC
GCCGAGCGCT TCGACGCCGA CCTCGACGTG CACGTCGACT GCTACTGCGA CCCGGCGGAG
CGGATGCTGG AGCCGCTGGC GGAGCGCACG CTCGCGCGCG GCTTCGAGGG CCGCGTGCTG
GCGAGCCACT GCTGCGGCCT GGAGGTCTAC CCCGACGACG ACGCGCGGCG CGTGATCGGC
CGCGTCGCCG CGGCGCAGAT CCACGTCTGC GTGCAGCCGG CGAACACGTC GGCGCAGTAC
GGCCCGCGCG GGCTCTCGCG CACGCGCGAG CTGCTGGCGG CCGGCGTCCC GGTCAGCGCC
GGCAGCGACA ACATGTTCGA CGGCTGGTAC CTGCTCGGCA ACCTCGACCC GCTCGACCGC
GCCGTGCTGG CCTACCACGG CGCCGGGCTC GGCGGCCGCT ACACGCACCT CCCGACCGAG
CTGCTGTGGG AGCTGGTGAC CGACCGCGCC GCCGCCGCGA TCGGCACGAC CCCCGGTCGC
GTCGAGGCCG GGGCGCCCGC CGACCTCGTC GTCTTCGACG CGCCCGACGT CGAGCTGGCG
CTGGCGGCGC TGCCCGGCAA GCGCACGACG ATCAAGCGCG GGCGCGTCGT CGCCGCGCGC
GAGAGCGCGG TCTGGTCGGT GGGCGGGAGG TGCGCCGCGT GCGCACCGTG A
 
Protein sequence
MVSVSADVTP ADLSIEDARV LGREGRFDVA IADGVIRSVT PRDGGGAAGA GAAGAAPVVP 
APAIERIDAR GGLVSPSFVE PHYHPDKAFS RRGRVEQPDG FELAREIKAG FTEAGVEARA
TEAFRLAVSQ GVGSMRANVD VDSIAGLTNV RGVLAARERV RDLIDVQVVA FPQEGLLRDA
PAQELLVQAM AEGADVVGGW PNVEDGEAAQ LAHLDFVFDL AERFDADLDV HVDCYCDPAE
RMLEPLAERT LARGFEGRVL ASHCCGLEVY PDDDARRVIG RVAAAQIHVC VQPANTSAQY
GPRGLSRTRE LLAAGVPVSA GSDNMFDGWY LLGNLDPLDR AVLAYHGAGL GGRYTHLPTE
LLWELVTDRA AAAIGTTPGR VEAGAPADLV VFDAPDVELA LAALPGKRTT IKRGRVVAAR
ESAVWSVGGR CAACAP