Gene Cwoe_1055 details

Gene Information

Locus tag: Cwoe_1055
Symbol:
ID: 8731490
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Conexibacter woesei DSM 14684
Kingdom: Bacteria
Replicon accession: NC_013739
Strand:
Start bp: 1112970
End bp: 1114481
Gene Length: 1512 bp
Protein Length: 503 aa
Translation table: 11
GC content: 78%
IMG OID: 646501672
Product: amidohydrolase
Protein accession: YP_003392862
Protein GI: 284042522
COG category: [F] Nucleotide transport and metabolism; [R] General function prediction only
COG ID: [COG0402] Cytosine deaminase and related metal-dependent hydrolases
TIGRFAM ID:
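
The coordinate and length fields above are consistent with one another, which a little arithmetic confirms. A minimal sketch, assuming the coordinates are 1-based and inclusive and that the CDS ends with a single untranslated stop codon (the variable names are illustrative and not part of the record):

```python
# Values copied from the Gene Information fields above.
start_bp = 1112970
end_bp = 1114481

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and contributes no residue.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints "1512 503"
```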


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.292037
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGACGAGC CGGTGAGGGA CCTCGTGGTG CGCGGCTGCG ACGTGCTCGT GGAGCCGGGC 
GACCTGCGTG AGCAGGTCGA CCTGGTGCTC GCGGGCGAGC GCGTGGCGGC GGTGGGCGTG
GGCGCGGCGG GCGTGGGCGG CAGTGCGGCG GGCGCGGGGG CGCGGGAGCT GGACGGGCGC
GGGCTGCTCG CGATCCCGGG GCTCGTCAAC GCTCACACGC ACTCGCCCGA GAACTGCCTG
CGCGGCGTCG GCGAGGGGCT GGGGCTGGAG CCGTGGCTGA TGACGATGTT CGGCGCGGGC
GGCGACCTCG ACGCAGAGGC GCACGAGGTC ACGGTGCTCG CCGGCGCGGC GGAGATGCTG
CGCAACGGCA CGACCTCCGT GATCGACCAC CTCTGGATGA CGCCGCCGTC GCCCCGCGCG
CTCGACGCCG CGCTGCGCGC GTACGCGGCG AGCGGGATGC GCGCGACCGT CGCGCCGCTG
ATGGAGGACC GCGACGTCAC CGACCAGCTC GCGGCGCAGC TCGGGCTCGA CGTCTCCGCC
GGTCTCGTCA CCGCGCTGCC GGAGGCGCTC GGGACGACCG AGCTGCTCGC CGTCCTGCGC
CACGCGTTCG AGACCTGGCA CGGCGCGGAG GACGGCCGGC TGCGGATCCT CGCCGGCCCC
GGCGGCGTGC AGTGGGCGAG CGACGAGCTG CTGCTCGGCA GCGCGGAGCT GGCGGCGCGT
CACGGCGGCG GCGTCCACAT CCACCTGCTG GAGACGACCG TGCAGGCCGC CGCCTGCCGC
GCCGCGTTCG GCCGCAGCGG GCTGCAGCGG CTCGCCGACC TCGGGCTCGT CGGCCCCGGC
CTGTCGCTGC CGCACAGCGT CTGGATCGAG GCCGCCGACG TCGAGACGAT CGCGGCCGGC
GGCGCGACCG TCGTCCACAA TCCCGCTGCG AACACGCGCC TCGGCAGCGG GCGCGCGCCG
ATCGCCGCGC TCCTGCGCGC GGGCGCGCAC GTCGCGCTCG GGACCGACGG CTCCGCGTCC
TCCGACAACC AGTCGGTCTG GGACGCGATG AAGCTCGCCG CGCTGATCCA CAACGACGCC
GACGCGGACG TGTGGGTCGG GAGCGCGGAG GTGCTGCGGA TGGCGACGAC CGGCGGCGTC
CGCGTGATGG CGCGCGGTGG GGGCGGTGCC GCGGGCGCGA CCGCCGACGG CCTCGGGACG
CTCCGCCCGG GCGCACCGGC CGACTTCGCG CTGCTCGACC GCCGTGTCAG CGGCCTCGCG
GGCGCCTTCG CGCTGGAGCC GAGCCTCGTC CTGTCCGAGG ACGGGCGGGC GGTCCGCCAC
GTCTTCGTCG CCGGCCGCCA GCTCGTCGCC GACGGTCGCT GCCTGACGAT CGACGAGGCC
GACGTCAACG GCCGGCTGCG CGAGCTGGCG CAGCGGCGCG CCCGCGACGC CGACCCGCTG
CCGGCGGCCG TCGCGCGGGC CGTCGGGCAG ATGCGCGCGC TCCGTCAGGC GCTCGCCGAG
CGCGGCCCTT GA
 
Protein sequence
MDEPVRDLVV RGCDVLVEPG DLREQVDLVL AGERVAAVGV GAAGVGGSAA GAGARELDGR 
GLLAIPGLVN AHTHSPENCL RGVGEGLGLE PWLMTMFGAG GDLDAEAHEV TVLAGAAEML
RNGTTSVIDH LWMTPPSPRA LDAALRAYAA SGMRATVAPL MEDRDVTDQL AAQLGLDVSA
GLVTALPEAL GTTELLAVLR HAFETWHGAE DGRLRILAGP GGVQWASDEL LLGSAELAAR
HGGGVHIHLL ETTVQAAACR AAFGRSGLQR LADLGLVGPG LSLPHSVWIE AADVETIAAG
GATVVHNPAA NTRLGSGRAP IAALLRAGAH VALGTDGSAS SDNQSVWDAM KLAALIHNDA
DADVWVGSAE VLRMATTGGV RVMARGGGGA AGATADGLGT LRPGAPADFA LLDRRVSGLA
GAFALEPSLV LSEDGRAVRH VFVAGRQLVA DGRCLTIDEA DVNGRLRELA QRRARDADPL
PAAVARAVGQ MRALRQALAE RGP
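
The gene sequence begins with GTG while the protein begins with M, which is expected under translation table 11, where GTG can act as an initiator codon and is read as methionine. The sketch below shows how the two sequences relate and how a GC percentage can be computed. It makes several assumptions: the function names `clean`, `translate` and `gc_percent` are hypothetical helpers, only ATG, GTG and TTG are treated as start codons (a simplification of the table 11 initiator set), and the input contains only the letters A, C, G and T.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; tables 1 and 11 share these assignments.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Simplifying assumption: only these start codons are recognised.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(seq: str) -> str:
    """Strip the spaces and line breaks that group the sequence into blocks of ten."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a CDS: a recognised start codon becomes M, the first stop codon ends it."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS:
            aa = "M"
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Return the percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First and last rows of the gene sequence above.
first_row = "GTGGACGAGC CGGTGAGGGA CCTCGTGGTG CGCGGCTGCG ACGTGCTCGT GGAGCCGGGC"
last_row = "CGCGGCCCTT GA"

print(translate(first_row))  # MDEPVRDLVVRGCDVLVEPG
print(translate(last_row))   # RGP
```

The two printed fragments match the first twenty and last three residues of the protein sequence. Passing the complete gene sequence to `translate` and `gc_percent` gives values that can be compared against the protein sequence and the GC content field.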