Gene Cwoe_4435 details

Gene Information

Locus tag: Cwoe_4435
Symbol:
ID: 8734897
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Conexibacter woesei DSM 14684
Kingdom: Bacteria
Replicon accession: NC_013739
Strand:
Start bp: 4729569
End bp: 4730924
Gene Length: 1356 bp
Protein Length: 451 aa
Translation table: 11
GC content: 71%
IMG OID: 646505061
Product: amidohydrolase
Protein accession: YP_003396224
Protein GI: 284045884
COG category: [F] Nucleotide transport and metabolism; [R] General function prediction only
COG ID: [COG0402] Cytosine deaminase and related metal-dependent hydrolases
TIGRFAM ID:
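
The gene length and protein length listed above are consistent with the start and end coordinates. A minimal sketch of that arithmetic follows, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Number of codons in the CDS minus one terminal stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Coordinates taken from the record above.
length = gene_length_bp(4729569, 4730924)
print(length, protein_length_aa(length))  # prints: 1356 451
```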


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGACGCGC CTCTGCCGCC TCGCGGCCGC TACCTGATAC GCGACGCCTA CGTGATCACG 
CTCGACGCCG AGCGCGGCGA CCTGCCGCAC GGTGACGTGC TCGTCGAGGA CGGTGAGATC
GTCGCCGTGG GCGAGGGCCT GTCCTCGCCC GGCAGCGAGG TGCTCGACGG GACAGGCCGG
ATTCTGGCCC CCGGCCTGGT GGACACGCAC ACGCATCTGT GGAACGGGCT TCTGCGCGGC
GTGATCGAGC AGGAGCCGGG GCGCACGTAC TTCGAGGTCA AGCGGCGCGT CGCGCGCCAC
TACGCGCCCG AGGAGTCGTA CGTCGCCGCC CGGCTCGGCC TCGCCGACGC GCTGATGTCA
GGAACGACGA CCGTCTGCGA TTGGGACCAC AACGCTCGCT CGCCCGAGGA CGTCGACGCA
AAGCTCAGAG CGCATCGAGA CTCGGGCATG CGAACGCGCT ACGCGTACGG CAACCCCGAC
AACCACCCGC GTGACGAGGT CATGGATCTC GCCGACGTCG CTCGCGTTCA GCGCGAGTGG
CTCGGCACCC GTGACGACGG GCGACTGAGC CTGTGCGTGG CCGTTCGCGG GCCGGCACGC
ACCGAGCGGG ACATCCTGTC GGCCGAGTGG GCGTTCGCGC GCGACAGAGG GCTGCCGATC
ACGCTCCATC TCGGTGGTCG CCGCGACGAC GCCGCGCGCT ACGCGGACCT GATGCAGATG
CACCGGGACG GACTCCTCGG GCCGGATGTC CAGGTCGTTC ACGCAGTCGA CGTGACGGAC
GAGGAGATCG CCATGTTGGC AGCGACCGGC ACGTCGGTCT GCCTCAGTCC GCTGACCGAG
TACGAGGGGA TGGGCATCCC TCGGATCACC GAGCTGCTCG ATGCCGGCGT GCTCGTCTCG
CTCTCGGTCG ACACCCTTGC CGCGCCGCTG AGCGCAAGCC TCCTTGCGGT CATGGGAACG
GCCCTGACGA TCGAGCGCGG ACGGCCGCGG GGCAAGGCCA TGACCGCACG CCGCATGCTC
GAGCTGGCGA CCATCGACGG TGCGCGCGAC CTCGGCCTCG ACCACCTGAT CGGCACGATC
ACCCCGGGCA AGCGGGCGGA CCTGATCCTG GTCAACCGCG CCGACCTCAA CATGGTCCCG
TGCGCGGACC CGCTGCCCGT GCTCGTGCTC TGCGCCCAGC CGGCGAACGT CGACACCGTG
CTGGTCGACG GTCGCGTGCT CAAGCGCAAC GGGGTGTTGA CCGCCGTCGA TCCGAACGAG
CTCGCGGCGG CGGCGACCCG GGCGCTCGCG GCGGTGCTCG ATCGCGCCGA CTGGCACCAG
TTCGCGTTGC CCGCACTGGC CGAAGCGGAC GCCTGA
 
Protein sequence
MDAPLPPRGR YLIRDAYVIT LDAERGDLPH GDVLVEDGEI VAVGEGLSSP GSEVLDGTGR 
ILAPGLVDTH THLWNGLLRG VIEQEPGRTY FEVKRRVARH YAPEESYVAA RLGLADALMS
GTTTVCDWDH NARSPEDVDA KLRAHRDSGM RTRYAYGNPD NHPRDEVMDL ADVARVQREW
LGTRDDGRLS LCVAVRGPAR TERDILSAEW AFARDRGLPI TLHLGGRRDD AARYADLMQM
HRDGLLGPDV QVVHAVDVTD EEIAMLAATG TSVCLSPLTE YEGMGIPRIT ELLDAGVLVS
LSVDTLAAPL SASLLAVMGT ALTIERGRPR GKAMTARRML ELATIDGARD LGLDHLIGTI
TPGKRADLIL VNRADLNMVP CADPLPVLVL CAQPANVDTV LVDGRVLKRN GVLTAVDPNE
LAAAATRALA AVLDRADWHQ FALPALAEAD A
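
Both sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any length or composition check. The sketch below is one possible way to do that and to compute a GC fraction. The helper names are hypothetical, and only the first row of the gene sequence is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (block spaces, line breaks) and upper-case."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C in an already cleaned sequence."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


# First row of the gene sequence, copied from the record above.
first_row = "ATGGACGCGC CTCTGCCGCC TCGCGGCCGC TACCTGATAC GCGACGCCTA CGTGATCACG"

dna = clean_sequence(first_row)
print(len(dna))               # six blocks of ten bases
print(dna.startswith("ATG"))  # the CDS begins with a start codon
print(f"{gc_fraction(dna):.2%}")
```

Pasting the full gene sequence into `clean_sequence` should give a string whose length can be compared with the listed gene length, and `gc_fraction` can then be compared with the listed GC content.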