Gene Cwoe_4253 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCwoe_4253 
Symbol 
ID8734715 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameConexibacter woesei DSM 14684 
KingdomBacteria 
Replicon accessionNC_013739 
Strand
Start bp4519135 
End bp4520571 
Gene Length1437 bp 
Protein Length478 aa 
Translation table11 
GC content72% 
IMG OID646504879 
Productamidohydrolase 
Protein accessionYP_003396042 
Protein GI284045702 
COG category[F] Nucleotide transport and metabolism
[R] General function prediction only 
COG ID[COG0402] Cytosine deaminase and related metal-dependent hydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.0761182 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.637243 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAGGCGG CGAACGCCGC CGCGACGACC GTCGTCCGAG GCGCGTGGGT CCTCAGCATG 
GATCCCGCCC GCGAGCTGCT GCGCGACGGG GCGGTCGCAT TCGACGCCGG CGGCGAGATC
CTCGCCGTCG GACCGTGGGA GGAGCTGCGC GAGCGCTTCT CCGGCGCGGA GGTCGTCGGC
GACGGCAACG GCATCGTCCT GCCTGGCTTC GTCAACTGCC ACACGCACCT CACCGAGGGC
CTGATCACGG GCATGGGCGA GACGGCGTCG TTGTGGGAGT GGTTCGACCG CGTCGTCGAG
CCGGCCGGTC GCGTCACGAC ACGCGAGGAC GTGCGCGTCG GAACGAAGCT CAAGGGCGCC
GAGATGCTGC TGTCCGGCAT CACGACCGTC AACGACATGT CCTGCCACCG CAACCTCGGC
TCGCTCGCGT CGCTCGGCGC GGCGGACGGC CTGGTGGAGA TGGGGCTGCG CGGGATCGTC
TCGTTCGGCG CCGAGAACCT CTACGACGGG GCGCCCGGCG AGGACGTCTT CATGGCCGAG
CACGAGGCGC TCGCCGACCG CCTGTCGAGC GAGCCGCTGG TCGGCTTCCG GCTCGGCATA
GGGACGATCC TCGGCGTCAG CGACGAGCTG ATGACGCGCA GCGTCGCGGC GTGCGCCGAG
CACGGCTGGG GCGTCCACAC GCACCTCGCG GAGGTGCGCG AGGAGGTCAC CGAGTCGCGT
CACAGATACG CCGGCCGCAC GACGATCGAG CACTCCGCGC ACGTCGGGCT GCTCGACCAC
GAGGTGATCG CCGGCCACTG CATCTGGTGC GGCGAGCACG ACCTCTCGCT GCTGGCGGCG
AAGGACGTGG CGGTCGCGCA CAGCCCGGTC GCGAACATGA TCCTCGCGTC GGGCGTCTGC
CCTGTGCCGC GCCTGCGGCG CGAGGGCGTC CGCGTCGGGA TCGGCACCGA CGGCGCCGCG
TCCAACGACA ACCAGGACAT GTTCGGCGCC GTGAAGGCCG CGGCGCTGCT GCAGAAGGTC
CACCACCTGC GCGCCGACGC GATCACCGCG ATCGACGTGA TGCGGATGGC GACGATCGAG
GGCGCCCGTG CGCTCGGGCT CGACCGTGAG GTCGGCTCGC TCGTCGCCGG CAAGCGCGCC
GACGTGACGC TGCTCGACGG CAACACGCCG GAGCTGGCGG CGATCCACGA TCCGTGGCAG
CAGGTCGTCT ACTGCGCGAC CTCGCGCTGC GTCAGCCACG TGTGGGTCGA CGGGGCGCCG
CGCGTCGCCG ACGGCCGGCT CGCGCAGCAG GAGCTGCGCG AGATCGTCGT GGAGGGGCGC
GAGCAGGCGA TCGACCTCGC CGAGCGCGCC GCGCTCGGCG GCGAATCGGT GCTGACGGGG
GGCGAAGGAC GTGCCTTTCC GGCCTCCGCG CGCGAGGAGA CAGTAGGCGT CGTGTAG
 
Protein sequence
MEAANAAATT VVRGAWVLSM DPARELLRDG AVAFDAGGEI LAVGPWEELR ERFSGAEVVG 
DGNGIVLPGF VNCHTHLTEG LITGMGETAS LWEWFDRVVE PAGRVTTRED VRVGTKLKGA
EMLLSGITTV NDMSCHRNLG SLASLGAADG LVEMGLRGIV SFGAENLYDG APGEDVFMAE
HEALADRLSS EPLVGFRLGI GTILGVSDEL MTRSVAACAE HGWGVHTHLA EVREEVTESR
HRYAGRTTIE HSAHVGLLDH EVIAGHCIWC GEHDLSLLAA KDVAVAHSPV ANMILASGVC
PVPRLRREGV RVGIGTDGAA SNDNQDMFGA VKAAALLQKV HHLRADAITA IDVMRMATIE
GARALGLDRE VGSLVAGKRA DVTLLDGNTP ELAAIHDPWQ QVVYCATSRC VSHVWVDGAP
RVADGRLAQQ ELREIVVEGR EQAIDLAERA ALGGESVLTG GEGRAFPASA REETVGVV