Gene Cwoe_5157 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCwoe_5157 
Symbol 
ID8735623 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameConexibacter woesei DSM 14684 
KingdomBacteria 
Replicon accessionNC_013739 
Strand
Start bp5515631 
End bp5517094 
Gene Length1464 bp 
Protein Length487 aa 
Translation table11 
GC content69% 
IMG OID646505782 
Productamidohydrolase 
Protein accessionYP_003396941 
Protein GI284046601 
COG category[F] Nucleotide transport and metabolism
[R] General function prediction only 
COG ID[COG0402] Cytosine deaminase and related metal-dependent hydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGTGAGC GAGCCAGAGT TACGCGGCGC ACGTTGATAG GTGGAGCGAC AGCGGCGGCG 
GCTGTCGGAG CGGCGGCTGC GGCGCCGTCG GCCGGTGCGA GCCAGCGCCA CGGCGGCGGT
GGATGGGGAG GGCCGACGCC GCCGCGCGGC AACGAGTACG TCTTGCGGGA CGGGTTCGTG
CTCTCGATGG ATCCGGCGAT CGGAGATCTG CCGCGTGGCG ACGTCCACGT GCGGAACGGG
CAGATCGTCG CGGTCGGCGA GCGGTTGCGC GCGCATGGTG CGGTGTCGAT CGATGCGCGC
GACAAGATCG TGATGCCGGG TCTGGTCGAC ACGCATTGGC ATCTCTGGAA TGCATCGATG
CGCGCGTTCA TGGTGAATGG CGTAGCGGAC CGTGCGTACT TCAACGTCAC GAACATCCTC
GGTCCGCACT TCACTCCGAT CGACACCTAC CGCTCCACGC GGCTCGGTCT GCTGGAGGGG
GTCGCCTCCG GCATCACCAC GGTCCACGAC TGGTCCCACA ACGTGCGCGG TCCCGAGTAC
GCCGACGCCT CGCTGCGCGG GTTGCTCGAC GCCGGGGTTC GCGGCCGCTT CTCGTACGGG
TGGGCGCAGA GAGGCCCGCT CGACGTGCCG ATGGACGTGG ACGGCATCCG CCGCACCAAG
GATCGCTGGT TCTCGCGCGC GAGCACGACA CGCGGCCTGC TGCACCTGGG GATAGCGTCG
CGCAACGTCG TTCCCGGACA GAGCCCGCGC GGCTCGATCA CGATCGAGCT GGCGCGTCAG
GACTGGACCT CGGCGCGTGA GCTCGGGCTG CCGATCACGC TGCACGCGTC GCCGAGAGGT
CTCGTGACGA TGCTCGAGCA GGAGAGACTG CTCGGTCCCG ACCTGCTGCT CGTTCACCCG
ACCCTGACGA CCGAAGCCGA GAACGCGATC GTCGTCGAGC GCGGAACCGG ATGGAGCATC
TCTTCAGTCG GCGAGGCGGC TCGCGGGCCC GAGGAGCAGA TCCGCTACGC GGAGCTGGTC
GCCGCCGGGG CGAAGCTCGG CCTCTCGATC GACGCGAGCG CGGGGGACGG CGCGAACCTG
TTCACGGCGA TGCGCATGCT GCACACGATG ACGACGAACC GTCTCGGTGC CGTGCCCGGC
ATCACCTATC GCCGCGTGCT CGAGCTGGCG ACCGTCGAGG GCGCCAACAC GCTCGGACTG
GGCGACGTGG TCGGTTCGCT GACGCCGGGC AAGCGGGCAG ACGTGATCAC GATCAACCGG
CTCGATCCGA ACATGGCGCC GCCCGGCGAT CCCGCGACCC AGATCGTCGG GCTCGGTCAG
CCGCGCAACG TCGACACGGT GATGGTCGAC GGAAAGATCC TGCTGTGGCG CGGATTGCAC
GTCGGTGTCG ACGTCGAGCG CGTCGTCCGC GACGCGGGAC AGTCCGCGAC CGAGATCAGC
AGCCGAGCCG GCTGGCCCAC CTGA
 
Protein sequence
MSERARVTRR TLIGGATAAA AVGAAAAAPS AGASQRHGGG GWGGPTPPRG NEYVLRDGFV 
LSMDPAIGDL PRGDVHVRNG QIVAVGERLR AHGAVSIDAR DKIVMPGLVD THWHLWNASM
RAFMVNGVAD RAYFNVTNIL GPHFTPIDTY RSTRLGLLEG VASGITTVHD WSHNVRGPEY
ADASLRGLLD AGVRGRFSYG WAQRGPLDVP MDVDGIRRTK DRWFSRASTT RGLLHLGIAS
RNVVPGQSPR GSITIELARQ DWTSARELGL PITLHASPRG LVTMLEQERL LGPDLLLVHP
TLTTEAENAI VVERGTGWSI SSVGEAARGP EEQIRYAELV AAGAKLGLSI DASAGDGANL
FTAMRMLHTM TTNRLGAVPG ITYRRVLELA TVEGANTLGL GDVVGSLTPG KRADVITINR
LDPNMAPPGD PATQIVGLGQ PRNVDTVMVD GKILLWRGLH VGVDVERVVR DAGQSATEIS
SRAGWPT