Gene Cwoe_4736 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCwoe_4736 
Symbol 
ID8735202 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameConexibacter woesei DSM 14684 
KingdomBacteria 
Replicon accessionNC_013739 
Strand
Start bp5054951 
End bp5056072 
Gene Length1122 bp 
Protein Length373 aa 
Translation table11 
GC content73% 
IMG OID646505365 
ProductNitrilase/cyanide hydratase and apolipoprotein N- acyltransferase 
Protein accessionYP_003396524 
Protein GI284046184 
COG category[R] General function prediction only 
COG ID[COG0388] Predicted amidohydrolase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.923409 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGCGCGA CGGAGCCCGT CGCGCTCGGC GGCGCCTACC GTGCCGTCGT CTGCCAGATG 
GAGACGGAGA ACGCCGTCGA CCGCGCGGCG CTGGAGCGCA ACCTCGACCA CCTCTGCGAG
ATGGTCGACT GGGCGGTCGA GGGCGCGATG GCGATGGGCG CGCCGGTCGG GCTCGTCGTC
GCGCCGGAGC TGTCGATCCA CGGCGCGGCC GGCTACAGCT GGTCCGAGCA GCGCAGGCTC
GCGTGCGACA TCCCTGGTCC CGAGACCGAG CGGCTGGCGG AGAAGGCGCG CGAGTACGGC
ATCCACCTCG TGCCCGGCTC GCTGCTCGAA CGCGACCCGG ACAGCGACGC GATCTTCAAC
ACGCTCGTGC TGTTCGGTCC CGACGGCGAG CTGCTGTGGC GCTACCGCAA GCTCAACGTC
TGGCACCCGC TGGAGCCGAC GCAGTCGCCG GTCGACCTGC TCGACCACGG CTACGACGTC
GAGCGCTATC CGCTGCTGCC GGTCGCGCGG ACCACGATCG GCAATGTCGG CGGGCTGGTC
TGTTACGACG CGCTCTTCCC CGAGGTCACG CGCCAGCTCG CCTACGACGG CGCCGAGGTG
CTGGTGCGCG CCTCGGCGTT CATGGACCCG TGGGGCAGCG GCCCGGGCGG CGCGTCACTC
GCGACCGACC GCGTCCGCGC GCTGGAGTCG ATGGCGTACA TGGTCAGCTG CCACCAGGGC
GCGTCGCTGC GCAGCAGCCC GCCGTACTCG TGGACGGCGC CAAGCGCGGT GATCGACTTC
GAGGGCCGCG TGCTGGCGGA GGCGCAGGTC GGCGAGCGGA TCGTCCACGC GCGGCTCGAC
GTCGCGAGCC TGCGTGAGCA CCGCCGCTCG ACGCTCGCGT TCAACGTGCT CGCGCAGGGC
CGGCACGAGG CGTACGACTA CCTCGACCGC TCGCCCTCGC CTCCGCGGCC GGAGCTGGCG
ACGGCGAACG ACCTCCACGT GCGCGACTAC GAGCGCGCCG CGACCGCCGG CCATGAGGCG
TTCTGGAGCG CGTACTACGG CGAGCCGTGC AGGTTCCCGA CGCTGTCGGC GCCGTTCTGG
CGCGCGCAGC GGGAGCGGGC CGAGCGCGAG CGGGCGCGAT GA
 
Protein sequence
MSATEPVALG GAYRAVVCQM ETENAVDRAA LERNLDHLCE MVDWAVEGAM AMGAPVGLVV 
APELSIHGAA GYSWSEQRRL ACDIPGPETE RLAEKAREYG IHLVPGSLLE RDPDSDAIFN
TLVLFGPDGE LLWRYRKLNV WHPLEPTQSP VDLLDHGYDV ERYPLLPVAR TTIGNVGGLV
CYDALFPEVT RQLAYDGAEV LVRASAFMDP WGSGPGGASL ATDRVRALES MAYMVSCHQG
ASLRSSPPYS WTAPSAVIDF EGRVLAEAQV GERIVHARLD VASLREHRRS TLAFNVLAQG
RHEAYDYLDR SPSPPRPELA TANDLHVRDY ERAATAGHEA FWSAYYGEPC RFPTLSAPFW
RAQRERAERE RAR