Gene Cwoe_2920 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCwoe_2920 
Symbol 
ID8733365 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameConexibacter woesei DSM 14684 
KingdomBacteria 
Replicon accessionNC_013739 
Strand
Start bp3117977 
End bp3119197 
Gene Length1221 bp 
Protein Length406 aa 
Translation table11 
GC content70% 
IMG OID646503534 
Productformaldehyde dehydrogenase, glutathione- independent 
Protein accessionYP_003394714 
Protein GI284044374 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG1063] Threonine dehydrogenase and related Zn-dependent dehydrogenases 
TIGRFAM ID[TIGR02819] formaldehyde dehydrogenase, glutathione-independent 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.754967 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.420653 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCGACA CCAACAGAGG CGTCGTCTAC ATGGGTCCCG GCACGGTCGA GGTCCAGACG 
ACCGACTACC CGAGCTTCGT GCTCAGAGAC GGCCCCGGAG TCCATCCAGA CAGCGTCGGG
CGCGAGTGCA ACCACGGCGT GATCCTGCGG ATCGTCTCGA CCAACATCTG CGGCAGCGAC
CAGCACATGG TGCGCGGCCG CACGACGGCG CCCGAGGGGC TGATCCTCGG GCACGAGATC
ACCGGCGAGG TGATCGAGAG AGGCCGCGAC GTCGAGTACA TCGACGAGGG CGACCTCGTA
TCCGTCCCGT TCAACATCGC CTGCGGGCGC TGCCGCAACT GCAAGGAGCG CAAGACCGGC
ATCTGCCTCA ACACCAACCC GGCGCGGCCG GGGGCAGCGT ACGGCTACGT CGACATGGGC
GGCTGGCCCG GCGGGCAGGC GCGCTACGTG ATGGTCCCCT ATGCCGACTT CAACTGCCTC
AAGTTCCGTG ACAAGGAGCA GGCGCTGGCG AAGATCCTCG ACCTCACGAT GCTGTCGGAC
ATCTTCCCGA CGGGGTACCA CGGCTGCGTC ACGGCGGGCG TGACGACCGG CAGCACGGTC
TACATCGCGG GCGGCGGCCC GGTCGGGTTG GCGGCGGCGC ACGGCGCGCA GCTGCTCGGT
GCCGCGGTCG TGATCGTCGG CGACCTGATC CCGGAGCGGC TGGCGCAGGC GAAGAGCTTC
GGCTGCGAGA CGATCGACGT CTCCAGAGGA GACCCCGGCG AGCAGATCGA GCAGCTGCTC
GGCGTGCCGG AGGTCGACTG CGGCGTCGAC GCCGTCGGCT TCGAGGCGCG CGGCCACGGC
GAGCACGCGA GCGAGGAGCT GCCCGCGACG GTGCTGAACT CGCTGATGGG ACTGACGCGC
GCGGGCGGCG CGCTCGGCAT CCCGGGCCTC TACGTGACCG GCGACCCGGG CGCGCACACG
GACGCGGCGA AGGAGGGCTC GCTGTCGATC CGGATCGGGC TCGGCTGGGC GAAGTCGCAC
GTCTTCACGA CCGGCCAGTG CCCGGTGATG AGATACAACC GCGAGCTGAT GGAGGCGATC
CTCGGCGACC GCTGCCAGAT CGCCAGAGCG GTCAACGCGA CGGTGATCAC GCTCGACGAC
GCGCCGCAGG GCTACAGAGA CTTCGACAGA GGAGCGGCGA AGAAGTTCGT CCTCGACCCG
AACGGGCTGA TCCCGGCCTA G
 
Protein sequence
MADTNRGVVY MGPGTVEVQT TDYPSFVLRD GPGVHPDSVG RECNHGVILR IVSTNICGSD 
QHMVRGRTTA PEGLILGHEI TGEVIERGRD VEYIDEGDLV SVPFNIACGR CRNCKERKTG
ICLNTNPARP GAAYGYVDMG GWPGGQARYV MVPYADFNCL KFRDKEQALA KILDLTMLSD
IFPTGYHGCV TAGVTTGSTV YIAGGGPVGL AAAHGAQLLG AAVVIVGDLI PERLAQAKSF
GCETIDVSRG DPGEQIEQLL GVPEVDCGVD AVGFEARGHG EHASEELPAT VLNSLMGLTR
AGGALGIPGL YVTGDPGAHT DAAKEGSLSI RIGLGWAKSH VFTTGQCPVM RYNRELMEAI
LGDRCQIARA VNATVITLDD APQGYRDFDR GAAKKFVLDP NGLIPA