Gene Cwoe_3952 details

Gene Information

Locus tag: Cwoe_3952
Symbol:
ID: 8734409
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Conexibacter woesei DSM 14684
Kingdom: Bacteria
Replicon accession: NC_013739
Strand:
Start bp: 4193337
End bp: 4195079
Gene Length: 1743 bp
Protein Length: 580 aa
Translation table: 11
GC content: 79%
IMG OID: 646504576
Product: Aldehyde Dehydrogenase
Protein accession: YP_003395744
Protein GI: 284045404
COG category: [C] Energy production and conversion
COG ID: [COG1012] NAD-dependent aldehyde dehydrogenases
TIGRFAM ID:
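
The coordinate and length fields above are related by simple arithmetic. The sketch below shows that relationship, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single untranslated stop codon. The function and constant names are illustrative and are not part of the source database.

```python
# Values copied from the Gene Information record above.
START_BP = 4193337
END_BP = 4195079
STOP_CODON_BP = 3  # assumption: one terminal stop codon that is not translated


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in a single stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return (gene_len_bp - STOP_CODON_BP) // 3


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1743
print(protein_length(length_bp))  # 580
```

Both results agree with the Gene Length and Protein Length fields of this record.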


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCGGGG CGCGCGATGC GGCGGCGCGC GAGCGCGGGG GCGGCGGCGG CGCGGGCGAT 
GGCGCCGCGC GTGAGCGCGG GGGCGGGGGC GGCGGCGCGG GTGACGGCGC CGCGCGTGAG
CACGGGAGCC TCGTCGCCGG GCGCGAGACG GCGACCGCGG CAGAGACGTT CACGGCGTTC
GACCCGCGCA CGGGGCGTCC GTCGCCGCTG CGCTTCCGCG AGGCGTCGGC GGCCGACGTC
GCGGCGGCCG CGGAGGCGGC GGCGGTCGCC TTCCGCGCGG TGCGCGAGTG GCCGCCCGAG
CGCTTCGGGC GGCTGCTGCG CGGTGTCGCG ACGGAGCTGG AGCGGGCCGA GCGCCCGCTG
CTGGCGACGG CCGACGGGGA GACGGCGCTC GGCACGGTGC GGCTGCGCGG CGAGCTGGCG
CGCACGACCG GGCAGCTGCG CGCGTTCGCG GCGCACGTCG AGAGCGGCGC GCACCTCGAC
GTGATCGTCG CGCCGCCGCG GCCCGACGCC GAGCCGCCGC AGCCGGACCT GCGGCGGATG
CTCGTCGCGC TGGGCCCGGT CGCCGTCTTC GAGGCGAGCA ACTTCCCGTT CGCGTTCGGC
GTCGCCGGGG GCGACACGGC CGCGGCGCTC GCGGCCGGCT GCCCGGTCGT CGTCAAGGCG
CACGAGGCGC ATCCCGCGAC CGCGCACCTG TGCGCCGCCG CCGTGACGGC CGCGGTCGCC
GCCGCGGGCG CGCCGCCCGG CCTCTTCTCG CTGCTGCACG GTCGTTCGCA CGCGGTCGGC
CGCGCGCTGG TGGAGGCGCC GGAGATCGCG GCGGTCGGCT TCACCGGCTC TAGCGCCGGC
GGCCGCGCGC TGCTCGACGC CGCCGCACGG CGGCCACAGC CGATCCCCGT CTACGCGGAG
ATGGGCAGCG TCAACCCGCT GCTGGTGACC GCGGCGGCGC TGGCCGAGCG CGGCACGGCG
ATCGCCGACG GGCTGGCCGA CTCGATCGCG CTCGGCGCCG GGCAGTTCTG CACCAGCCCG
GGGCTCGTGC TCGTCCCGCA CGGCGCCGAC GGCGACGCGT TCGCCGCGCG GCTGGCGACC
GCGCTCGACG GCCGCGCGGT GGGGGCGCTG CTGACGGCGG GGATGCGCGA TCGGCTCGTC
CGCGACGTCG CCGCGCTGTC GGCGCGCGAG GACGTGACGC TGCTGGCCGG GGACCGCGGG
GCGGACGGGG GCGGCGCGGA CGGCGCCGCC CCCGGTGGCT TCCGCTTCAC GCCGGCGCTG
CTGAGCGCCG ACGCGGAGGC GCTCGTGCGC GATCCGGCGC TGGCGGATGA GCACTTCGGG
CCGGTCGCGC TCGTGCTGCG CTACGACGGC GGGTCCGGCG CGGCGCGCGC GCTCGCCGCG
CTGCCGGGGC AGCTGACGGT CACGCTGCAC GCGGGCGCTC AGGAGCTCGC GGAGCCGGAG
CGCAGCGGCC TGGCGGCGCT GCAGCGGCTG GCGGTCGAGC GCGCCGGGCG GATCGTCTGG
AACGGCTACC CGACCGGCGT CGCCGTCGTC GCGGCGATGC AGCACGGCGG CCCCTACCCG
GCGGCGAGCA CGTCGCTGCA CACCTCGGTC GGCTTGACCG CGATCCGGCG CTTCCAGCGC
CCGGTCGTCT TCCAGGACGC GCCGGCCGCG CTCCTGCCGC CGGCCCTGCG CAACGCCGCC
ACCAATTACA CAAACTTAAC GATAGGAGGC CTTGACACCG CGATCCGCCA ACGAGGAGAA
TGA
 
Protein sequence
MSGARDAAAR ERGGGGGAGD GAARERGGGG GGAGDGAARE HGSLVAGRET ATAAETFTAF 
DPRTGRPSPL RFREASAADV AAAAEAAAVA FRAVREWPPE RFGRLLRGVA TELERAERPL
LATADGETAL GTVRLRGELA RTTGQLRAFA AHVESGAHLD VIVAPPRPDA EPPQPDLRRM
LVALGPVAVF EASNFPFAFG VAGGDTAAAL AAGCPVVVKA HEAHPATAHL CAAAVTAAVA
AAGAPPGLFS LLHGRSHAVG RALVEAPEIA AVGFTGSSAG GRALLDAAAR RPQPIPVYAE
MGSVNPLLVT AAALAERGTA IADGLADSIA LGAGQFCTSP GLVLVPHGAD GDAFAARLAT
ALDGRAVGAL LTAGMRDRLV RDVAALSARE DVTLLAGDRG ADGGGADGAA PGGFRFTPAL
LSADAEALVR DPALADEHFG PVALVLRYDG GSGAARALAA LPGQLTVTLH AGAQELAEPE
RSGLAALQRL AVERAGRIVW NGYPTGVAVV AAMQHGGPYP AASTSLHTSV GLTAIRRFQR
PVVFQDAPAA LLPPALRNAA TNYTNLTIGG LDTAIRQRGE
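
A GC content figure can be recomputed from a sequence printed in the grouped layout used above. The sketch below assumes the figure is the fraction of G and C bases over the whole gene sequence; how the source database actually derives its value is not stated here. The function name is illustrative, and only the first row of the gene sequence is used as sample input.

```python
def gc_fraction(sequence_text: str) -> float:
    """Fraction of G/C bases in a sequence, ignoring spaces and line breaks."""
    bases = "".join(sequence_text.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# First row of the gene sequence above; paste the full sequence to get the
# value for the whole gene.
first_row = "ATGAGCGGGG CGCGCGATGC GGCGGCGCGC GAGCGCGGGG GCGGCGGCGG CGCGGGCGAT"
print(f"{gc_fraction(first_row):.0%}")
```

Splitting on whitespace before counting lets the 10-base groups and line breaks be pasted in unchanged.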