Gene Cwoe_3163 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCwoe_3163 
Symbol 
ID8733611 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameConexibacter woesei DSM 14684 
KingdomBacteria 
Replicon accessionNC_013739 
Strand
Start bp3369852 
End bp3371450 
Gene Length1599 bp 
Protein Length532 aa 
Translation table11 
GC content78% 
IMG OID646503780 
ProductAldehyde Dehydrogenase 
Protein accessionYP_003394957 
Protein GI284044617 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID[TIGR01237] delta-1-pyrroline-5-carboxylate dehydrogenase, group 2, putative 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.26407 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.156772 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGCGA CGGGCTCCCC CACCCGCCTC GACGCGCCGT TCGCCAACGA GCCGGTCGTC 
GAGCTGCGCC GCGCCCCGCT GCGCGCGCGG CTCGCCGACG CGCTCGCGGC GCTCGACGCC
GAGCTGCCGC TGACGGTCCC CGTGTCGGTC GGCGGTGAGA CCCGCGAGCC CGCCGCCGAC
GCGTTCCGAT CGACCGACCC CGGCACGCCC GACCGCGTCG TCGCGGTCGC CGCGGAGGCG
AGTCCGGCGG AGGTCGGCGC GGCGGTCACG CACGCGGTCG AGGGCGGTCG CGCGTGGGCG
CTGCGACCGG CGCGGGAGCG CGCCGAGGTG CTGCTGCGCG CCGCCGCCGG GCTGCGCGCG
CAGCGCGCCC GCCTCGCGGC GCTCGCGCTG CGCGAGTGCG GCAAGCCGTG GCCGGAGGCC
GACGCCGACG TCTGCGAGGC GATCGACTTC CTCGAGTACT ACGCACGCGG TGCCGTCGCC
CTCGACGCGG GCGCCGCGCT GCTGCAGCCG CCCGGGGAGC GCAACGCCCT GCGCTACGCC
CCGCGAGGCG TCGTCGCGGT GATCGCGCCG TGGAACTTCC CGCTCGCGAT CGTCACCGGC
ATGACCGCGG CCGCGCTGGC GACCGGCAAC GCGGCGGTCG TCAAGCCCGC CGAGCAGTCG
CCGGCGTGCG CGGCCGCGGT CGTCGCGGCG CTGCACGCGG CGGGCGTCCC GCCCGACGCG
CTCGCGCTGC TGCCGGGAGC CGGCGAGGTC GGCGCCGCGC TCGTGCGCGA TCCGCGCGTC
GCGACGATCG CGTTCACCGG CTCGCTCGCG GTCGGCCTGG AGATCCAGCG GGCCGCGGCC
GAGCCGGCGC CCGGCCAGCA CGCGCTCAAG CGGCTCGTCG CCGAGCTGGG CGGGAAGAAC
TGCGTGATCG TCGATGCCGA CGCTGACCTC GACGAGGCGG TTCCGGCGAT CGTCGCCTCG
GCCTTCCACT ACGCCGGGCA GAAGTGCTCG GCGGCGGCGC GGGTGCTCGT CCACGAGGCG
GTCGCCGAGA CGCTGTGCCA GCGGCTCGCG GGCGCGGTCG CGGTGCTGTC GGTCGGCCAG
CCGCAGCTGC TGGAGACCGA GGTCGGCCCG CTGATCGAGC GGGCGGCGCA GGAGCGGATC
GAGCGCGCCT CCGCGCGCGC GGAGGCGGAG GGCCGGCTGC TCGTGCGCCA CGCCGGGCCG
CTGCCCGCCG CCGGCTGGTT CTGCGCGCCG GCCGTCGCGA CCGACCTGCC GCCCGACTCG
CCGCTGCTGC GCGACGAGCT GTTCGGCCCG CTGCTGACGG TCGAGGCCGT TCGCGACGTG
GGGCACGCCT GCGCGCTCGT CGACGCGCTG CCATACGCGC TCACGGGCGC CCTCTTCTGC
CGCGATCCGG CGACGGTCGC GGCCGTCGCG GCCATCTCGC CGGTCGGGAA CCTCTATGTC
AACCGTGCCA CGACGGGCGC GATGGTCGGG CGCCAGCCGT TCGGCGGCAA CCGTCTCTCG
GGCACCGGCG CGAAGGCCGG AGGGCCCGAC TACCTGCGGC ATTTCGCCGA GCCGCGGGTG
GTCACGGAGA ACACCGTGCG GCACGGGCTG GTCGCGTGA
 
Protein sequence
MSATGSPTRL DAPFANEPVV ELRRAPLRAR LADALAALDA ELPLTVPVSV GGETREPAAD 
AFRSTDPGTP DRVVAVAAEA SPAEVGAAVT HAVEGGRAWA LRPARERAEV LLRAAAGLRA
QRARLAALAL RECGKPWPEA DADVCEAIDF LEYYARGAVA LDAGAALLQP PGERNALRYA
PRGVVAVIAP WNFPLAIVTG MTAAALATGN AAVVKPAEQS PACAAAVVAA LHAAGVPPDA
LALLPGAGEV GAALVRDPRV ATIAFTGSLA VGLEIQRAAA EPAPGQHALK RLVAELGGKN
CVIVDADADL DEAVPAIVAS AFHYAGQKCS AAARVLVHEA VAETLCQRLA GAVAVLSVGQ
PQLLETEVGP LIERAAQERI ERASARAEAE GRLLVRHAGP LPAAGWFCAP AVATDLPPDS
PLLRDELFGP LLTVEAVRDV GHACALVDAL PYALTGALFC RDPATVAAVA AISPVGNLYV
NRATTGAMVG RQPFGGNRLS GTGAKAGGPD YLRHFAEPRV VTENTVRHGL VA