Gene Cwoe_1105 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCwoe_1105 
Symbol 
ID8731540 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameConexibacter woesei DSM 14684 
KingdomBacteria 
Replicon accessionNC_013739 
Strand
Start bp1163838 
End bp1165370 
Gene Length1533 bp 
Protein Length510 aa 
Translation table11 
GC content70% 
IMG OID646501722 
ProductAldehyde dehydrogenase (NAD(+)) 
Protein accessionYP_003392912 
Protein GI284042572 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGAGTG TCGAGCAGGC GCCGGCGGCC GCGCAGGGCG GCACCGGCAT CCCCGTTGAG 
AACCCGGCCA CCGGCGAGAC GATCACGACC GTCCCGACGC TCGGCCGCGA GGACGTGCAC
GAGCTGGTCG CGCGCGCTCG CGCCGCCCAG CCGGCGTGGG ACGCCATCGG CTTCGACGAG
CGTGGCCGCG TGCTGCGGCG CGCGCAGAAG TGGGTCACCG ACAACGCTGA GCGCATCATC
GCGACGATCG TCTCCGAGAC GGGCAAGACC TACGAGGACG CTCAGCTCGC CGAGATCATG
TACGCCGCCG CGGCGTTCGG CTTCTGGGCC AAGGAGGCGC CGGCGTTCCT CGCCGACGAG
CCGATGAAGT CCGCGAACCC GCTCGTGAAG GGCAAGAAGC TCGTCGTCCG CTACCGCCCG
GTCGGCGTCG TCGGCGTGAT CGGCCCGTGG AACTTCCCGC TGACGAACTC GTTCGGCGAC
TGCATCCCCG CGCTCGCCGC TGGCAACGCC GTGATCCTGA AGCCGAGCGA GGTGACGCCG
CTGACCTCGC TGCTGATGGC CGAGGGGCTG AGAGCGGCGG GTCTGCCGGA GGACGTCTTC
CAGGTCGCCA CCGGTGACGG CGCCACCGGC GCCGCGTTGA TCGACGAGGT CGACTTCGTC
ATGTTCACCG GCTCGACGAA GACCGGCAAG AAGGTCATGG AGCGTGCCGC CAAGACGCTG
ACGCCGGTCG GGCTGGAGCT GGGCGGCAAG GACCCGATGA TCGTGCTGGC CGACGCCGAC
GTCGACCGCG CCGCCAACGC GGCCGCCTAT TACTCGATGA ACAACGGCGG CCAAGTCTGC
ATCTCGATCG AGCGCGTCTA CGTCGAGGCG CCGGTCTACG ACCAGTTCGT CGCGAAGGTG
ACCGAGAGAG TCAAGAGCCT GCGCCAGGGC CGCTCCGACG GTCCCGGCTC GATCGACGTC
GGCGCCGTGA CGTTCCCGCC GCAGCTCGAC ATCATCGACA AGCACGTCAG AGACGCCGTC
AGAAAGGGCG CGCGCGTGCT GACCGGCGGC AGAGCCGGGA ACGGCCCGGG GATGTTCTAC
GAGCCGACCG TGCTCGTCGA CGTCGACCAC TCGATGTCGT GCATGACAGA GGAGACGTTC
GGCCCGACGC TGCCGATCAT GAGAGTCGGC GACGCCGAGG AGGCGTTGCG GCTCGCGAAC
GACTCCCCGT ACGGTCTGCA GGCGTCGGTC TGGACGAAGG ACACGCGCCG CGGCGAGCAG
CTGGCGCGCC GCGTCGAGGC GGGCGCCGTC TGCGTCAACG ACGCGCAGGT CAACTACACG
GCGCTGGAGC TGCCGATGGG CGGCTGGAAG TCCTCGGGCC TCGGCACGCG CCACGGCGCC
GGAGGGATCC GCAAGTACAC CCAGCAGCAG ACGCTGCTCG TGACCCGCTT CGCCGGCAAG
AGAGACCCGC ACATGTTCCC GTACAAACGG CGCACGACGC TGCTGCTGGG GAAGCTGACG
AGACTGCTCT ACGGTCGCGG CAAGCGCGAC TGA
 
Protein sequence
MASVEQAPAA AQGGTGIPVE NPATGETITT VPTLGREDVH ELVARARAAQ PAWDAIGFDE 
RGRVLRRAQK WVTDNAERII ATIVSETGKT YEDAQLAEIM YAAAAFGFWA KEAPAFLADE
PMKSANPLVK GKKLVVRYRP VGVVGVIGPW NFPLTNSFGD CIPALAAGNA VILKPSEVTP
LTSLLMAEGL RAAGLPEDVF QVATGDGATG AALIDEVDFV MFTGSTKTGK KVMERAAKTL
TPVGLELGGK DPMIVLADAD VDRAANAAAY YSMNNGGQVC ISIERVYVEA PVYDQFVAKV
TERVKSLRQG RSDGPGSIDV GAVTFPPQLD IIDKHVRDAV RKGARVLTGG RAGNGPGMFY
EPTVLVDVDH SMSCMTEETF GPTLPIMRVG DAEEALRLAN DSPYGLQASV WTKDTRRGEQ
LARRVEAGAV CVNDAQVNYT ALELPMGGWK SSGLGTRHGA GGIRKYTQQQ TLLVTRFAGK
RDPHMFPYKR RTTLLLGKLT RLLYGRGKRD