Gene Cwoe_3408 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCwoe_3408 
Symbol 
ID8733857 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameConexibacter woesei DSM 14684 
KingdomBacteria 
Replicon accessionNC_013739 
Strand
Start bp3632607 
End bp3633644 
Gene Length1038 bp 
Protein Length345 aa 
Translation table11 
GC content70% 
IMG OID646504025 
Productaldo/keto reductase 
Protein accessionYP_003395201 
Protein GI284044861 
COG category[C] Energy production and conversion 
COG ID[COG0667] Predicted oxidoreductases (related to aryl-alcohol dehydrogenases) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.299651 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCTCCT CCTACACCGC TGCCGCGGAC CGCTACGAGA CGACGATGCA ATACCGCCGC 
TGCGGGCGCA GCGGCCTGCT GCTGCCGGCG ATCTCGCTCG GCCTCTGGCA CAACTTCGGC
GACGACCGGC CGCTGGAGAA CCAGCGCGCG ATCCTGCGCC GCGCGTTCGA CCTCGGCGTC
ACGCACTTCG ACCTCGCGAA CAACTACGGG CCGCCGTACG GCTCCGCCGA GACGAACTTC
GGCCACATCA TGCGCGAGGA CCTGCGGCCC TACCGCGACG AGCTGATCGT CTCGACGAAG
GCCGGCTGGG ACATGTGGCC CGGGCCGTAC GGCGAGTTCG GCTCGCGCAA GTACCTGCTC
GCCTCGCTCG ACCAGTCGCT GAAGCGGATG GGGCTCGACT ACGTCGACAT CTTCTACTCC
CACCGCTTCG ACCCGGACAC GCCGCTGGAG GAGACGATGG GCGCGCTCCA CACAGCCGTC
CAGCAGGGCA AGGCGCTCTA CGTCGGGATC TCCTCCTACG GCTCGCCGCG CACCGCCGAG
GCGATCGGGA TCCTGCGCGA CCTCGGCACG CCGCTGCTGA TCCACCAGCC GTCGTACTCG
CTGCTGAACC GCTGGATCGA GAGAGGCCTG CTCGACGTGA TCGGCGAGCA CGGCGTCGGC
TCGATCGTCT TCACGCCGCT GGCGCAGGGG ATGCTGACCG ACCGCTACCT CGACGGCATC
CCGTCCGACT CGCGCGCGGC GAGAAGAACC TCGCTCGACC CCGGCTGGCT GGACGAGAGA
ACGCTCGCGC ACATCCGCGC GCTGAACGAG ATCGCGCAGC GGCGCGGCCA GTCGCTGGCG
CAGATGGCGC TCGCCTGGAC GCTGCGCGAC CCGCGCGTGA CCTCGACGCT CGTCGGCGCC
AGCAGCGTCG CGCAGCTCGA GGACAACCTC GGGGCGCTCG ACAACCTGTC CTTCTCCGAC
GAGGAGCTGC AGGAGATCGA GGACCGCACG ACCGAGGCGG GGATCAACCT CTGGGCCGAG
TCGGCCGAGG TCGACTGA
 
Protein sequence
MASSYTAAAD RYETTMQYRR CGRSGLLLPA ISLGLWHNFG DDRPLENQRA ILRRAFDLGV 
THFDLANNYG PPYGSAETNF GHIMREDLRP YRDELIVSTK AGWDMWPGPY GEFGSRKYLL
ASLDQSLKRM GLDYVDIFYS HRFDPDTPLE ETMGALHTAV QQGKALYVGI SSYGSPRTAE
AIGILRDLGT PLLIHQPSYS LLNRWIERGL LDVIGEHGVG SIVFTPLAQG MLTDRYLDGI
PSDSRAARRT SLDPGWLDER TLAHIRALNE IAQRRGQSLA QMALAWTLRD PRVTSTLVGA
SSVAQLEDNL GALDNLSFSD EELQEIEDRT TEAGINLWAE SAEVD