Gene Information |
Locus tag | Cwoe_5313 |
Symbol | |
ID | 8735783 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Conexibacter woesei DSM 14684 |
Kingdom | Bacteria |
Replicon accession | NC_013739 |
Strand | + |
Start bp | 5679951 |
End bp | 5681111 |
Gene Length | 1161 bp |
Protein Length | 386 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 646505939 |
Product | cellulose biosynthesis (CelD)-like protein |
Protein accession | YP_003397094 |
Protein GI | 284046754 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG5653] Protein involved in cellulose biosynthesis (CelD) |
TIGRFAM ID | |
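The start, end, gene length, and protein length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal Python sketch, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon. The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length from 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length_bp = gene_length(5679951, 5681111)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1161 386
```

Both results agree with the Gene Length (1161 bp) and Protein Length (386 aa) fields.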
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.323377 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | GTGACCTCCG CGCCGCTCCA GGCCGAGACG TTCGACACGA TGGCGGCGCT CGCGCGGCTG ATCCCCGAAT GGGACGCGTT GGCAGACGCG GCCGGCCGGC CGGCGTGCCT GCCGGCGTGG CAGCTGGCGT GGTGGCGTGA GCTCGCGCCG CCGGGCTCCC TGCTGCGGGT CGTCGCCGTC CGCGCGGACG GGCAGCTCGT CGGGCTGGTC CCGTTCTTCC TGGAGCAGCG GCTCGGGCGC AGCTCGCACC GGCTGCTCGG CGCGGCGGTG ACCCACCGGA CCGAACCGCT CGTCCACCCC GGCGTCGCGG CCGCGGACGT CGCCGCGCTC GCCGTCGCCG AGCTGGAGCG GTGCGACCCC GCCGCCGACG TCGTGCGGCT GGAGGGAATC GCCGCCGACG GCCCGTGGCC GGCGGCGTTC GATGCCGCCT GGCCGGCGCT CGGAGCGCCG TGGCGCCTGG TCGAGCAGCA ACAGCTCGCC CCCTCCGTCG ACCTCTCGTT CGACGACCTC GACGCATGGC TGGCGTCCAA GAGCAGCAAC TTCCGCCAGC AGACGCGCCG CTTCCGCCGC CGACTCGAGA AGGAAGGCGG CACGGTGCGG ATGAGCAGCG AGCAGGAGGT CGCGGGCGAC CTCGCCGCGA TGCTGCAGCT CCATCACCAG CGGTGGGCCG GGCGCGGTGG CTCGTCGCTG CCGGCGGCCA CCGAGGCGAT GGTGCGGACG GCGGCCGAGC AGCTGCTGCC GGCAGGACGG CTCCGACTGT GGGTCGTCGA GCTCGACGGC CGGCCGATCG GCGTCCAGCT GTTCCTCGCC GCCGGCGGGA ACGTGCTGTA TTGGAACGGC GGGTTCGACG AGAGCGCGAG CCACCTCAAG CCCGCGCTGC TCGGCATCGT CGCCGGGATC GAGGACAGCC TCCTGCGCGA CGAGCGGCTG CTCGACCTCG GCGGTGGCGA CCAGGACTAC AAGCTGCGGC TTGCGGACGG CGCGACGCCG CTGACGTGGA CGTCGCTCGT CCCGCGCAAC CGTCGCTACG CCGCCAACCG GCTCGCGCTC GCGCCGGAGG CGCTTTCGCT CACCGGGCGC CGGCTGGCGG AGCAGCTTCC CGACCCGCTC CAGCAGCCGC TGCGCAGGGT CGTCCGGCGC TACTCCGGGC GGCGCGCCTG A
Protein sequence | MTSAPLQAET FDTMAALARL IPEWDALADA AGRPACLPAW QLAWWRELAP PGSLLRVVAV RADGQLVGLV PFFLEQRLGR SSHRLLGAAV THRTEPLVHP GVAAADVAAL AVAELERCDP AADVVRLEGI AADGPWPAAF DAAWPALGAP WRLVEQQQLA PSVDLSFDDL DAWLASKSSN FRQQTRRFRR RLEKEGGTVR MSSEQEVAGD LAAMLQLHHQ RWAGRGGSSL PAATEAMVRT AAEQLLPAGR LRLWVVELDG RPIGVQLFLA AGGNVLYWNG GFDESASHLK PALLGIVAGI EDSLLRDERL LDLGGGDQDY KLRLADGATP LTWTSLVPRN RRYAANRLAL APEALSLTGR RLAEQLPDPL QQPLRRVVRR YSGRRA
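The sequences above are printed in space-separated blocks of ten characters, so the spaces need to be stripped before any downstream use. The sketch below is a hedged Python illustration of that step, plus a simple GC-fraction calculation of the kind that underlies a field such as GC content. The helper names `ungroup` and `gc_fraction` are hypothetical, and the example runs only on the first ten-base block of the gene sequence, not on the full gene.

```python
def ungroup(seq: str) -> str:
    """Remove the whitespace between ten-character blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence (whitespace ignored)."""
    bases = ungroup(seq)
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


# First block of the gene sequence in this record: 7 of its 10 bases are G or C.
first_block = "GTGACCTCCG"
print(gc_fraction(first_block))  # 0.7
```

Passing the full Gene sequence value to `gc_fraction` would give the whole-gene figure to compare against the reported GC content.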