Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cwoe_5481 |
Symbol | |
ID | 8735956 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Conexibacter woesei DSM 14684 |
Kingdom | Bacteria |
Replicon accession | NC_013739 |
Strand | + |
Start bp | 5864573 |
End bp | 5867374 |
Gene Length | 2802 bp |
Protein Length | 933 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 646506111 |
Product | Glycosyltransferase-like protein |
Protein accession | YP_003397261 |
Protein GI | 284046921 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.401117 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCATCGC CATCCTCCGA TCCAGCGACG CCCGGGCCGG AAGCCGCGCC GCTCGCGCTG TCGCTCGCCG TCGACCGCCT TGCCAGGTTC ACGGGTGTCC GCGCGGTGGC CAACGTCGGA CCCGCGCCGC TTCGACTCGA AGGCGACTTC GAGCTGACCG CGGCTCCTGC TGAGGGCGGG ATCGTGGTCG CCGCCGGCGA CACGTCGTGG GACGCGCTCG GCGCGGCGCG GATCGCCGTC GTCGCCGACC TCGGCCGCGA CCGCGCGCCG TTCGACGCGA TCGCCGCCGC CGAGGAGCGA GCCGCCGCGG CCGGCTGGCG CATCGCCCAC GCAGGTCTGC TGCACACGGG CGCGACCGGC GCGCGTCGCG AGGGCTCGCT GCTGATCGCT TGCAGACCCG AGGACTCGAT CGCGGAGGCG CTCGCCTGCG GTCCGCTCGG CCTCGCGCTC GACCCCCGTG CGAGCGGCAC GGGGTTCGCG CCGCGTCCCG CTCGCGTCCT GATCGCCTCG CACGAGGTCG CCGGGCCGAC GGGCAACGGC GGCATCGGGA CGGCCTACCA TTCGCTCGCC CACACGCTGG CGGCTGCGGG GCACGACGTG ACGATGCTGT TCACCGGCTG GCTCGACCCG GAGCAGGCCG GCGGGGAGCC CGAGTGGCGC CGCAGCTTCG CGCAGGCGGG GATCGACTTC CAGCTGCTCG GCACACCATG GGACGTGCCG GTGCGCAACC CGCATCACCA GGTGCGCCGC GCCTACGAGC TGCACCGCTG GCTCACCGCG ACGCATGCGA CGCGCCCGTT CGACGTCGTC CACGCGCCCG AGACGCCCGG GCACGCGGCC TTCGCGTTGA TGGCGAAGCG GCTGGGCTCG GCCTATCGCG ACGTCGAGTT CGTCATCGGC ACGCACTCCT CGACGCGCTG GGTCGCGGAG AGCAACCGGG AGGGGATCGA GCAGCTCGAC GATCTCGTCA GCGAGCAGCT CGAACGCACC AGCGTCGAGC TGGCGGACGT CGTCATGAGC CCGACGGCGT ACATGCTCGA GTACATGCGC GGGCGGGGCT GGTCGCTGCC GGAGCGCACC TTCGTGCAGC CGCTCGCGCG CCCGCGAGCC GTTCGCGAGC TGGCGGGCGG ACGCCCGACC GCGGGCGAGC GGCCGACGCG CCAGCTGGTG TTCTTCGGAC GGCTGGAGAC GCGCAAGGGG CTGGAGGCGT TCTGCGACGC CGTCGACCTC CTCACCGCCG CCGACGACTG TCCGTTCGAG CGGGTCACCC TGCTCGGCCG CCCCGAGCCG ATCCTCGGCA CCGACGCGAC CTCCTACGTC ACACGGCGGG CCGCCGGGTG GGGTCTCGAC TGGAGGATCC TGCCCGACCT CGGCCACGAC GAGGCGGTCG CCCACCTGCG CGACGCCGCC GGCGTCGTCG CGATCCCGTC GCTCGTCGAC AACTCGCCGA ACACCGTGAT CGAGACGGTG GCGCTCGGTG TCCCCTTCGT CGCATCGCGT TCCGGCGGCA CCGGCGAGCT GATCGCCGCC GCGGACCTGG CCAGCTCGAC GTTCGACGGC TGGCGCTACG CCGGCGCGCT CGAGCCGCCG ACGTTCTCCG ATGCCCGCGA GCCATTCGAC GTCGAGGCGC TCGCCACCGC GCTCCGCGCG AAGGCGTTCG CGCCCGCCGC ACCCGTCTCA CCGTCCGTCG ACGACGCCGC CTGCGACAGC GCCTACGACG GCTGGCACCG CGCGATCGCC GGCCGGCCGC GCGACGCAGC GCCCGCGGCG CCCGGCGAGC CGCTGACCGC GGCGGTCTGC ATCGTCGCGC GGGACCGCGC CGCCGCGGCG CGGGTCGGAG CCGCGGTCGC GTCCGGCACG CGGGCGCCGA CGTGGATCGT GGCGATCGTG GAGGAGCCGG GCGGCGACTC GCTCCCCGGC CTCGACAGCG TCGTCGTCGC AAGCGACCGT GACGCCGGCC AGGCCCGCCG GCGAGTGAAC GCGGAGCTCG ACGCCGACGT CCTGATCGTC CTGCGCGGCA ACGAGGAGCC CGACCCCGAG CTCGTCGAGC GCACGCTCGA CGCGATGCGG ACGGGCGGCG CGGAGCTGCT GTCGCTCGTC ACCCGCGACC ACGACGCGGA CCGGCCGACG GACACGCCCG AGCACCTGCG CCGGAGCGAG CTGCCGCGCG ACCTGTGCGC GTTCGTCCCG ATCGCCGGCC CGGCGGTCGC CGGCGCCCTC TATCCGGCGT TCTGCGTCGG ACCGTACGCG ATCCGCCGTT CGGCGCTGAG CGGGCTCGGC GGCTACGCCG CAGACGCACC GAGCGGCGCC GTCGACCGTG AGCTGCTGTC GCGGGCCGCG CTCGACGGCG TGCGGATGCA CGTCTTCCCC GACCCGCTCG CGAGCGTCGT CGAGGACGAC GAGCACGTCG CGCTGCGCGC GCACTACTGG GGCTCCACCG CGGTGCCTTC GCCGCGCGGC GAAGAGCAGA TCAGCCTGCT GCGGCCGTTC CGGCGCCAGC TCGGCGAGTC GCTCGCCGAC CTGCCCGCGC TGCTCACCGG CACGCTCCGG GTCGCCGGCG GCTCGGCCGA GCGGGCGCGA GACGAGGCCG CCCGTCGCGA CGAGATCGTC GCCGCGTACG AGGCGCGGCT GACCGAGCAC CGCGAGCTGA TCGAGCTCTA CGAGCGGCAG AAGGAGGAGC TGCGCGCCGC GCTGGGCTCG GGCGGCGCGC GCAGCGCCCG GCCGGGACCC GAGTCCCCTG GGCAGGTTCT TGCGTGGCGC GTTCGCCAAC GCATCAAACG ACTGGGCCGG AGGCTGCGAT GA
|
Protein sequence | MASPSSDPAT PGPEAAPLAL SLAVDRLARF TGVRAVANVG PAPLRLEGDF ELTAAPAEGG IVVAAGDTSW DALGAARIAV VADLGRDRAP FDAIAAAEER AAAAGWRIAH AGLLHTGATG ARREGSLLIA CRPEDSIAEA LACGPLGLAL DPRASGTGFA PRPARVLIAS HEVAGPTGNG GIGTAYHSLA HTLAAAGHDV TMLFTGWLDP EQAGGEPEWR RSFAQAGIDF QLLGTPWDVP VRNPHHQVRR AYELHRWLTA THATRPFDVV HAPETPGHAA FALMAKRLGS AYRDVEFVIG THSSTRWVAE SNREGIEQLD DLVSEQLERT SVELADVVMS PTAYMLEYMR GRGWSLPERT FVQPLARPRA VRELAGGRPT AGERPTRQLV FFGRLETRKG LEAFCDAVDL LTAADDCPFE RVTLLGRPEP ILGTDATSYV TRRAAGWGLD WRILPDLGHD EAVAHLRDAA GVVAIPSLVD NSPNTVIETV ALGVPFVASR SGGTGELIAA ADLASSTFDG WRYAGALEPP TFSDAREPFD VEALATALRA KAFAPAAPVS PSVDDAACDS AYDGWHRAIA GRPRDAAPAA PGEPLTAAVC IVARDRAAAA RVGAAVASGT RAPTWIVAIV EEPGGDSLPG LDSVVVASDR DAGQARRRVN AELDADVLIV LRGNEEPDPE LVERTLDAMR TGGAELLSLV TRDHDADRPT DTPEHLRRSE LPRDLCAFVP IAGPAVAGAL YPAFCVGPYA IRRSALSGLG GYAADAPSGA VDRELLSRAA LDGVRMHVFP DPLASVVEDD EHVALRAHYW GSTAVPSPRG EEQISLLRPF RRQLGESLAD LPALLTGTLR VAGGSAERAR DEAARRDEIV AAYEARLTEH RELIELYERQ KEELRAALGS GGARSARPGP ESPGQVLAWR VRQRIKRLGR RLR
|
| |