Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cwoe_1378 |
Symbol | |
ID | 8731817 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Conexibacter woesei DSM 14684 |
Kingdom | Bacteria |
Replicon accession | NC_013739 |
Strand | + |
Start bp | 1445529 |
End bp | 1447472 |
Gene Length | 1944 bp |
Protein Length | 647 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 646501996 |
Product | glycoside hydrolase family 18 |
Protein accession | YP_003393182 |
Protein GI | 284042842 |
COG category | [R] General function prediction only |
COG ID | [COG3858] Predicted glycosyl hydrolase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.112456 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGGCGC GGGGAACGAT CGCAGCGGGC GTGCTCGGGG CCGCTCTGTT GACAGCCGGA GCGCCCACGG TGGGCGCGAC GGCGCCGGTG GACGGCGCGG CGGCCGGGCC GGCGGGCGTC GCGGCGAGAC GCGGCGGCCG CGATGCGCCG CCGGTGCGGC GCTGCGCCGC AGTGAGACCG TCAGGGCTCA CCTTCAGCCG CGGGCGTGGT CAGACGACCG GCGTGCTGCG CTGGCGCGTG CCGCGCAACG CGACGACGCG CGCGAAGGGC AGACGCGCCG TGCGCTACCG CGTGATTCGC GACCGCGGCG TCGTCGGGCA GACGGCGAAG CGTGCGCTGC GGCTGAAGGT GACGCTCGGC AGAAGCCACC GCTTCGCCGT GCAGGCGCTC GACGCCAAGG GCAAGCCGGT GCGGCGCTGC CGCGCCGAGA AGCGCGTGCG CGTCGCCTAT CGCGCGCCCG GCAGCCCACG CTATGTCGCG GTCGCCGGTG ACGAGCGCGG GCTGCGGCTC AGCTGGCAGG CCGGCGCGCG CGGCGACGGC GAGCCGGCCG GCTACCGGCT CGTGCGCGAC GGGACGACCG TCGGGCAGAC GACGCAGACG AGCTGGGCGG TGCCGGCAGC GCCGAACCGC ACGTACCGCT TCCAGGTCGT GGCGGTCGAC CGCCGCGGGC GCACGAGCGC ACCGAGCGCG ACGGTCACGG CGAGAACCGG CCATGAGCCG CCGACGGCGC CCGCCGGGCT GGCCGCGCTG GCGGTCTCCG AGTCCGAGCT GGGGGCGCAA TGGCAGCCGA GCACGGTCGC CTCCGGCGAG ATCAAGGGCT ACCGCGTGCT GCGCGACGGC GCCGTCGTCG GCCAGGTCGC CGCCACCTCG ACCGTGCTCG GGAACCTCGC TCCGAGCACG GACTACGAGG TCGCCGTCGT GGCGATCGAC AGCCACGGCT ACACGAGCGC GCCGGCCGTC GTCCGCGGTC GCACGCACGA TCCCGTCCCG ACGACCGGAC ACGCGCAGGC GTACCTGCTC GCCTCGACCG ATCAGAGCTT CGCCGACTTC CGCGCCCACT ACCGGCAGAT CGGCGTCGTG CACCCGACCT ACTACGACTG CACCGGCGCC GGAGCCCTCG TCGGCAGCGA CGACCCGCTC GTGACGAGGT GGGCGCAGGC GCGCAGAGTC GAGGTGCTGC CGCGGATCAA CTGCCAGCGG ACGGCGACGG TCCACAAGAT CCTGACGGAC CCCGCGACGC GCGCCGCGTG GCTCGACCGG CTCGTCGGGC TCGCGCGCGA GGTCGGCTAC GACGGCATCT CGCTCGACTT CGAGGCCGGT CCCGCCGAGG ACCGTGCGGC GCTGACGTCG TTCGTGCAGG AGCTGGCGGG GCGGCTGCAC GCCGACGGCC GCAAGCTCGC GATCGCGCTG TCGTCGAAGA CGAGAGACAG CCTGACGCAC CCGCGCTCGG GGATCTTCGA CTACGCGCCC CTCTCAGAGG CGGCCGACTA CCTCTTCCTG ATGGCGTGGG GATTGCACTG GACGACCTCA GTGCCCGGGC CGCAGGATGA CGCCGACTGG GTCCGCAGAG TCGTCGAGTA CGTCAAGACG ATGCCGCAGA AGCACAAGTT CGTCTTCGGC ACGAACCTGT ACGCGCTCGA CTGGCCGAAC GGCGGCGGGG CGCAGAACAA GGCGACGGCG TACGAGTATC AGGACGCGAT GGCGCTGCTG CCGCAGTTCG CCGCGCAGAT CCGCCACGAC CCCGTGACCG ACAACTACCA GGCGACGTAC ACCGACGCCG CCGGGGTGGC GCACGAGGTC TGGTACCCGG ATGCGGACAC GACGGCGCGG CGCGTGCGGA TCGCGAAGGA GGCGGGGCTC GGCGGCGTCG GGTTCTGGCG GCTCGGCCGA GAGGACCAGC GCGTCTGGGA CGACCCGCTG CTGGCGCCGG GAGTCGCCTG GTGA
|
Protein sequence | MTARGTIAAG VLGAALLTAG APTVGATAPV DGAAAGPAGV AARRGGRDAP PVRRCAAVRP SGLTFSRGRG QTTGVLRWRV PRNATTRAKG RRAVRYRVIR DRGVVGQTAK RALRLKVTLG RSHRFAVQAL DAKGKPVRRC RAEKRVRVAY RAPGSPRYVA VAGDERGLRL SWQAGARGDG EPAGYRLVRD GTTVGQTTQT SWAVPAAPNR TYRFQVVAVD RRGRTSAPSA TVTARTGHEP PTAPAGLAAL AVSESELGAQ WQPSTVASGE IKGYRVLRDG AVVGQVAATS TVLGNLAPST DYEVAVVAID SHGYTSAPAV VRGRTHDPVP TTGHAQAYLL ASTDQSFADF RAHYRQIGVV HPTYYDCTGA GALVGSDDPL VTRWAQARRV EVLPRINCQR TATVHKILTD PATRAAWLDR LVGLAREVGY DGISLDFEAG PAEDRAALTS FVQELAGRLH ADGRKLAIAL SSKTRDSLTH PRSGIFDYAP LSEAADYLFL MAWGLHWTTS VPGPQDDADW VRRVVEYVKT MPQKHKFVFG TNLYALDWPN GGGAQNKATA YEYQDAMALL PQFAAQIRHD PVTDNYQATY TDAAGVAHEV WYPDADTTAR RVRIAKEAGL GGVGFWRLGR EDQRVWDDPL LAPGVAW
|
| |