Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cwoe_5018 |
Symbol | |
ID | 8735484 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Conexibacter woesei DSM 14684 |
Kingdom | Bacteria |
Replicon accession | NC_013739 |
Strand | - |
Start bp | 5349206 |
End bp | 5350786 |
Gene Length | 1581 bp |
Protein Length | 526 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 646505645 |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_003396804 |
Protein GI | 284046464 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.0327585 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAAACACC TCACCCGGCG TGAGTTCTCG GAGACCGGCA TCAGATACGG CGCGGCAGCG GGCCTGCTCG GAGGCGGCCT CGCGACCGTC CTCGCCGGAT GCGGCGGCGA CTCGTCCAGC GACGGGTCGA CGAGCGCGGG CACGACGCCC GCCGACGGCT CCGGCGGGGG GAGCGGGGGG ACGATCCGGA TCGGCAACGC GGAGCCGCCG ACGTCGGCCC AATGGGACCC GCACGCGGCG TTCGGCCTCG CGGACTACCA GACCTGGTCG CTCGTCTACG ACACGCTGCT CGCGTACGAC AGCGCCGGAG AGCTCGTCGG CCAGCTGGCG AAGTCGTGGA AGCGGCTGTC GCCGACGCGC CTGCGGATCG TGATCCACAG AGACGTCCAC TTCAGCGACG GCAGCCCGCT CGGCGCCGAG GACGTGAAGG CGTCGATCGA GCGCATCTCC GCGCCGCAGT CGGAGCTGGC GCTCGCGTCG AAGCTGCCCG AAGGCGCGAA GGTCGAGGTC CGCGGCGAGC ACGAGCTGGA CATCGTCACG CCCGAGCCGT TCGGCCCGCT GGAGGGCGCG CTCGTCGTCG TCTCGATCGT CTCGCGCAGA GACGCGGCGA GACCGGAGGC GTTCAAGCGC CGCCCGCTCG GCAGCGGTCC GTACACGTTC GTCGAGTACC GCAACAACAG CATCAGACTG AAGGCGAACC CCAGATACTG GCGCGGCAAG CCGGGCTCCG ACGGCGTCGT GCTGTCCTAC GTCCAGGACC CGAGCGCGCG CATGAACGCG CTGCTGACCG GCCAGATCGA CATCTACACG CGCGCCGACT CGATCGTGCT CGACGAGGTC AGAGGCAACG ACGACTTCTA CGTCAACGAC ACCAGTCCGG CGTCGAACTT CTTCTACATC CCGCAGTTCG ACACGGCGCT CAGAGACGTC CGCGTGCGGC AGGCGATCGC CTACGCGATC CCGCGTCAGC AGATCGCCGA GAGCATCATG AGAATCTGCC CGCCGGCGCT CTCCTCGCTG CCCGCGGCGT CGAAGGGCTT CAGACCGATG GAGCCGAGAT TCGACCTCGA CCTGGAGCGC GCGAGATCGC TGCTGAAGGA GGCCGGCCAC GACGGCGGCC TGTCGATCAC GCTCGCCTCG GCCAGCGTCT TCGCCCACCA GGAGCAGGTC GACCAGCTCG TGAAGGCGTC GCTGGAGCAG GTCGGCATCA CCGTCGACAT CAAGAAGCTG GAGAGCGGCA CGTTCCGCTC GAACTTCTCG CAGTACGCGC TGTCGATGAA CGCGCTCGAC ACGCCGGGCG ACCCGAACTT CATCTTCTCG TTCTTCCGGC CGTCGATCGC CAGAGAGGTC CTGAAGTGGG ACTCGGCCGA CTTCATGCCG CTGGTCGAGG CGCAGCGCCG CACGATCGGC GCCAGACGGC AGGCGACGAT CGACGCCGCC GCGAGATACC TGTGGGAGAA CCAGATCCTC GTCTACCTCA CCGACGACAT CTGGTACACG GTCGTCAACA GACGCGTCAG CGGCTACGAG CGCTCGACCG TCGAGGGCGA GCCGCTGCTG TGGAGAGCGA AGGCGGCGTA G
|
Protein sequence | MKHLTRREFS ETGIRYGAAA GLLGGGLATV LAGCGGDSSS DGSTSAGTTP ADGSGGGSGG TIRIGNAEPP TSAQWDPHAA FGLADYQTWS LVYDTLLAYD SAGELVGQLA KSWKRLSPTR LRIVIHRDVH FSDGSPLGAE DVKASIERIS APQSELALAS KLPEGAKVEV RGEHELDIVT PEPFGPLEGA LVVVSIVSRR DAARPEAFKR RPLGSGPYTF VEYRNNSIRL KANPRYWRGK PGSDGVVLSY VQDPSARMNA LLTGQIDIYT RADSIVLDEV RGNDDFYVND TSPASNFFYI PQFDTALRDV RVRQAIAYAI PRQQIAESIM RICPPALSSL PAASKGFRPM EPRFDLDLER ARSLLKEAGH DGGLSITLAS ASVFAHQEQV DQLVKASLEQ VGITVDIKKL ESGTFRSNFS QYALSMNALD TPGDPNFIFS FFRPSIAREV LKWDSADFMP LVEAQRRTIG ARRQATIDAA ARYLWENQIL VYLTDDIWYT VVNRRVSGYE RSTVEGEPLL WRAKAA
|
| |