Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cwoe_2781 |
Symbol | |
ID | 8733224 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Conexibacter woesei DSM 14684 |
Kingdom | Bacteria |
Replicon accession | NC_013739 |
Strand | + |
Start bp | 2968186 |
End bp | 2969745 |
Gene Length | 1560 bp |
Protein Length | 519 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 646503393 |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_003394575 |
Protein GI | 284044235 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.187703 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.0364487 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGTGAGGT TGAGCAAGTC GGCCACCGCC CTGGCGGTGC TGGTCGCGGC GTGGTCTTTG GCCGCATGCG GAGGCGGGGC GAAGGTAGGG GACTCGACGT CGGACGGCGG TGGCGGGGCG TCGACCGGCG CGTCGTCGAG CGGAGGGACG TTGACGGTCG GCCTCGACTC CGATCCGGCG TCGCTCGACC CGACCGGCGA CACCGGCTAC GCGGGCTCGC TCGTGACGCC GCAGATCTTC GAGACGCTCG TCGTCGCGGA CGACGACGGC ACGATCGGCC CGGGCCTCGC CGAGAGATGG ACGGTCTCGA GAGACGGCCG GACCTACACG CTGACGCTGC GCAAGGGCGT GAGATTCCAC GACGGCACGC CGCTCGACGC GAGAGCGGTG GTCGCGAGCC TCAGACGCAG CGCCGGAAGA GCGTCGCCGT GGGCGGCCGA CCTCGCTCCG ATCACAGCGA TGAAGGCGAC CGGCGAGGAC ACCGTCGTGC TGACGCTCGA CAGACCGAAC GCGCCGCTGC TGTCGACGCT CGCCGACAAG CCGGGCATGA TCGCCTCGCC GACGGCGGTC GAGCAGGCGG GCAGACGGTT CGGGTCGCAG CCGGTCGGAA CCGGTCCGTT CGCGTTCGAC CACTGGACGC GCAACCAGGA GCTGATGCTG AGACGGAACC CCGACTACTG GGACGCCGGC AAGCCGAAGC TCGACGCCGT CGTCTTCAAG CCGCTGCCGG ATCCGACGCA GAAGGTCACC AACCTCGTCG CCGGCCAGGT GCAGACCGTC GACTACGTGC CGCCGGAGCT GATATCTCGC GTCGAGGGCG CGTCGAACCT CGAGCTGGAG CAGGGCCCCG GACCGTACAA CTCGGTCGTC TACGTGCCGA TGAACGCGGC GCGGCCGCCG CTCGACGACG CGAACGTCCG CCAGGCCGTC TCGCTCGCGA TCGACCGCGA CTCGATCGTC AGAAACGTCG CCTTCGGAGC CGGCACGCCC GCGCGCTCGA TGCTCTCGCC GACCTCGTGG GGCTACAGCG ACGAGATTCC GGCGATTCCG TACGACCCTG CCAGAGCGAG AACGCTGCTG GGCGGGAGAG AGGTGAAGCT CGAGCTGCAG GTGCCGCCGA CCTACACGCA GGCCGCGCAG GTGATGAAGC AGAACCTGGC CGAGGCCGGG ATCGACGTGA CGCTGCGGCG GATGGACTGG GGCCAGCTGA TCGACGGCTT CTACAAGGGC GACTTCGACA TGCAGGTGCA GGACCTGCTC GGGATGCAGC GCTCCGACCC CGACGGCGCG CTCAGCAGCT TCTACGCGCC GGACGGCTCC AACAACGGCG CCGGCTTCTC CGATCCGCAG ATCACCGCGC TGCTCGACAG AGCCCGCTCG GGCGGCGACG AGGCGCAGCG CAGACCCGAG TACGTCGAGA TCCAGCAGCT CGCGCAGGAG CAGAGCCCGT ACGCGCCGGT GTACATCCCC AACCAGGTGC GGGCGTGGGA CAGCAAGGTG CAGGGACTCG GCCTCAGCAA CGACGGCGTC CTGCACCTGA CCGACGTCAC GATCGGCTGA
|
Protein sequence | MVRLSKSATA LAVLVAAWSL AACGGGAKVG DSTSDGGGGA STGASSSGGT LTVGLDSDPA SLDPTGDTGY AGSLVTPQIF ETLVVADDDG TIGPGLAERW TVSRDGRTYT LTLRKGVRFH DGTPLDARAV VASLRRSAGR ASPWAADLAP ITAMKATGED TVVLTLDRPN APLLSTLADK PGMIASPTAV EQAGRRFGSQ PVGTGPFAFD HWTRNQELML RRNPDYWDAG KPKLDAVVFK PLPDPTQKVT NLVAGQVQTV DYVPPELISR VEGASNLELE QGPGPYNSVV YVPMNAARPP LDDANVRQAV SLAIDRDSIV RNVAFGAGTP ARSMLSPTSW GYSDEIPAIP YDPARARTLL GGREVKLELQ VPPTYTQAAQ VMKQNLAEAG IDVTLRRMDW GQLIDGFYKG DFDMQVQDLL GMQRSDPDGA LSSFYAPDGS NNGAGFSDPQ ITALLDRARS GGDEAQRRPE YVEIQQLAQE QSPYAPVYIP NQVRAWDSKV QGLGLSNDGV LHLTDVTIG
|
| |