Gene Information |
Locus tag | Cwoe_2307 |
Symbol | |
ID | 8732750 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Conexibacter woesei DSM 14684 |
Kingdom | Bacteria |
Replicon accession | NC_013739 |
Strand | + |
Start bp | 2435310 |
End bp | 2436857 |
Gene Length | 1548 bp |
Protein Length | 515 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 646502925 |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_003394107 |
Protein GI | 284043767 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
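The start and end coordinates, gene length, and protein length in this record are mutually consistent, which a little arithmetic confirms. The following is a minimal Python sketch, assuming the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon that is not counted in the protein length. The helper names `gene_length_bp` and `protein_length_aa` are illustrative and are not part of the record.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if gene_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_bp // 3 - 1  # drop the terminal stop codon


length = gene_length_bp(2435310, 2436857)
print(length)                     # 1548
print(protein_length_aa(length))  # 515
```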
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.0444045 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGATGCCGA TCCGGTTCGG ATCAGCGCTG GTGGTCGCGG GTGTCAGCCT GACGGTCGCC GCCTGCGGTG CGACGGGTGG AGAGGAGTCG GGCGGCGCGC CGTCGCAGTC GGTGTCGGTC GCCGTCAACA GCGCGCCCGC GTCGCTCGAC CCCGCGAAGG CCGCGACCCA GAGCGACACG GTGCTCGCCC GCGCGCTCTA CGACACGCTC GTGCGCGTCG ACGTGGACGG TGAGATCGTT CCCGGGCTCG CGACGAAGTG GACGCAGCGT CCCGACAGGG CGGTCTTCAC GCTCCGCAGG GGCGTCACCT GCTCGGACGG CAGAGCGCTG ACGGCGAGCA TGGCCGCCGC ATCGCTGAAC CGGCTCGTGG CGCCCGAGAC CGCAGCGCCG ACGGCGTCCT CCTCGTTCGG CGGGGCCGGC ATGAAGGCGA CCGCCGACGA CGCCGCCGGC ACGCTCGCGG TCACGCTCGA CAGACCCTGG TCGGACCTCG TCAACGCGCT CGGGATGCCG GCGACGGCGA TCATCTGCAT GGAGGGCGAG CAGGCGCCGG CGACGCTCGA CAGACAGTCG GCGGGCACCG GGCCGTACGT GCTCGACAGC GTGCGCAGCG GCGACCGCTA CACGCTCGCG CGCCGCGACG GCTACACGTG GGGGCCGAAG CTCGGCGCCG TCGGCTCCGG CGAGCAGCCG AGAGAGGTCG TCGTGCGCGT CGTCGCGAAC GAGAGCACCG TCGCGAACCT GCTGCAGACG AGAGCGCTCG ACGCCGCCGT CCTCGCCGGC TCCGACGTCG ACCGGCTGGA GGGCGACGAC GCGCTGGAGG TGCAGGAGAC CGACGCCGGC AGCATGTTCG TGATCTTCAA CGAGGATCCC GCGCGCCCGT TCGCCGACGC GAGACTGCGC CGCGCGGCGG CGCAGGCGAT CGACCGCGAG GCGTTCCTGC GCGCCGTCGG CGGGCGCGGC AGACTGACGC CGAGCATCGT GCAGCCGGGC GTCGCCTGCT TCGACCCGGC CGTCGAGCAG GTCTTGCCTG GGAACGACGC GGAGGCCGCG GCGCAGACGC TCGGCGCCCA CGGCGGCTCG CTGAAGATCA TCGGGACGAC GCTCGTCGGC AACGGCCAGG GCACGACGTA CATCCAGGAG GCGCTGCGCG CCGCAGGCGC TGAGACGACG CTTCAGAACA GCGACCTGAC GTCGTGGGCG GGCAAGCTGT TCGACCCTGC GAAGGACTGG GACCTGACCG TCCTCGTCGT CCAGAACATC TCCAACTCGA TCTCGCAGGT CGCGAGCCTG ATGGTCGGCG AGGCGCCGCC GAGAGGCTCC AACGTCGCGA GCCTCCAGAA CCCGGCGTGG AAGCGGGCGG TCGCGCGCGC CACGTCGACC GTCGGCGACG GGCGCTGTGG CGCCTGGGGC GACGCGCAGA GAGCGGTCGT GGAGGACGTC GACGTGCTCC CGCTGACGAG CTTCACCGTC GCCGCCGTCT GGAGCGACGA CGTCACCGGG CTCGCGCCGC AGGGCATGAT CGAAGTCGGC TCGATCCGGG GCCGCTGA
|
Protein sequence | MMPIRFGSAL VVAGVSLTVA ACGATGGEES GGAPSQSVSV AVNSAPASLD PAKAATQSDT VLARALYDTL VRVDVDGEIV PGLATKWTQR PDRAVFTLRR GVTCSDGRAL TASMAAASLN RLVAPETAAP TASSSFGGAG MKATADDAAG TLAVTLDRPW SDLVNALGMP ATAIICMEGE QAPATLDRQS AGTGPYVLDS VRSGDRYTLA RRDGYTWGPK LGAVGSGEQP REVVVRVVAN ESTVANLLQT RALDAAVLAG SDVDRLEGDD ALEVQETDAG SMFVIFNEDP ARPFADARLR RAAAQAIDRE AFLRAVGGRG RLTPSIVQPG VACFDPAVEQ VLPGNDAEAA AQTLGAHGGS LKIIGTTLVG NGQGTTYIQE ALRAAGAETT LQNSDLTSWA GKLFDPAKDW DLTVLVVQNI SNSISQVASL MVGEAPPRGS NVASLQNPAW KRAVARATST VGDGRCGAWG DAQRAVVEDV DVLPLTSFTV AAVWSDDVTG LAPQGMIEVG SIRGR
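The gene and protein sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any computation. The sketch below, in Python, shows one way to strip the spacing, compute a GC fraction, and translate the coding sequence. It assumes the amino-acid assignments of the standard genetic code, which translation table 11 shares, and it does not handle the alternative start codons that table 11 permits (this gene starts with ATG, so that does not matter here). The function names are illustrative, and only the first 30 bases of the gene are used as input.

```python
# Codon assignments in TCAG order. Translation table 11 uses the same
# amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[pos:pos + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence from this record.
prefix = "ATGATGCCGA TCCGGTTCGG ATCAGCGCTG"
print(translate(prefix))    # MMPIRFGSAL
print(gc_fraction(prefix))  # 0.6
```

The translated prefix matches the first block of the protein sequence. Passing the full gene sequence to the same functions would be expected to reproduce the listed protein and a GC fraction close to the 73% reported in the record.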