Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cwoe_2105 |
Symbol | |
ID | 8732548 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Conexibacter woesei DSM 14684 |
Kingdom | Bacteria |
Replicon accession | NC_013739 |
Strand | + |
Start bp | 2206848 |
End bp | 2208464 |
Gene Length | 1617 bp |
Protein Length | 538 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 646502723 |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_003393905 |
Protein GI | 284043565 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.0251373 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAGTGAGA GAGACCGCCG AACCTTGAAT CCGCACGCCA TCGACCGACG TGACTTCCTG CGCGCCGGCG CCGTGCTGGC CGCCGGCGCC GCCGGCGCCG CGGCGCTCGG CGCTGCGGGG TGCGGCGGCC GCGGGGCGGG GGCGAGCGGC GCCGTGGCGA TCTCCTCTGA CGCGCTCGGG CCGCGCGGCG GGACCGCGCG GCTGCTGCTC GGCGGCGGCG GGCCGCGGCT CGTGCTCGAC CCCGCGACGC AGGTCAACGA GCCGGACGCG ATCGTCGACG GGCTGCTCTA CGACGGACTC GTGCGCCTGC ACGACGACTG GCGGGTCGAG CCGCGGCTCG CGACCCGCTG GGAGTCCGAC GCGGCGCAGC GCGTGTGGCG CTTCGAGCTG CGCGACGGGG TCACCTTCCA CGACGGCCGG CCGCTGACGG CGAAGGACGT CGTCTACAGC CTCCGCCGTC TGCTGGACGA GCGGCTCGGC TCCGCGGTCT ACCCGCGGCT CAACGGTGAG CTGAGGCCCG ACGGCGTCCG CGCCGCCGGC TCCGGCGCCG TCGAGCTGCG GCTCACGCAG CCGGACGCCT TCCTGCCGGT CGCGCTCGGC GCCCGCCACT GCAAGATCGT CCCGGCCGGC ACGACTGACT TCTCCCGCGC GATCGGGACG GGACCGTTCC GCCTGCGCTC GCTCGACCAG TCGAAGCTCA GCTTCGAGCT GGAGCGCAAC CCGGGCTTCT GGCAGGAGGG GCTGCCGCGC CTGGACCGGA TCGAGGGGAT GCTCGCCAAC GACCAGGCGT CGCTCGTGCA GTCGGTCGCG TCCGGCCGCT TCCACTTCGG CGGCTTCATC GACCCCTCGC TCGCGTCGAG CGCCGAGGCG AGCGGCGACG CGCGGCTGCT CGCGCACCGC TCCGCGCTCT TCAACGACCT CGTCGCGGCG GCCGACTCCG AGCCGTTCAC GAACCCCGAC GTGCGGACGG CGCTGAAGCT CGCGATCGAC CGTGAGCAGA TCCTGAGCCT CGCCTACAAG GGCCACGGCA GCATCGCCCA CGACGTGCCG GTGCGGACCG CGGACCCGTT CTTCGCCGAG GGGCTCGCGC ACCGCACTCG CGACGTCGAC GAGGCGCGTC GGCTGCTGCG CCGGGCCGGC TACCCGAACG GCATCGACCT CGAGCTGCTG ACCGCTCCCG CCGGCGCCGC AATGGTCGAC ATGGCGGTCG TGGCGAAGGA GAGCCTCGCC GAGGCCGGCA TCCGCGTCTC GGTCCAGCAG CGACCGGCCG GCACCTACTA CGACGCCGTC TGGTTGAAGG AGGCGTTCTA CGTCGACACG TGGGTGCTGC GCCACCCGCT CGACGCGATG GCCGTGATGT TCGAGAGCTC CGCCCCGTGG AACGAGGCGA GACTGCGCTC GCCGCGGCTC GACGAGCTGC TGCGCGAGGC GCGCAGCACC GGCGAGCGGT CCGAGCAGGC GCAACTGCTC GGCGCGGCCC AGACGCTCGT CGCCGACCAG GCCGGCTTCG TCTGCCCGGC GTGGCTGGAC GAGCTGTACG TCGCCAAGCC CGAGCTGGCC GGGGTCGGCT TCAACGCGAC CGACCTCGTC GACTTCCAGC GAGCGTCGCT GGGCTGA
|
Protein sequence | MSERDRRTLN PHAIDRRDFL RAGAVLAAGA AGAAALGAAG CGGRGAGASG AVAISSDALG PRGGTARLLL GGGGPRLVLD PATQVNEPDA IVDGLLYDGL VRLHDDWRVE PRLATRWESD AAQRVWRFEL RDGVTFHDGR PLTAKDVVYS LRRLLDERLG SAVYPRLNGE LRPDGVRAAG SGAVELRLTQ PDAFLPVALG ARHCKIVPAG TTDFSRAIGT GPFRLRSLDQ SKLSFELERN PGFWQEGLPR LDRIEGMLAN DQASLVQSVA SGRFHFGGFI DPSLASSAEA SGDARLLAHR SALFNDLVAA ADSEPFTNPD VRTALKLAID REQILSLAYK GHGSIAHDVP VRTADPFFAE GLAHRTRDVD EARRLLRRAG YPNGIDLELL TAPAGAAMVD MAVVAKESLA EAGIRVSVQQ RPAGTYYDAV WLKEAFYVDT WVLRHPLDAM AVMFESSAPW NEARLRSPRL DELLREARST GERSEQAQLL GAAQTLVADQ AGFVCPAWLD ELYVAKPELA GVGFNATDLV DFQRASLG
|
| |