Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cwoe_2501 |
Symbol | |
ID | 8732944 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Conexibacter woesei DSM 14684 |
Kingdom | Bacteria |
Replicon accession | NC_013739 |
Strand | + |
Start bp | 2661842 |
End bp | 2663668 |
Gene Length | 1827 bp |
Protein Length | 608 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 646503116 |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_003394298 |
Protein GI | 284043958 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAGGATG GTCGTCGACG GCGCTTGTCG CCCGTCGGGA TGATCGGCGC CGCCGTGCTG CTGATCGTGC TGGTCGTCGT GGTCGTGTCG GTCGCCGGTG GCGGTGATGA CGACGACAAG ACGAGCGTGA GCGCAGGCGG CGGGTCCACC ACCGCGCCGG CCAGAAGCTC GGGCGGGGCG GCGACGCGTG AGGAGACGCT CGTGCTCGGG CAGTACCGCC CGCCCACGGG CAAGATCGGC AACCCGTACG TGCAGGCGAG CGACGCGCTC GTCTCCGACG GGCTGCACGA GCTGGTCTAC GAAGCGCTGT TCTACGTGAA TTACCAGACC GGTGAGACCG AGCCGTGGCT CGCGACCGGC TATGAGTACA GCGACGACAA CAGAACGATC ACGCTGAGAC TGCGTGACGA CGTCAGCTGG AACGACGGCA AGCCGTTCAG CGCCGACGAC GTCGTCTACA CGATGAGACA GATCCTGGCG GCCAGAGCGC CGTTCCGCGC GGCCAACATA CAGGGCGCGG TCAGATCGAT CAGAAAGCTG TCGCCGACCG AGGTCCGGAT CGACCTCAGA GCGCCCAACC CGCGCTTCGT CGACAGCGAG CTGTCCTCCT ACGTCTACAC CGCGAACTTC ATCCCGCTGC CCAAGCACGT CTTCGAGGGG CAGAGATTCG AGACGTTCGC CTTCTACGAC CTCGCCAGAG GCCTGCCGCT CGGCACCGGC CCGTACCGCC TCACCGACGT CACCGCCTCC GCGGCGACGC TGCAGCGCAA CGACGACTGG TGGGCGGCGA GAGCCGGCGT CGCCGACGTC GTGCCGAAGA AGGTCGTCTA CACGAGCCCC GGTCCCGAGG ACTCGGCCGT CTCCGGCCTG GAGAGCAGCG CGCTCGACTA CGCCGGCCAG TCGGTCCCGT CCGTCGCCGG CTTCATCGCC GCGAAGGAGC GCAACCCGCA GCTCGTCAAC TGGGACGGCG ACCTTGGCTG GCTCGACCCG TGCCCGTACG CGCTGACCGT CAACACGAAG CGCAGACCGT GGGACGACGC CGAGCTGCGC TGGGCGCTCA ACGCCTCGAT CGACAAGGAG CAGTTCAGCC GCCTCTTCAA CACCCCCGGC GAGTCGACCC CGGCGCGCAC GACGTACCCC GAGTACCCGC AGCTGAGCGA GCTGATAGAC GCCAACGAGG ACCTGCTCGC CGAGTACCCG ACGCTCGACC ACGACCTCGA CAGAGCGGCG CAGATCTTCG AGTCGAAGGG CTACAGAAGA GAGGGCGGCG TCTGGACCAA GGACGGGCAG AAGCTGTCGC TGAAGCTCAA CCTCTTCTCG CCGGCCGCGC TCGGCCCGGT CTGGGGCGAT GCGGCGCAGC TGCTCAACCA GCAGTTGAGA GAGGCCGGGA TCGCCGTCGA GGTCGACCCG GGCGACTTCA ACACGATCGC GGCCAACCGC GCCGAGGGCA GATTCGACGC GCAGTCGTGG TTCGAGTGCG GCAGCGTCAC CGATCCGTGG GCGACGCTCA ACCGCTACAC GAACGCGCCG GGCAACGACA ACGCCGGCAG ATGGAGCAAC GCCGCCTACG ACAGAATCGT CGCGCAGATG GGCGAGCTGC CGCCGGGCGA CGCGCAGATC AGAGAGCTGT ACGCGCAGGC GATGGAGATC TGGCTCAGAG AGCTGCCGGT GATCCCGCTC AACCAGCGGC CGACGCCGAT CGTGATGAAT CAGACGTACT GGAGAAACTG GCCGACCGCG GACAACGGCT ACACGCAGCC CGCGCCGTTC GGGATGAACT TCCACCAGGT CATCACCAGA CTCCAGTCCG CGAGAGGCGA GCAGTGA
|
Protein sequence | MEDGRRRRLS PVGMIGAAVL LIVLVVVVVS VAGGGDDDDK TSVSAGGGST TAPARSSGGA ATREETLVLG QYRPPTGKIG NPYVQASDAL VSDGLHELVY EALFYVNYQT GETEPWLATG YEYSDDNRTI TLRLRDDVSW NDGKPFSADD VVYTMRQILA ARAPFRAANI QGAVRSIRKL SPTEVRIDLR APNPRFVDSE LSSYVYTANF IPLPKHVFEG QRFETFAFYD LARGLPLGTG PYRLTDVTAS AATLQRNDDW WAARAGVADV VPKKVVYTSP GPEDSAVSGL ESSALDYAGQ SVPSVAGFIA AKERNPQLVN WDGDLGWLDP CPYALTVNTK RRPWDDAELR WALNASIDKE QFSRLFNTPG ESTPARTTYP EYPQLSELID ANEDLLAEYP TLDHDLDRAA QIFESKGYRR EGGVWTKDGQ KLSLKLNLFS PAALGPVWGD AAQLLNQQLR EAGIAVEVDP GDFNTIAANR AEGRFDAQSW FECGSVTDPW ATLNRYTNAP GNDNAGRWSN AAYDRIVAQM GELPPGDAQI RELYAQAMEI WLRELPVIPL NQRPTPIVMN QTYWRNWPTA DNGYTQPAPF GMNFHQVITR LQSARGEQ
|
| |