Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cwoe_5044 |
Symbol | |
ID | 8735510 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Conexibacter woesei DSM 14684 |
Kingdom | Bacteria |
Replicon accession | NC_013739 |
Strand | - |
Start bp | 5376910 |
End bp | 5378610 |
Gene Length | 1701 bp |
Protein Length | 566 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 646505671 |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_003396830 |
Protein GI | 284046490 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.449208 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGGTTGCCA AATCCATCAA GCGCAATCTG ATGCGTCGCC GCACAGCCAT CACGGCCGGC GTCCTCGCAC TGGGGGTCGC CGCTGCAGGC TGTGGCGCCG ATGCGCCGAG ATCGTCGGAC ACCGGCGCTT CCACGGGGGC CGGCACGGCG GCGCCCGCCG TCGGCGCCGC GGCGCAGACG GTCTACACGA CGACCGCCGC CAGAGGCGAG GTCGATTCGT TCACCTGGAA CCTGCCGAAC GGTGAGCCGG CAAGCCTCGA CTGGGCTAGA GCGTACGACT CCTCGCCGAA TCAGGTCCTC TCGAACATGT GCGAGAGCCT GATGCGTCAG CAGCCTGACT TCAGCATCGT CCCCGGCCTC GCCGAGTCGT TCGAGCAGGC CGACGACAGA ACGCTCGTCT ACAAGCTCCG CTCCGGCGTC AGATTCTGGG ACGGCAGAGC GATGACGGCC GACGACGTCG TGTTCAGCCT CTCGCGCCAC ATGGACCCCG ACCAGGGCTC GTTCTGGTCG ACGCCGTTCT ACTCCAACGT CAGATCGATC GAGAAGACCG GTGACCTCGA GGTCACCGTC AAGCTCAAGC GTCCGGACGC CGTCTTCAAC CGCATGATGG CGACCCCCGC GGGCGTCGTC GGCCAGCAGG CGTTCGTCGA GGCGAGAGGC CGCCGCTACG GCACGCCCAA CGGCGGCGTC ATGTGCACCG GCCCGTTCCA GCTCGACAGC TGGAAGCCCG GCTCGAGCGT CGCGCTGAAG CGCAACGACG CCTACTGGGA CGCCGAGCAC AAGGCGAAGG CCGGCGCGAT GACGTTCAGA TTCGTGACCG ACGAGTCGAC GATGATCGGC GGCCTGCAGT CCGGCGAGCT GGACGGCACG TTCCAGGTCC CGCCGGCCGG CGTCTCGCAG CTCAGAAGCG CCTCCGGCAC GCTCACGTTC GGCGCCTCGA CCGAGTGGTT CGCGTTCCGC CCGACGGAGA AGGACGGTCC GCTGAAGGAC CCCCGCGTGA TGAGAGCGCT GTCGCTCGTG CTCGACCGCG ACTCGATCGC CAGAGTCGTC TTCGGTGGCG CCGCCGTCGC CGCCGGCACG CCGATCCAGC CCGGCGCCTA CGGCTACGCC AGAGAGGTCT TCGCGGCCGC GGCCGAGCAG CTGCCCGCCC CGACGCCCGA CCCGGACGCC GCGAAGGCGC TCCTCGCCGA GGCCGGCGCC GCGGCGAGAC AGCCGATCGT CGTCGCCGTC CCCGCCGACG TGCGGACGTA CAACCAGGCG GCCCAGACGC TGCAGGACGC CGCGCGCCAG ATCGGTCTCG AGGTGAAGGT CGAGTCGATC TCGACCGCGC AGTTCACCAA CCTGTACTTC GACAAGGGCG CGCGCGCCCC GTACGACCTC TTCGCCGTGC AGCAGTACGG TGCCGGCGTC GCGGAGCCGC TGATCTCGCT GAGCGAGTTC ACGCCCCTCT CGGCCTACAA CTACGGCCAG CTGAGAGACC CGGTCGTGAC GAGATCGGTC GAGCAGGGCC TGGCGACCTA CGACGACGAG AAGCGCGCCG AGCTGGCGAC CAGAGCCGAG AAGGCGCTCG TCGACGCTCC CGGCCTGATA CCGGTCGTCA ACCTGCTCAC GTCGGTCTAC CAAGGGCCGA AGATCACCGG CTCGGTCGCG TCGCTGGCCT ACCTCTACTA CCCGTGGGCG GCTGACGTAG GCGCGCCATG A
|
Protein sequence | MVAKSIKRNL MRRRTAITAG VLALGVAAAG CGADAPRSSD TGASTGAGTA APAVGAAAQT VYTTTAARGE VDSFTWNLPN GEPASLDWAR AYDSSPNQVL SNMCESLMRQ QPDFSIVPGL AESFEQADDR TLVYKLRSGV RFWDGRAMTA DDVVFSLSRH MDPDQGSFWS TPFYSNVRSI EKTGDLEVTV KLKRPDAVFN RMMATPAGVV GQQAFVEARG RRYGTPNGGV MCTGPFQLDS WKPGSSVALK RNDAYWDAEH KAKAGAMTFR FVTDESTMIG GLQSGELDGT FQVPPAGVSQ LRSASGTLTF GASTEWFAFR PTEKDGPLKD PRVMRALSLV LDRDSIARVV FGGAAVAAGT PIQPGAYGYA REVFAAAAEQ LPAPTPDPDA AKALLAEAGA AARQPIVVAV PADVRTYNQA AQTLQDAARQ IGLEVKVESI STAQFTNLYF DKGARAPYDL FAVQQYGAGV AEPLISLSEF TPLSAYNYGQ LRDPVVTRSV EQGLATYDDE KRAELATRAE KALVDAPGLI PVVNLLTSVY QGPKITGSVA SLAYLYYPWA ADVGAP
|
| |