Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cwoe_0456 |
Symbol | |
ID | 8730884 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Conexibacter woesei DSM 14684 |
Kingdom | Bacteria |
Replicon accession | NC_013739 |
Strand | + |
Start bp | 471717 |
End bp | 473312 |
Gene Length | 1596 bp |
Protein Length | 531 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 646501070 |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_003392267 |
Protein GI | 284041927 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.15867 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCATGA AGCTCGCCAG AGGCGTCGCG ACGTTGCTCG CGGCCGCCCT CGTGGTCGCC GGATGCGGCG GGACCAGCAG CAACAACGAC AGCACCGGTT CGACCGGCTC CGGCGGCGGC ACCTCGGCGT CGTTCGACGC TGAGGCGACG ATTCGCACCG CGCAGTTCGG CGATGGCGTC GGCGGGATGG ACCCGGAGAT CTGGTACGAC CTCAACGGCG GGTCGCTGCA CACCGCCGTC TACGAAGGGC TGCTGCGCTA CAAGACCGGC ACGACCGAGA TAGAGCCCGC GCTGGCCGAG TCCTACGAGG TCAGCAGAGA CGGCAGGACC TATACGTTCA AGCTTCGTCA GGGCGTCAGA TTCCACGACG GCACGCCGCT GACGCCGGAG GCGGTCGCCG GCTCGTTCGC GCGCCGCGCC GCGCTCAGAG GCCCGTCGGC GTACCTGACG GCCGGGGTCG CCGACGTGCG GCCGCGCGGG AGCGACACGG TCGTGATCAA GCTGAGAAGC CCGGAGATCG GGTTCCTCGA CGCGCTCGCC TCGATCTACG GGCCGCGCGT GATCAGCCCT GCGGCGCTCA GAGCGCACGG CGCGGGCGAG GACGGCAAGC GCTGGTTCGC GTCCAACGCC GTCGGCACCG GGCCGCTGAG ACTGCTCAGC TTCAGACTCG GCGACGGCGC GACGCTCGAG CGCTTCGACG GCTACTGGGG CGAGCAGGCC AAGGCGAAGC GCTACGAGGT CGACACGCTG CCGAGCAGCG GCGAGCAGCA GCTGCAGCTG CGCTCCGGCC AGCTCGACTA CCTCAGCGGC GGCTCGCTGC AGCCGGCGCA GCTGAAGGCG TTCGACGGCA ACCCGAGATA CGAGGTGACG CGCCTGGACC AGGCGTTCCG GCCGATGCTC GTGCTGAACA CGAACAAGCC GCCGTTCGAC GACGTCGAGA AGCGCAAGGC GTTCGTCGCG GCGCTCGACG TCGACGCCGC GATCAGACAG GTGTGGGGCG ACGAGCTGAT GGAGGCGCCG ACCTCCTACA TCTCGCCGTT CCTGCTCGAC CCGGCGCTGA ACAGAATCGA GCCGCTGACC AGCGACGCCG CGCTCGACGA GCCGGTCACG TTCGAGTACG TCGGCGCGAT CCAGTCGCAC CGTCAGTTCA GCGAGGTGAT CCAGCAGCAG CTGCGCGACG AGGACGTCGA GCTGAGACTG AGCGCGACGA CCGGCGGCGA GGTCTTCTCG TGGCCGCAGG ACGTGCAGAA GGCGCCGAAC GCCGCGATCG TCACCGTCTA CGGCGACTCG GCGTACGTGC AGAGCCTGGT CGACCCGTTC TTCCGCACCG GCTCCGCGGT CAACTTCCTC GGCTACTCGA ACAGAACGGT CGACGCCACG CTCGACGAGG CGGCCATCCA GACCGACCGT GACAAGGCGC TCCAGCTGTT CGCCGACGCG AACAGAATCG TCGCGGTCGA CGACGCGTCG ATCATCCCGC TCGGCGACCT CAAGCAGCCG ATCGTGGCGC GCAGGGGCGT CAGCGGCTTC CAGGGCACGC CGACGTCGAT CGACGTCGTG CAGCTGGCCG CGATCGGGAA GTCCGCGGAC GCGTGA
|
Protein sequence | MTMKLARGVA TLLAAALVVA GCGGTSSNND STGSTGSGGG TSASFDAEAT IRTAQFGDGV GGMDPEIWYD LNGGSLHTAV YEGLLRYKTG TTEIEPALAE SYEVSRDGRT YTFKLRQGVR FHDGTPLTPE AVAGSFARRA ALRGPSAYLT AGVADVRPRG SDTVVIKLRS PEIGFLDALA SIYGPRVISP AALRAHGAGE DGKRWFASNA VGTGPLRLLS FRLGDGATLE RFDGYWGEQA KAKRYEVDTL PSSGEQQLQL RSGQLDYLSG GSLQPAQLKA FDGNPRYEVT RLDQAFRPML VLNTNKPPFD DVEKRKAFVA ALDVDAAIRQ VWGDELMEAP TSYISPFLLD PALNRIEPLT SDAALDEPVT FEYVGAIQSH RQFSEVIQQQ LRDEDVELRL SATTGGEVFS WPQDVQKAPN AAIVTVYGDS AYVQSLVDPF FRTGSAVNFL GYSNRTVDAT LDEAAIQTDR DKALQLFADA NRIVAVDDAS IIPLGDLKQP IVARRGVSGF QGTPTSIDVV QLAAIGKSAD A
|
| |