Gene Information |
Locus tag | Cwoe_5802 |
Symbol | |
ID | 8736278 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Conexibacter woesei DSM 14684 |
Kingdom | Bacteria |
Replicon accession | NC_013739 |
Strand | - |
Start bp | 6210367 |
End bp | 6211701 |
Gene Length | 1335 bp |
Protein Length | 444 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 646506429 |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_003397578 |
Protein GI | 284047238 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
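The coordinate and length fields above are related by simple arithmetic, which the following minimal sketch checks. It assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are illustrative and do not come from the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # The terminal stop codon does not encode an amino acid.
    return codons - 1 if has_stop_codon else codons


# Values taken from the record for Cwoe_5802.
length_bp = gene_length(6210367, 6211701)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1335 444
```

Under these assumptions the result agrees with the reported gene length of 1335 bp and protein length of 444 aa.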
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.0809589 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGAGAGG TCTACGAAGT GCGCACACGG AACAGAGGGC TCACACGCGC CGCGCTCGCC GCCCTGCTGG TGGCGCTGCT CGCACTCGTC GCCGCCGGCT GCGGCAGCGG CGACGACGAC AGCGGCGGCG ATGGTGGTGA CGGCCCAGTC GAGATCACGT TCTGGCACGG CCAGAACCAG ACCGCGCAGC AGACGATCGA AGGGCTCGTC GACAGATTCA ACGCCTCGCA TCCCGACGTG AAGGTCAAGG CCGAGGTCGG CGCGCTCGCC GACAGCCTCT ACCAGAAGAC GACGGCCGCG CTGGCCGGCG GCAAGTACCC CGACGTCGTC TACCAGTTCG GCCCCAACAT CGCATCGCTC GCGCGCAGCC CGAAGGCGCT CGACCTGACC GACGCCGTCA GAGACGCGGC GTGGAGATGG GACGACTTCT ACCCGCCCGC GCGGGAGGCC GTCACGGTCG ACGGCAAGGT CCGCGCCGTG CCCGCGCTGA TCGACTCCTT GGCCGTCGTC TACAACAGAA GACTGTTCAG AGAGGCGGGC ATCCCGGCGC CGAGAGCCGG CTGGACGTGG GACGACTACC GCGCGATCGC CAGACAGCTG ACCGACTCCT CCAAGGGGCA GTTCGGCAGC GCGTGGCCGG GCGTCGGCGA CGAGGACACC GTCTGGCGGC TGTGGCCGAT GGTGTGGCAG CTCGGCGGCG ACGTCACCTC GCCGGACGGC GAGCAGGCCG GCTTCGAGGG CGAGAGCGGG CTGACCTCCT TCACGACGAT CAACGACATG GCGGTCACGG ACAGATCGCT CTACATCGAC AAGACTGCCG GCAGCGAGAA GATGTACGCC ATCTTCAACA CCGGTCGCAT CGGCATGGTC CCGACGGGTC CGTGGCAGGT CCCCGAGTTC GTCAAGGCGA GAGTCGACTA CGGCGTCGTT CCGATGCCGA GCTACTCGGA CAGACCGACG ACGATCTCGG GCCCGGACGC GTGGATGCTG TTCGACAACG GCGACGCGCG CGCCAGAGCG GCGCAGGAGT TCGCGCAGTG GCTGACGCTG CCCGAGCAGG ACGCCGTGTG GGACGTGGAC GCCGGCTCGC TGCCGCTGCG CAGATCGACC GCGCAGCAGC CGATATGGAG AAGACACGCG CAGGAGGTCG TCGGGCTCGA CGTCTTCACC GCTGCGCTCG AGCAGGCGCG TGTGCGCCCG ACGATCCAGG CCTACCCGAA GCTGTCCGAG GCGGTCGGGT CGGGGATCGT CGACGTCCTG CTCGGCACCG CCGACCCGCA GGAGGCGCTC GACAAGGCCG TCGACGGCGC GAACGAGGCG CTCGCCGGCG ACTGA
|
Protein sequence | MREVYEVRTR NRGLTRAALA ALLVALLALV AAGCGSGDDD SGGDGGDGPV EITFWHGQNQ TAQQTIEGLV DRFNASHPDV KVKAEVGALA DSLYQKTTAA LAGGKYPDVV YQFGPNIASL ARSPKALDLT DAVRDAAWRW DDFYPPAREA VTVDGKVRAV PALIDSLAVV YNRRLFREAG IPAPRAGWTW DDYRAIARQL TDSSKGQFGS AWPGVGDEDT VWRLWPMVWQ LGGDVTSPDG EQAGFEGESG LTSFTTINDM AVTDRSLYID KTAGSEKMYA IFNTGRIGMV PTGPWQVPEF VKARVDYGVV PMPSYSDRPT TISGPDAWML FDNGDARARA AQEFAQWLTL PEQDAVWDVD AGSLPLRRST AQQPIWRRHA QEVVGLDVFT AALEQARVRP TIQAYPKLSE AVGSGIVDVL LGTADPQEAL DKAVDGANEA LAGD
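The sequences above are printed in space-separated blocks of ten characters. The sketch below shows one way to strip that formatting, compute a GC fraction, and run a basic sanity check on a coding sequence. It is only an illustration: the helper names are hypothetical, the start-codon check accepts only ATG (translation table 11 also permits alternative start codons), and the short strings used in the demo are not the full gene sequence.

```python
# Stop codons used by translation table 11 (bacterial, archaeal, plant plastid).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean_sequence(raw: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


def looks_like_complete_cds(seq: str) -> bool:
    """Simplified check: length divisible by 3, ATG start, terminal stop codon."""
    return len(seq) % 3 == 0 and seq[:3] == "ATG" and seq[-3:] in STOP_CODONS


# Demo on the first two blocks of the gene sequence only.
sample = clean_sequence("ATGAGAGAGG TCTACGAAGT")
print(len(sample), gc_fraction(sample))  # 20 0.45
```

To check the reported GC content of 69%, pass the full gene sequence from the record through `clean_sequence` and `gc_fraction`. Because the gene lies on the minus strand and the printed sequence begins with ATG, the sketch assumes the record already shows the coding strand, so no reverse complement is applied.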
|
| |