Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cwoe_2236 |
Symbol | |
ID | 8732679 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Conexibacter woesei DSM 14684 |
Kingdom | Bacteria |
Replicon accession | NC_013739 |
Strand | + |
Start bp | 2352865 |
End bp | 2354508 |
Gene Length | 1644 bp |
Protein Length | 547 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 646502854 |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_003394036 |
Protein GI | 284043696 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.0669965 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.364829 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGGGCAC ACTCGTTCCG GGCCGTCGCG GGCGCCGTCG CAGTCGGCGT CGCGCTGCTT GGCGCCGGCT GCGGCCGATC CAGCGAGGAC GCCGGCGGCA CGACGGCCGG CGTGAGCAGA TCGGCTCCGC TCTCGGCCAC GACGCCCGCC GGGTCCGCCG AGGTCGACAG CGCGACGTGG GCGATGTACC GCGACACGAT CACGGTCGAC CCGATCTTCG CCGGTGACTA TCCCGAGCGT CAGGTCGTCG CGCTGATGTG CGAGTCGCTG CTGCGCCAGC AGCCGGACGG CACGACGGCG CCGGGCCTGG CGAGACTCTC CTACCGCGAC CCGAGAACGG TCGTGCTCAC GCTCGCCGAC GGCGTCAGAT TCTGGGACGG CAGCCCGCTG ACCCCGGCTG ACGTCGTCTA CAGCATCGAC CGCAACCGCG ACCCGAGAGT CGGCGGGTAC TGGGCGAGCA ACTTCGGTAG CGTCGACACT GTCACCGTGA CTGGCGAGCA TGAGGTGACA CTCAAGCTCA GACGGCCGGA CTACTGGCTC GAGGGCGTGC TCTCGTTCAT GGCGGGAGTG GTGGTGAAGA AGTCGTACGC GCAGGAGAGA GGCAAGGACT ACGGCACGCC GAGCGGCGGC GCGATGTGCA CCGGCTCCTA CAGACTCGGC GCCTGGAGAA CCGGCGGCGC GGTCCAGCTC GTGCGCAACG ACGACTACTG GAACTCCGGC GTGAGACCGC ACGTGCGAGA GCTCAGCTTC AAGGGAGTGC CCGACCACTC CGCCCTCACC GCCGGCCTGC TGACCGGCGA GATCGACGGC ACCTACCCGC TCGGGCTCTC GACGCTCGAC CAGCTGCGCC AGAGCGACGC GGTCGAGGTC TACGAGGGGC CGTCGTACAT GGTCGGCGCG ATGATCCTCA ACCTCGACGG CCCGCTCGGC GACGTGCGCG TGCGCCAGGC GCTGTCGTTG GCGCTCGACC GCCAGGGCAT CGTCGCGACG ACCTTCAAGG GCACCGCCGA GCCCTCGCGT GCGCTCGCCA GCCCCGGCAC CTGGGGCTAC GCGAAGGACG TCTTCAGCGC CGCGTGGGAC GCGCTCCCCG CGCCTGAGCC CGACCTCGAC GCGGCCAGAA GACTGGTCGA GGAGGCCGGC GCGAGCGGCA GAGAGATCAC GATCGCGACG TCGAGCGAGC TGCAGAACAT CGACACCGAC GCGAACGCCT ACCGCACCGC GGCCGAGGCG ATCGGGCTGA GAGTCAAGCT GAAGTCGAGC CCGGCGGCCG TCTACTCGAA CCTCTTCGTC GACGCCGACG CGCGCAAGCA GGTCGACGCG TTCGCGACGA TGAACTACGC CAACTGGGGC GACCCGGCGT CGCTCTACGC GCCGCTGACG TTCGCCGACG GCAGCCAGAA CTACTACGGC TACAGATCCT CCGCCGCGAG CGCCAAGCTG GAGCAGGCGC GCGCCACCGC CGATCCGCAG GAGCGCGCGC GGCTCGTCAC CGAGGCGCAG CAGACGATCA CCGAGGAGCT GCCCTGGATC GCGACCGTCT CGCCGCACAC GGTGCTCGTG ATGAGCTCGA AGCTGACCGG CGCGCCGGCC TCCTCGGTCT ACCTGTCGTC CCCCTGGGCC GACACGCTCG GCGGGAGAGG GTAG
|
Protein sequence | MRAHSFRAVA GAVAVGVALL GAGCGRSSED AGGTTAGVSR SAPLSATTPA GSAEVDSATW AMYRDTITVD PIFAGDYPER QVVALMCESL LRQQPDGTTA PGLARLSYRD PRTVVLTLAD GVRFWDGSPL TPADVVYSID RNRDPRVGGY WASNFGSVDT VTVTGEHEVT LKLRRPDYWL EGVLSFMAGV VVKKSYAQER GKDYGTPSGG AMCTGSYRLG AWRTGGAVQL VRNDDYWNSG VRPHVRELSF KGVPDHSALT AGLLTGEIDG TYPLGLSTLD QLRQSDAVEV YEGPSYMVGA MILNLDGPLG DVRVRQALSL ALDRQGIVAT TFKGTAEPSR ALASPGTWGY AKDVFSAAWD ALPAPEPDLD AARRLVEEAG ASGREITIAT SSELQNIDTD ANAYRTAAEA IGLRVKLKSS PAAVYSNLFV DADARKQVDA FATMNYANWG DPASLYAPLT FADGSQNYYG YRSSAASAKL EQARATADPQ ERARLVTEAQ QTITEELPWI ATVSPHTVLV MSSKLTGAPA SSVYLSSPWA DTLGGRG
|
| |