Gene Information |
Locus tag | Cwoe_2051 |
Symbol | |
ID | 8732494 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Conexibacter woesei DSM 14684 |
Kingdom | Bacteria |
Replicon accession | NC_013739 |
Strand | + |
Start bp | 2154668 |
End bp | 2156236 |
Gene Length | 1569 bp |
Protein Length | 522 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 646502670 |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_003393852 |
Protein GI | 284043512 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTGAAAC GTGGACGGCG AGCCGCCCTG GCAGCGGTTG TCGGCGCGGT CGCAGTGGCG GCCGTGGGAT GTGGATCGAG CGGCTCGAGC TCGTCGGGCG GCGATCCGGT GGAGGGAGGC ACCTTCACGG CGGCGATCCC GTTCGACCCG GGCGCGAGCT TCGACCCGTA CCGCAACCTG AAGGTCGCGC CGATCGCGAG CTACGCCTAC GACAGCCTCG TCAAGCTCGA CGGCAGAGGC GGCGTCGTGC CGAACCTGGC GAAGTCGTGG TCGTCGAGCG CGAGAGGCGC GACGTTCGCG CTGCGCTCGG GGATCACGTG CTCGGACGGG ACGCCGCTGA AGGCCAGCGG CGTCGCCGCG GCGCTCAACT CCGTGCTCGA CCCGAAGTCG AAGGCGACGG TCTTCGGCGT GCTCACGCCC AGCATCCCCT ACACCGCGAC CGCTGACGAC GCGCGCGGCA CGGTGAAGGT GACGATGGCG AGCCCGTTCA GCTTCACCGT CGAGCAGCTC GGCCAGTTCC CGATCGTCTG CCCGAGAGGG CTCGCCGACC CCGAGCTGCT GGCGAAGCGC TCGCTCGGAA CCGGCCCCTT CGTGCTCGAC GAGATCGCCC CGGGCGATCG GATCTCCCTC AGCGCCCGCA GAGGCTACGC GTGGGGCCCG GAGGGTCGCA ACGCCGGCGC GGGCGCGCCC TCGCGCGTTG TCCTGCGGAT CGTGCCGAAC CCGACGACCG CGGCGAACCT GCTCGCCTCC GGCGAGGTCA ACGCCGCCGA GCTGGCCGGT CCCGATCGCG AGCGCGTCGA GGCGTCGGAC CTCTTCAGCC GGCAGTACGA GGCGCCGTTC GGGCTGCTGA CGTTCAACCA GCTCGACGGA CGCCCGCTGC GCGATCGCAG CGTGCGCACG GCGCTGCTGC AGGCGCTCGA CCTCGACCGC CTCGCGCAGG TCGGCACGAG CGGCATCGGC GCACGCGCGA AGAGCCTGCT GACCGGCTCG CCGCCGGTCT GCGCGTACGA CAGCGTCAGA GGAAACGTCC CGACCTCCGA TCGGCAGGCG GCGCGCGACG CGCTCGCGCG TGCCGGCTGG ACCGACAGCA GACCGCTGCG CGTCGGGCTG CACTACCAGA CCGACATCCT CGGTCCGGCG GGCGCTGCGG CGGTCGAGCT GATCGCCGAG CAGTGGAAGC AGCTCGGCGT CGAGACGAAG CTCGTGCCCG AGGACATCAA CGCGTCGCTC AACACGCTCT ACGTGACGCA CGACTGGGAC GTCTACTGGG GCGAGCAGAA GATCGTCTTC CCGTCTGACC TGCCGACCTA CTTCTCCGGG CCGGCGCCGC CGAGAGGCAA CAACTACCCC GGCGTGAGCA ACCCCGCGTT CGACCGCCTC GCCGCGGCCG CGCAGGGCAC CGCGGGAGCC AAGAGCTGCC CCGCCTGGGA GCAGGCCGAG GGTTCCCTGA TCGGCCAGGC GAACATCGTC CCGGTCGCCA CGTCGCCGAC GACCGTCTTC GGCAACGGCG CGCGCTTCGA GCTGCAGGGG CTGACGATCG CGCCGACGAC GATCCGGCTG TTCGGGTAG
|
Protein sequence | MVKRGRRAAL AAVVGAVAVA AVGCGSSGSS SSGGDPVEGG TFTAAIPFDP GASFDPYRNL KVAPIASYAY DSLVKLDGRG GVVPNLAKSW SSSARGATFA LRSGITCSDG TPLKASGVAA ALNSVLDPKS KATVFGVLTP SIPYTATADD ARGTVKVTMA SPFSFTVEQL GQFPIVCPRG LADPELLAKR SLGTGPFVLD EIAPGDRISL SARRGYAWGP EGRNAGAGAP SRVVLRIVPN PTTAANLLAS GEVNAAELAG PDRERVEASD LFSRQYEAPF GLLTFNQLDG RPLRDRSVRT ALLQALDLDR LAQVGTSGIG ARAKSLLTGS PPVCAYDSVR GNVPTSDRQA ARDALARAGW TDSRPLRVGL HYQTDILGPA GAAAVELIAE QWKQLGVETK LVPEDINASL NTLYVTHDWD VYWGEQKIVF PSDLPTYFSG PAPPRGNNYP GVSNPAFDRL AAAAQGTAGA KSCPAWEQAE GSLIGQANIV PVATSPTTVF GNGARFELQG LTIAPTTIRL FG
|
| |
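The coordinate, length, and GC fields in this record can be cross-checked with a little arithmetic. The sketch below is illustrative only and not part of the source database. It assumes that Start bp and End bp are 1-based inclusive coordinates, which is consistent with the listed gene length, and that the CDS ends with a stop codon that is not counted in the protein length, as the trailing TAG in the gene sequence suggests. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the terminal stop codon (assumed present).
    return gene_len_bp // 3 - 1


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases; whitespace between 10-base blocks is ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(base in "GC" for base in bases)
    return 100.0 * gc / len(bases)


# Values taken from the record above.
print(gene_length(2154668, 2156236))   # 1569, matching "Gene Length"
print(protein_length(1569))            # 522, matching "Protein Length"
```

Passing the full "Gene sequence" value to `gc_percent` gives a figure that can be compared with the listed GC content; no result is quoted here because it has not been computed for this record.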