Gene Information |
Locus tag | Cwoe_2323 |
Symbol | |
ID | 8732766 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Conexibacter woesei DSM 14684 |
Kingdom | Bacteria |
Replicon accession | NC_013739 |
Strand | + |
Start bp | 2456779 |
End bp | 2458335 |
Gene Length | 1557 bp |
Protein Length | 518 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 646502940 |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_003394122 |
Protein GI | 284043782 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
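The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only and assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon. The function names are hypothetical and not part of the source database.

```python
# Values copied from the Gene Information fields above.
start_bp = 2456779
end_bp = 2458335
gene_length_bp = 1557
protein_length_aa = 518


def span_length(start, end):
    """Length of an interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(length_bp):
    """Number of amino-acid codons, assuming one terminal stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


computed_length = span_length(start_bp, end_bp)
computed_protein = coding_codons(computed_length)

print(computed_length == gene_length_bp)       # coordinates agree with Gene Length
print(computed_protein == protein_length_aa)   # Gene Length agrees with Protein Length
```

Under these assumptions both checks agree with the record: 2458335 - 2456779 + 1 = 1557 bp, and 1557 / 3 - 1 = 518 aa.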
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.250322 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.0484761 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGGACTT CCCGCGTCCT GGCGGCGCTC GCGTGCATCG GCGGTCTCGT CGCCGCCGGC TGTGGCTCCG CCACCACCAA CAGCGATCGA CCGCTCTCGA CGCTGCGCGT CCCGTTCCAG TCGGACGTCC CCGGCGGGGT CGACCCCGAC GTCTTCTACG ACGTCGAAGG GCTGCAGATC ACCAACTCGG CCTACGAGGG CCTGCTCGGC TACGCCGACG ACGGCAGACT CGTCGGCGAG CTCGCGACCG ATTGGAGAGC GGGCGCCGAC GGGCGGACGT ACGACTTCAC GCTGCGCCCC GGCGTGCGCT TCTCGGACGG GACGCCGTTC GACGCGCAGG CGATGAAGGC GAGCTTCGAG CGCCGCAGAC AGGTCGACGC CGGCCCCGCC TACATGCTCG CCGACGTCGA GCAGGTGGAG GCCCTCTCCC CGAGACGTCT GCGCGTGCGC CTGAGCAGAC CGAACGCGGC GTTCCTCGAC CACCTCGCCA GCCCCTACGG GCCGAAGGCC GTCAGCCCGA CCGCGGTGCA GCGCCACGCC CGCGACGGCG ACCTCGCGAA GGGCTGGCTG CAGACGCACA CGGCCGGCAC GGGCGCGTAC GAGCTGACCG AGGCGGTGCC GGGGCAGCGC TTCGTGATGC GCGCCTCGCC GACGTGGAGA CGCAGCAAGC CGACCGTGCG CGAGGTGCAG TTCACCGTCG TCCCGGACGC GGCGACGCAG GTCACCGAGC TGCGCGGCGG CCAGCTCGAC CTCATCACGC ACGGCCTCAC GACCGCCGAC GTGCAGGCGC TGCGCGGGGC CGGCGGCGCG AAGGTGACGA CGCGCCCCTC GACCCTGCGC ATGATGCTCT ACCTCAACAC CGCCGCGGGC ACGCTGCGCG ACGCTGAGGT GCGGCGCGCG TTCCTGAAGT TCGTCGACCG CGACGCGCTC GTCGACACCG TCTACGGGGA TCTCGCGCGG GCGAGCGACA GCTTCTACCC GGACGGCACG TCGATCGCGA GAGCGGCGCC GCTCGACGTG CCGGTCGACC CCGACCAGCT CAGAGCGCTG TCGTCGCGCT TCACGGAGCC GCTCGTGATC GGGGCGGTGC AGGGAGACGG CCCCGCCGCT GGCCAGATCG CCCAGCTGCT CCAGGGACAG TTGCAGCAGG CAGGCATCAA GGCGACCACA CGCGACATCC CGCTCGCGCA GGTGTACGAC CTCACGACGC GCCCCTCCGC GCGCCCGGAC GTCCTGATCG TCACCAACGT CCCTGACGAC CTCGCGCCCG ACAGCTGGTC GCGCGTCTAC CTGCGCACCG GCGGCTCGGT CAACTGGCTC TCCTGCTCGG TGCCGGAGGC CGACCGGCTG ATCGACGAGG CGGTCGTCGC TCGCGGCGCC GCCCGTCAGC GAGAGCTCGG CGTCGAGGCG GCCGCGGCGT GGATGGAGCA CGGCTGCGTG CTGCCGCTCG CCGAGCTGCA GAACGTGACG GTCTCGCGCA AGGGCGTCGA GAACGTGAAG GGCGGCCCGG CACGGCCGTT CGCGGTCGAC GTCGACAAGC TGCGGCAGGC GAGATGA
Protein sequence | MRTSRVLAAL ACIGGLVAAG CGSATTNSDR PLSTLRVPFQ SDVPGGVDPD VFYDVEGLQI TNSAYEGLLG YADDGRLVGE LATDWRAGAD GRTYDFTLRP GVRFSDGTPF DAQAMKASFE RRRQVDAGPA YMLADVEQVE ALSPRRLRVR LSRPNAAFLD HLASPYGPKA VSPTAVQRHA RDGDLAKGWL QTHTAGTGAY ELTEAVPGQR FVMRASPTWR RSKPTVREVQ FTVVPDAATQ VTELRGGQLD LITHGLTTAD VQALRGAGGA KVTTRPSTLR MMLYLNTAAG TLRDAEVRRA FLKFVDRDAL VDTVYGDLAR ASDSFYPDGT SIARAAPLDV PVDPDQLRAL SSRFTEPLVI GAVQGDGPAA GQIAQLLQGQ LQQAGIKATT RDIPLAQVYD LTTRPSARPD VLIVTNVPDD LAPDSWSRVY LRTGGSVNWL SCSVPEADRL IDEAVVARGA ARQRELGVEA AAAWMEHGCV LPLAELQNVT VSRKGVENVK GGPARPFAVD VDKLRQAR
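The sequences above are displayed in space-separated blocks of ten characters. As a rough sketch, assuming the spaces are purely cosmetic, the helpers below strip them, compute a GC percentage, and split a nucleotide sequence into codons. The helper names are hypothetical, and only the first two display blocks of the gene sequence are used here.

```python
def clean_sequence(text):
    """Remove the whitespace used for display blocks and uppercase the result."""
    return "".join(text.split()).upper()


def gc_percent(seq):
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


def codons(seq):
    """Split a cleaned sequence into complete triplets, dropping any trailing partial codon."""
    usable = len(seq) - len(seq) % 3
    return [seq[i:i + 3] for i in range(0, usable, 3)]


# First two display blocks of the gene sequence in the record.
head = clean_sequence("ATGAGGACTT CCCGCGTCCT")
head_codons = codons(head)
print(head_codons)
```

Applying `clean_sequence` and `gc_percent` to the full Gene sequence field is one way to compare against the reported GC content, which the record presumably rounds to a whole percent.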