Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cwoe_0494 |
Symbol | |
ID | 8730922 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Conexibacter woesei DSM 14684 |
Kingdom | Bacteria |
Replicon accession | NC_013739 |
Strand | - |
Start bp | 512479 |
End bp | 514089 |
Gene Length | 1611 bp |
Protein Length | 536 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 646501107 |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_003392304 |
Protein GI | 284041964 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.080678 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.301708 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACGAAG TCAAGAAGGT GCTGCCGTCG GTCCGTGGAC GCGGGGTGAG CCGGCGCGCT TTCCTGAACG AGGGTGCGGC ATGGGGGCTC TCGGCCTCGA CGGTCGGACG GCTGCTCGCC GTCGGCGCCG CCCCGGCCGG CCTGCTCGCG GGCTGCGGGA CCGACTCCGA GAGCGCGAGC GGCGGGGGCG GAGGCGGCGG AGCCGCCGGG ATCATCGCGA TCGGCAACGC CGAGCCGCCG ACGTCGGCCT ACTGGGACCC CCACGCCCAG TTCGGCATGG CCGACACGCA GCTCTGGTCG CTGACGTACG ACATGCTGCT CAGCTACGAC AAGTCCGGGC GCGTCGTCGG CGGCCTGGCG CGGCGCTGGC ACCGTACGAG CCCGCAGCGC ATGCGCTTGG AGCTGCGCGA GGACGCCCGC TTCCAGGACG GCGCGCCGGT GCTCGCGAAG GACGTCAAGG CGAGCCTCGA CCGGCTCGGC GATCCGGAGT CGAGACTCGT GCTGTCCGCC TACGCGACCC CGGGCATGAG AGTCGAGGTG ATCGACGAGC ACACGATCGA GATCGTCACC CCGCGCCCGT TCGGACCGCT GGAGTCGGCG CTGACGCTGT TCGCGATCGC GCCCGCGAGA GACATCGCGC AGCCGGATGT CTTCAGAGAG CGGCCGCTCG GCAGCGGCCC GTTCAGATTC GTCCGCTACA AGAACAACGT CGTCGAGCTG GTCGCGAACG AGAGATACTG GCGCGGCAAG CCGGCCTCCA GAGGCGTCGA GCTGCGCTAC ATCGCCGACC CCGAGGCGCG TCTCAACGCG CTGCTGACGG GCGCGATCGA CATCTACACG CGCGGCAGCT CGCTGACGCT CGACGCGACG AAGAAGGACG GCTACCACGT CACCACGACC GGCCCGGCCA GCCAGCTGAT CTACATCCCG CAGCACAACA CGGAGCTCAG CGACCCGCGT GTGCGGCAGG CGATCGCCCA CGCGATCGAC CGCCGCGCGA TCGCCAAGAG CCTGATCAGA ATCGACCCGC CGGCGCGGTC GAGCCTGCCC GCCGGCACCG ACGGCTTCCG CCCGCTGGCG CCGAGCTTCG AGTACGACCC CGACAAGGCC CGCAGACTGC TCGCCGACGC CGGCCACGCG AACGGGCTGA AGATCACGAT GGCGTCCTCG AACCTCGTCA CGCACCAGCC CGCGATCGAC CAGCTCGTCA AGAGCTGGCT GGAGGAGGTC GGCATCGAGG TCGAGCTGAG AACACTCGAG ACCGGCACGT TCCGCAGCTC CTACAACCAG TACGCGCTGT CGTTCAACGC GCTCGGCACG ATGAACCCCG ACCCCGACTC GCTGCTGACG TTCTTCCGCC CGGTCGTCGC GCAGGCGGCG CTGAACCTCG ACGACCCGAA GATCGGGCGG CTGCTGCAGC GCACGCGCGA GACGACCGGT GCCGCGCGAC GCGCTGCGAT CGACGCGTAC GCGTCGTATC TGTGGCAGAA CCAGATCATG ATCTACGTCA CCGACGACAT CTGGTTCACG GTCGTGAATC CGAAGCTGCG CAACTACCAC CGCACCCCGC AGCAGGGAGA GCCGCTCCTG TGGCGCGCGT CGAAGGCGTG A
|
Protein sequence | MDEVKKVLPS VRGRGVSRRA FLNEGAAWGL SASTVGRLLA VGAAPAGLLA GCGTDSESAS GGGGGGGAAG IIAIGNAEPP TSAYWDPHAQ FGMADTQLWS LTYDMLLSYD KSGRVVGGLA RRWHRTSPQR MRLELREDAR FQDGAPVLAK DVKASLDRLG DPESRLVLSA YATPGMRVEV IDEHTIEIVT PRPFGPLESA LTLFAIAPAR DIAQPDVFRE RPLGSGPFRF VRYKNNVVEL VANERYWRGK PASRGVELRY IADPEARLNA LLTGAIDIYT RGSSLTLDAT KKDGYHVTTT GPASQLIYIP QHNTELSDPR VRQAIAHAID RRAIAKSLIR IDPPARSSLP AGTDGFRPLA PSFEYDPDKA RRLLADAGHA NGLKITMASS NLVTHQPAID QLVKSWLEEV GIEVELRTLE TGTFRSSYNQ YALSFNALGT MNPDPDSLLT FFRPVVAQAA LNLDDPKIGR LLQRTRETTG AARRAAIDAY ASYLWQNQIM IYVTDDIWFT VVNPKLRNYH RTPQQGEPLL WRASKA
|
| |