Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cwoe_5088 |
Symbol | |
ID | 8735554 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Conexibacter woesei DSM 14684 |
Kingdom | Bacteria |
Replicon accession | NC_013739 |
Strand | - |
Start bp | 5438577 |
End bp | 5440238 |
Gene Length | 1662 bp |
Protein Length | 553 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 646505713 |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_003396872 |
Protein GI | 284046532 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.416607 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACCTTC GACGACAGAC CGCTCGCCGC CCGTTCCGCG GCCTCGCGAC CGCCGCGTCG GTCGCCCTGA TGGCGGCGAC CCTCGCCGCC TGCGGCAGCA GCGACAGCGG CAACAGCAGC AGCACCGGCG ACGCCACCAG CAGAGGGGCG AACGTCGACT CGGGTCGCTG GGGAGCGGTC ATCAACACCG GCGGCAGACC CGTCAGAGGC GGGATCCTGC GCGTCAACCA GGATGACGCG CCCGCCGGCA TCAGCCCGCT GTACCTGCTG ACCGACCCGA GAAACGACAC GATCCAGGTC GTCATGCAGG TCTTCGACCA GCTGACCGAG CTGCGGCCCG GCTCGATCGA TCCGCAGCCC GGGCTGGCGG AGAGCTGGGA GGTCAGCCCC GACGGCAAGA CCTACACGTT CAGATTGCGC GACGCGAAGT TCTCCGACGG CGCGCCGGTG ACCTCCGGCG ACGTGCGCTA CTCGCTCGAC CGCGTGCGCG CGAGAGGGTC GTTCTACGTC GACCTCTACG CCAGCATCGC GACGATCGAG ACGCCGGACC CGTCGACCGT CGTGCTGAGA CTGAAGCAGC CGACGCCGGC GATGCTCTCC TACCTGTCGT TCGCCGGCGC ATCGGTCGTG CCGGAGAGAC TCGTGCGCGC CGACGAGAGA GGGTTCAACC GCAGACCGAT CGGCAGCGGC CCGTTCGTCG TCAGAAGCTG GAAGCCCGAC CAGGCGATCG AGCTGACGCG CAACGACAGC TACTGGCAGA GAGGCCTGCC GTACCTCGAC GGGGCGGTCC TGACCTCGGT GCCCGACGAC AACACGCGCG TGCTCAACGT GCAGTCGGGC GAGGCGCAGG TCGCCGACTT CGTGCCGTTC GCGCAGATAG ACGCGATCGA CAAGGCGGGC AAGGCGAGAG TGCTGATCGG ACCGGGCGCC GACACGACCG CGATCTGGGT CAACAACTCC AGAAGACCGT ACGACGAGCG CGAGGTCCGC CAGGCGCTGA TGTACGCGAC GCCGGTCGAG TCGATCATCG ACGTCGTCTT CCACGGCAAG TCGCCGCAGG CGAACACGAT CATCCCGAAG CTGGAGTACT GGACCGACCA GGCGAGAGCC TATCCCTACG ATCTCGAGAG AGCGAGAGAG CTGCTGACGA GATCGTCGGT CCCGGACGGC TTCACCGCGA CGATCCAGAT CAACGCCGAC GACCAGGCGG CGAGCCAGAT CGTGCAGATC CTCGAGCAGG CGTGGGCCAG AATCGGCGTC AGACTCGTGC GCGACCAGGC CGACGCGGCG ACCGTCGCGG AGAAGTTCTA CGGCGGCAGA TACGAGCTGA ACCTCGTTCG CCCGGGCGCC TTCACCAGCG ACGTGCCGGT CGACGACCAG TTCGCCGAGC TGCAGTTCAA CTCGCCCGCG ACGGGCAACC TGTTCACGTT CTCCAGACCG GCGCAGGCGC GCGACTACGC GCGCAGAGCG GTCGTCGAGA CAGACCAGCA GAAGCGCAGA GAGCTGTTCG CGCAGATGCA CGTCGCCTCG ATGGAGGAGC TGCCGACGCT GCCGCTCGTC TACACGCCGA ACCGCGCCGC GGTCGCGAAC GAGGTGCGTG ACTTCAACTA CATGCTGACC GGCTACTGGC GGCTGGAAAG CGCCTGGCTG GAGCAGCCGT GA
|
Protein sequence | MDLRRQTARR PFRGLATAAS VALMAATLAA CGSSDSGNSS STGDATSRGA NVDSGRWGAV INTGGRPVRG GILRVNQDDA PAGISPLYLL TDPRNDTIQV VMQVFDQLTE LRPGSIDPQP GLAESWEVSP DGKTYTFRLR DAKFSDGAPV TSGDVRYSLD RVRARGSFYV DLYASIATIE TPDPSTVVLR LKQPTPAMLS YLSFAGASVV PERLVRADER GFNRRPIGSG PFVVRSWKPD QAIELTRNDS YWQRGLPYLD GAVLTSVPDD NTRVLNVQSG EAQVADFVPF AQIDAIDKAG KARVLIGPGA DTTAIWVNNS RRPYDEREVR QALMYATPVE SIIDVVFHGK SPQANTIIPK LEYWTDQARA YPYDLERARE LLTRSSVPDG FTATIQINAD DQAASQIVQI LEQAWARIGV RLVRDQADAA TVAEKFYGGR YELNLVRPGA FTSDVPVDDQ FAELQFNSPA TGNLFTFSRP AQARDYARRA VVETDQQKRR ELFAQMHVAS MEELPTLPLV YTPNRAAVAN EVRDFNYMLT GYWRLESAWL EQP
|
| |