Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cwoe_1014 |
Symbol | |
ID | 8731449 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Conexibacter woesei DSM 14684 |
Kingdom | Bacteria |
Replicon accession | NC_013739 |
Strand | + |
Start bp | 1068045 |
End bp | 1069616 |
Gene Length | 1572 bp |
Protein Length | 523 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 646501632 |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_003392822 |
Protein GI | 284042482 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAGCCCAA GAGCTATGGG GGTCAGCCGG ATGCGGCGCT TGGCCGTGGC TGTCGGGGTG CTGGTCGTTG GCGTCGGCGC CGCGGGGTGC GGCGGCGAAA GCACGACGAC GACGACCAGA TCAGGCGGGG GAGCGAAGGT GCTGCGCTAC GGCGTTCCCG CTTCGATCGT GGCCAGAGGC GTGAGCAATC CGGCCGTGAT CCCGAGCACG GACGGACCGA TCCTGAGCAT CGCCTACGCC CCTCTGTTCC ATGCCGCTCC GGACGGCAGA ATCGAGCCGG CGCTGGCCGT CAGATGGCGC TACACCGATG ACGACCAGAA GGTCTTCGAG TTCACCCTGC GCGAGGATGC GCGCTTCTCC GACGGGACCC CGGTCACCGC GGAAGCGGTG GTGGGTTCGC TCGAGTACTA CTACAAGAGC AGAAACATCT ACTCGGAGCT GCTGGGGAGA AACCCGAGGT TCGAGGCGGT CGACAGATGG ACGGTGCGCG TCACGTTGAC GGCGTCGCTG CCCAACCTGC CGTTCCTGTT CTCCGAGGCG AACGTCAACT GGGGCTTCGT GATGGCGCCG AAGGGAGTGG CGAATCCCAG ACTGTTCACG AAGGCCACCT ATGGCGCGGG GCCGTACAAG CTCGACTACT CGAAGTCGGT GCCCGGCGAC CACTACACGT TCGTGCCGAA CGAGTACTTC TACGACAGAT CGGCGATCAA GTTCAAGGAG ATCTACCTGA AGGCGTTCGC CGATCCGTCG GCAGCGCTGC AGGCGCAGCA GGCCGGTCAG ATCGACGTCG GCTGGACGAT GGACTCGTCG ACGGCCGAAG CGGCCGAGTC GGCCGGTCTC GACGTCGTGT CTGCCCCATT CGCCGTTCTC TACATGACGA TGAACGCGCG CAGAGGCACC GAGGCGCTTC GCGACGTCCG TGTGCGCCGG GCGCTGAACT ACGCGATCGA CCGCAAGGCG ATCTCGAACG CGCTCTTCGG CAGATACGGC GTCCCGATGT CTCAGTTCAC GGTTCCGCCG GACTCCAATC CCGGGTTGGA GAACGCGTAC CCCTACGATC CGGAGAGAGC GCGCGCGCTG CTGGCCGAGG CGGGATACCC GAGAGGGTTC GAGTTCTCGA TCAACACCGG CAAGGACGAT CCCCAGGCGA AGGCCGTCGA GCTGGTGGCC AGCTACCTCG ACAGAATCGG CGTCAAGTCG AACGTCAGAA CGTTCCAGAA CCGCGCGGGC TACCTCGACG CAGCGCTGTC GTTCAAGGAC GACTCGGCGA TCTTCGCCGG CGACGTCGGC GTGCCGACGA CGATCGAGTA CCCCAGCTAC ATCGGGCCGA GCAGCACGCT CGGCGGGGGC GACCCGGTCA ACCCGAGAGT CAACGAGCTC TACGAGGCCG GTCTCAGAGC GAGCGACCCC TCCAGAGACT GGAAGGAGAT GTGGGCGATC ACGGTGAACG ACGCCTGGTT CCTGCCGATA TCGGGGTTCA GTGACCTCGC GTACGTGTCG GACGGGATCG GCGGGGTGCA GATGACCCCG GCCCGCCCGT ACTCCTTCCC GACGGAGTGG TTCCCCAAGT AG
|
Protein sequence | MSPRAMGVSR MRRLAVAVGV LVVGVGAAGC GGESTTTTTR SGGGAKVLRY GVPASIVARG VSNPAVIPST DGPILSIAYA PLFHAAPDGR IEPALAVRWR YTDDDQKVFE FTLREDARFS DGTPVTAEAV VGSLEYYYKS RNIYSELLGR NPRFEAVDRW TVRVTLTASL PNLPFLFSEA NVNWGFVMAP KGVANPRLFT KATYGAGPYK LDYSKSVPGD HYTFVPNEYF YDRSAIKFKE IYLKAFADPS AALQAQQAGQ IDVGWTMDSS TAEAAESAGL DVVSAPFAVL YMTMNARRGT EALRDVRVRR ALNYAIDRKA ISNALFGRYG VPMSQFTVPP DSNPGLENAY PYDPERARAL LAEAGYPRGF EFSINTGKDD PQAKAVELVA SYLDRIGVKS NVRTFQNRAG YLDAALSFKD DSAIFAGDVG VPTTIEYPSY IGPSSTLGGG DPVNPRVNEL YEAGLRASDP SRDWKEMWAI TVNDAWFLPI SGFSDLAYVS DGIGGVQMTP ARPYSFPTEW FPK
|
| |