Gene Information |
Locus tag | Cwoe_5121 |
Symbol | |
ID | 8735587 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Conexibacter woesei DSM 14684 |
Kingdom | Bacteria |
Replicon accession | NC_013739 |
Strand | - |
Start bp | 5476681 |
End bp | 5478249 |
Gene Length | 1569 bp |
Protein Length | 522 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 646505746 |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_003396905 |
Protein GI | 284046565 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.969979 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGGGTTC AGATTCGGCA CCTGGCGGTG CTCGCCGCCG GCTGCGCGGT GCTGGCCGGC TGCGGTGGCG GCGGCAGTAC GAGCGGTGGC GAGACCGGCG ACACGGCGGC TTCGACGCAG GCGGCGAGAG CCGGCGGCAG ACTCGTCTAC GGCACCGCCG CGGGCATCTC GCAGCTGGAC CCGCACACGC TTTCGGCCGC GCAGCAGCTG GTGGTGCAGC CGCTGCTGTT CAACGGCCTG ACGAAGGCCG ATCCAAGCGG TGAGACCACA CCCGACCTGG CGGCCTCATG GAGAGCGTCC GCCGACCAGA GAACGTGGAC CTTCACGCTC CGCGACGGTG TCAGATTCCA CGACGGCACG CCGTTCGACG CGGCGGCCGC GAAGGCCAAC CTCGAGCGCG TGCTCGACCC CAGAGTCCCG AATCCCGACC GCACGAAGAT CGAGACGATC GCGAAGATCG AGACGCCCGC GCCGACGACG CTGGTGCTGA AGCTGAGAGC GCCGAACGCG CTGCTGCCGG ACGCGCTCGC CTCGGGCACG ATCAAGATGA TCGCGCCGAG AAGCTTCTCC AGCGCGAGCA AGACCGCGGT CGGCACCGGT CCGTTCAAGC TCGGCGAGAT GGTCCCCGAC GACCACGTCA CGCTGCTCAG AAACGACGGC TACTGGGGCG AGCCGGCCAA GCTCGACGAG ATCGACGTCG TCCGCTCGCC CGACTCGACC GCCGCCGCGA CCGCGTTCCG CGCCGGCGAC CTCGACGTGC TGTGGGCGGT CACGCCCGCC GACGTCGACG GGCTCGTCGC CGCGACGCGC GGCCGGGCGC TGGAGCCCGA CGACGTCTCG GCCGGCGCTT ACTGGGAGGT CGACAACACC AGCCCGCCGT TCGACGACGT GCGCGCCCGT CAGGCGCTGC TGCACGCGAT CGACCGCGAG ACGATGCTGA AGGTCGGCTA CGCCGGCAAG GGCCTGGTGC CGGAGACGGC GTCGATGCTG TCGCCCAGAA ACGCCGCCTT CGACAGCTCG CTGACGACCT ACCCGTTCGA CCTCGACAAG GCGAGAGCGC TGTTCGCCGA GGCCGGCGTC GACGCCGGCA CGACGCTGAC GTTCCACACG GTCGCGGGCC AGTACCCGGA GTGGGTGCAG ATGGGCCAGA TCCTCCAGCA GAACCTGGAG GAGATCGGGA TCAGAATGAA GATCGAGCGC CAGGAGTTCA GCACCTGGCT CGACACGTTC TACCCGGCCG GCAAGAGATT CCCGGGCGGC ATCGTCGCCA ACTACCTGTC GCTGCCGACC GTTCCCAGCT ACGCGCTCAG CTTCCTCGAC GAGGGCGTCT GCGAGTGCAA CGCGAGACTG CCCGGCTGGA GAGAGCTGTC CGCCAGAGCG GTCGCGACCG GCGAGCAGGC GGAGCGCGAC GCGATCTACG CCGAGATGCA GCAGCTGCAG AACGACGCCG TGCCGATCAT GCCGATCGTC TTCTCGACGC TCCAGACGGT CGTGCGCGAC GGCGTGACCG GCGCCTGGGT CGACCCGCAG GGCAACGTCA ACCTCGAACA GGCCGGCTTC GCGCCGTGA
|
Protein sequence | MRVQIRHLAV LAAGCAVLAG CGGGGSTSGG ETGDTAASTQ AARAGGRLVY GTAAGISQLD PHTLSAAQQL VVQPLLFNGL TKADPSGETT PDLAASWRAS ADQRTWTFTL RDGVRFHDGT PFDAAAAKAN LERVLDPRVP NPDRTKIETI AKIETPAPTT LVLKLRAPNA LLPDALASGT IKMIAPRSFS SASKTAVGTG PFKLGEMVPD DHVTLLRNDG YWGEPAKLDE IDVVRSPDST AAATAFRAGD LDVLWAVTPA DVDGLVAATR GRALEPDDVS AGAYWEVDNT SPPFDDVRAR QALLHAIDRE TMLKVGYAGK GLVPETASML SPRNAAFDSS LTTYPFDLDK ARALFAEAGV DAGTTLTFHT VAGQYPEWVQ MGQILQQNLE EIGIRMKIER QEFSTWLDTF YPAGKRFPGG IVANYLSLPT VPSYALSFLD EGVCECNARL PGWRELSARA VATGEQAERD AIYAEMQQLQ NDAVPIMPIV FSTLQTVVRD GVTGAWVDPQ GNVNLEQAGF AP
|
| |
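The length fields in the record can be cross-checked against each other. The sketch below is a minimal illustration, not part of the source database: it assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the terminal stop codon (the gene sequence above ends in TGA). The function names `gene_length`, `protein_length`, and `gc_percent` are hypothetical helpers introduced only for this example.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_percent(seq: str) -> float:
    """Percent G+C in a nucleotide string; whitespace between blocks is ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return 100.0 * sum(b in "GC" for b in bases) / len(bases)


if __name__ == "__main__":
    length = gene_length(5476681, 5478249)
    print(length, protein_length(length))  # prints: 1569 522
```

Under these assumptions the coordinates reproduce the listed Gene Length (1569 bp) and Protein Length (522 aa). Passing the Gene sequence field to `gc_percent` would give a value to compare with the listed GC content; the blocks of ten bases can be pasted as-is, since whitespace is stripped.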