Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cwoe_3425 |
Symbol | |
ID | 8733874 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Conexibacter woesei DSM 14684 |
Kingdom | Bacteria |
Replicon accession | NC_013739 |
Strand | - |
Start bp | 3653713 |
End bp | 3655404 |
Gene Length | 1692 bp |
Protein Length | 563 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 646504042 |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_003395218 |
Protein GI | 284044878 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGGACGT GGGCACGGAT CGTCGGTGCG ATGCTGGTCG CACTGGCGCT CGCGGTGACC GTCGCTGCGT GCGGCGGAGA CGACGACGGC GGTTCGACGT CGGCATCCAC CACATCGGGT GGCGGCGGCG CGACCACCGG AGGGGCAAGA CAGGGGGGCA CCGCGACGGT GCTCATGGGC ACAGCGCCCG ACTACCTCGA CCCGCAGCTG GCGTACACGA CCCAGGCGGT CGAGCCGCAC TGGATCAGCT ACACGGGTCT CCTGACCTAC CGGCACGAGG AGGGCCAGGC CGGCACGGAG CTGATCCCGG GCCTCGCCGA CGCGCTGCCG AGAATCAGCC AGGACGGCAG AACGTACGAC TTCACGCTGC GCAGAGACCT CAGATACTCC GACGGCACGC CCGTCAGAGC GACTGACTTC CCGTACTCCG TCGAGCGCAT GATCAAGATC CCGTGGGGCG GCAGATCGTT CGTCACCAAC TACGTCGTCG GCGCGCAGGA GTTCGACGAA GGCAGAGCCA GAAGAGTCTC CGGCATCACT GCCAGCGACG CGACCGGCAG AATCACGATC AGACTGAGAG AGGCGTACGG CGCGTTCTCC AACGTGCTCG CGTTCCCGGC GCTGGCGCTC GTCCCGAGCG GCACGCCGAT CAGAAACCTG TCGGCCGATC CGCCTCCCGG CGTCGGCGCG TACATGCTGA CCGACGTCGT CCCGAACCGT TCGTTCACGG TCAGAAGAAA CCCGGCGTTC GCCGCGTTCA GAATCCCCGA CATCCCGCTC GGCAACCTCG ACGCGATAAA CGTGAGAATC GTGTCGAACA CGAACTCCGA GGCGCAGCAG GTGCTCAACA ACCAGGCCGA CATCTTCGAC CCGGGTGACA CGCTGCCGCC GGCGCTGCTG CCGCAGATCG AGAGCCAGGC GAGAGATCGC TTCGCGCGCA AGCCGGTGCC GTCGACGTTC TACTTCTTCC TCAACACGAC CAAGCCGCCG TTCGACAACC AGAAGGCGCG CGAAGCCGTC AACATGGCGC TCGACCGCGA CGCGCTCGTG CGGCTCTCCA GCGGGTTCTT CTCGCCGAGC TGCTTCTTCA TCCCCGAGGG GATCGTCGGC CACCCCGACG CGGAGTGCCC GTACGGCGAC ACGCCGGACA TCGAGGGCGC GCGCAGAATC ATCAGCGACG AGGGCCTCGA AGGCACCAGA GTCGTCGTCT GGGGCCAGGA GCGCAGCCCG CGCAAGGAGT ACGTCGACTA CTACACCGAC ATGCTCAACA GAATCGGCTT CGACGCGCAG CAGAGAATCA TCGCCGACAC CGTCTACTTC CCGACGATCG GCAACGAGAG AACGGATCCG CAGACGGGCT TCGCGAACTG GCTGCAGGAC TTCCCGAACC CGTCGGACTT CTATCTGCTG CTCGACGCCC GTTCGATCCA GCCGACCAAC AACCAGAACT TCTCGAAGGT CGACGACCCG CACATCCAGA GAGAGCTTCT GAGACTCAAC GCCGTCCCCG CGACCGAGCT GGACAGCGTC GCCGACGAAT GGAGAGCACT CGACGAGTAC ACCGCGCAGA GAGCGTACAA CGCGGTGTTC GGCTCGATCA GCGTGCCGAA GTTCTTCTCC GACAAGCTGA ACTTCGACTC GGTGTTCCAC CCGTTGTACT TCAACGACTG GTCCACCCTC TCGCTGAAGT AG
|
Protein sequence | MRTWARIVGA MLVALALAVT VAACGGDDDG GSTSASTTSG GGGATTGGAR QGGTATVLMG TAPDYLDPQL AYTTQAVEPH WISYTGLLTY RHEEGQAGTE LIPGLADALP RISQDGRTYD FTLRRDLRYS DGTPVRATDF PYSVERMIKI PWGGRSFVTN YVVGAQEFDE GRARRVSGIT ASDATGRITI RLREAYGAFS NVLAFPALAL VPSGTPIRNL SADPPPGVGA YMLTDVVPNR SFTVRRNPAF AAFRIPDIPL GNLDAINVRI VSNTNSEAQQ VLNNQADIFD PGDTLPPALL PQIESQARDR FARKPVPSTF YFFLNTTKPP FDNQKAREAV NMALDRDALV RLSSGFFSPS CFFIPEGIVG HPDAECPYGD TPDIEGARRI ISDEGLEGTR VVVWGQERSP RKEYVDYYTD MLNRIGFDAQ QRIIADTVYF PTIGNERTDP QTGFANWLQD FPNPSDFYLL LDARSIQPTN NQNFSKVDDP HIQRELLRLN AVPATELDSV ADEWRALDEY TAQRAYNAVF GSISVPKFFS DKLNFDSVFH PLYFNDWSTL SLK
|
| |