Gene Information |
Locus tag | Cwoe_0576 |
Symbol | |
ID | 8731004 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Conexibacter woesei DSM 14684 |
Kingdom | Bacteria |
Replicon accession | NC_013739 |
Strand | + |
Start bp | 606414 |
End bp | 608015 |
Gene Length | 1602 bp |
Protein Length | 533 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 646501189 |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_003392386 |
Protein GI | 284042046 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
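The length fields above follow from the coordinates by simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it is consistent with the listed 1602 bp) and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, excluding the terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length_bp(606414, 608015)
print(length)                     # 1602
print(protein_length_aa(length))  # 533
```

Both values agree with the Gene Length and Protein Length rows of this record.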
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGAGACG GTGTGAGAGG TCTGACGCGG CGTGACGCGA TGCGCGGCGC CGCCGCGGGC GCGGCCGTGG TCGGCGCCGG CGGTCTGCTG GCGGCCTGCG GCAGCGGCGG CTCGTCGAGC GGCACGACGG CGTCCGGCGA GACGACCGCC ACGGCGGGCG GCACGCCGAG AAGCGGCGGC ACGCTGCGCG TCGGCGGAAC CGGCGGCGGC GCGCGCGACT CGCTCGACCC GAACCGCCAG CAGACGGCGC TCGACTTCGC CCGCTGCTTC GCGCTCTACG ACCCGCTCGT CGAGCTGACC GAGCAGTTCA CCTACGAGCT GGCGCTGGCC GAGGAGATCA CGCCGGACGA CGGCAGCGCG AAGGTGTGGA CGGTCCGGCT GAAGGACGGG ATCGAGTTCC ACGACGGCAA GACCGCCGAC GCTGAGGACC TGATCTTCAG CATCGGCCGC GTGATCGACC CGAGAGCCCC GGGCGCCGGT GCGAACGCGC TCAGAGGCGT GACTCTGAAC GGGATGAGAA AGCTCGACGC GCGCACCGTC CGCTTCACGC TCGAGCAGCC GATCTCGATC TTCGACAAGC GCGTCGGCGG GTACCTCTCG CCGCTGCTGC CGGTCGGGTA CGACCCGGCG AGACCGGTCG GCGCCGGCCC CTTCAAGCTG CAGAGCTTCA AGGCCGGCGA CCGCTCCGTG ATGGTCCCGC ACCCGAACTA CTGGGGCGAG AGAGCGCACG TCGACCAGCT CGACATCATC GGCATCGCCG ACGCGTCGGC GGCGGTCAAC TCGCTGCTCT CCGGCCAGAT CGACATCCTT CAGGGCCTGC CGCCGGCGCA GGCCGAGGTC GTCACCTCGG GCGGCGGCAA GCTGCTGGAG ACGAACGACT CCGCATGCTT CATGTTCGGC ATGCGGATGG ACATGGCGCC GTTCGACGAC GTGCGCGTGC GGCAGGCGAT GCGGCTGATC GCCGACCGCG ACCAGATGGT CGAGCAGGTG ATGGCCGGCC GCGGCGACGC CGCCAACGAC CTCTTCGCCC GCTACGACCC CGACTACCTG TCGGACGTCC CGCAGCGCGA GCAGGACCTG GAGCAGGCGA GAGCGCTGCT GAAGCAGGCC GGCCAGGACG GCATGCGGCT GGAGATCTCG ACGACCGGCG CCTATCCGGG CCTGCTGGAG TCGGCGCAGG TCTTCGCCGA GCAGGCGAAG GGCGCCGGTC TCGACGTCAA GGTCAGAAGC ATCGACCCGG ACACCTTCTA CGCCCGCTAC TACCGCAGAA CGCCGTTCTC GCCGGACCTC GTCTCGCCGC AGCTGTACCT GACGGTCGCG ACCTCCTACA ACACGCCGGG CGGCCCCTAC GACACCGTCT ACAACAGAGA CCCCGAGTAC CTCGCGCTCT ACAGAGACGC GCTCGCGGAG CTGGACGAGG CCAAGCGCGG CGAGCTGATC GAGGCGATGC AGCGGATCGA CCACGAGCGT GGCGGCTACG TCTGCTGGGG CTTCTCGAAG AGCCTCGACG CCTATCGCGA CGACGTCAAC GGGCTGGTGC CGGGGACGAA GGCGGCGTTC AGCGTCAACA ACGGGGCGTT CAACCGGCTC TGGCTCAGCT AG
Protein sequence | MGDGVRGLTR RDAMRGAAAG AAVVGAGGLL AACGSGGSSS GTTASGETTA TAGGTPRSGG TLRVGGTGGG ARDSLDPNRQ QTALDFARCF ALYDPLVELT EQFTYELALA EEITPDDGSA KVWTVRLKDG IEFHDGKTAD AEDLIFSIGR VIDPRAPGAG ANALRGVTLN GMRKLDARTV RFTLEQPISI FDKRVGGYLS PLLPVGYDPA RPVGAGPFKL QSFKAGDRSV MVPHPNYWGE RAHVDQLDII GIADASAAVN SLLSGQIDIL QGLPPAQAEV VTSGGGKLLE TNDSACFMFG MRMDMAPFDD VRVRQAMRLI ADRDQMVEQV MAGRGDAAND LFARYDPDYL SDVPQREQDL EQARALLKQA GQDGMRLEIS TTGAYPGLLE SAQVFAEQAK GAGLDVKVRS IDPDTFYARY YRRTPFSPDL VSPQLYLTVA TSYNTPGGPY DTVYNRDPEY LALYRDALAE LDEAKRGELI EAMQRIDHER GGYVCWGFSK SLDAYRDDVN GLVPGTKAAF SVNNGAFNRL WLS
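The relationship between the two sequences can be checked with a small translation routine. This is a minimal sketch, not the pipeline that produced this record: it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (it differs in the permitted start codons, which does not matter here because the gene starts with ATG). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical, and the inputs are only the first three and the last two 10-base blocks of the gene sequence as printed above.

```python
# Codon table in NCBI order (bases T, C, A, G); table 11 shares these assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    dna = clean(seq)
    residues = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


first_blocks = "ATGGGAGACG GTGTGAGAGG TCTGACGCGG"
last_blocks = "TGGCTCAGCT AG"
print(translate(first_blocks))  # MGDGVRGLTR
print(translate(last_blocks))   # WLS
```

The two outputs match the first ten and last three residues of the protein sequence. Passing the full gene sequence to `gc_fraction` would give the value to compare against the GC content field.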