Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cwoe_2945 |
Symbol | |
ID | 8733390 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Conexibacter woesei DSM 14684 |
Kingdom | Bacteria |
Replicon accession | NC_013739 |
Strand | + |
Start bp | 3148415 |
End bp | 3150112 |
Gene Length | 1698 bp |
Protein Length | 565 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 646503559 |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_003394739 |
Protein GI | 284044399 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.472455 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.0524757 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGTTCTG ACGCATTGAG AAGGTCGCGG CGCGTGCTGG CGCCGCTCGC CGCGGTGCTC GCGGCGCTGC TCGTCCTGGC CGGCTGCGGC GGCGGCGGAT CGGACCTGCC GGACGGCGTC ACGCAGAGCG GTGACGCGAG CACCGCGGCC GCCGCGGACG GCGGTGCGGG CGGCGGATCC GCCGGCAGAC TCGCGTTCAC CCGCCTGGGT ATCAACACGC CCGGCTTCGG CCCGTGGAAC CAGAGCACGG GCAACGACGC GATCGTCAAC TCGCTGCTGT TCTCGAACCT CGTGAAGGTC AGATCGGACG AGAGAACGCT CGCGCCCGAC CTCGCCGAGA GCTGGGAGGC CTCCAGCGAT CAGCGCACCT TCACGTTCAG ACTGCGCGAC GACGTCAGCT GGAGTGACGG CACGCCGTTC ACCGCCAGAG ACGTCGTCTT CACCGCGACG CAGGCGGCGC AGTTCGGGCC GGAAGCGTAC GTCGGCTACC AGCCGACGCA GTGGCGCGAC ATCGAGGGCG GCGCCGAGAT CGAGGGCACC AGCAGACCGC TGCGCGGCAT CAGAGCGCTC GACGAGCACA CCGTCGAGAT CAGACTCGCG AAGCCGAACG CCGAGTACGT CCGCAACCTC ACCGACGCGG TCTACTCGAT CATGCCCGAG CACCTGCTCG CCGACGCGAC CGCGGCGGAC GTCAGAAGAA CCGCCTTCGC GACGAGCAGA CCGGTCGGAA CGGGCCCGTA CACGCTGACG CGGATCGCAC CGAACCAGTA CTACGAGTTC GCCGCCAACG ACGGCTACTT CGGCGGCGCG CCGAAGATCG GGACGCTCTT CTTCAAGCTC GACGTCAAGC CCGAGTCTGC CGTCGCGCAG CTCGAGTCCG GCGAGCTGCA GCTCGTGATC AACGCGTCGC CGAACGACGA GTCGCGGCTG ACGCGCGTCG ACGGGCTCAG AAACGAGTAC GTCGTCTCGC CGGCGGTGCA GATGCTGCAG TTCCGTACCG ACCACCCGCA GGCGAGAGAC GCGCGCGTGC GGCAGGCGAT CTACTCCGCG ATCGACCGCC GCGCGATGCT CAGAAGCCTC TTCGGCGACC ACGGCGAGAT CCGCTGGGTG CTGCCCGGCT TCGACCAGGA GGACCCCGCG CTCGATCGTT ACGAGCACGA CCCGCAGAAG GCGAGAGCGC TGCTCGCGGA GGCCGGCTTC GACGGCGACG CGCCGTTCAA GATCGCCTAC GCGACCGACG TCGACCCGCT CTGGAGACAG ATGACGCCGG TGATCCAGAA GAACCTGCAG GACGTCGGCA TCAACGCCGT GCTGGAACCG CTCGACGCGG CCAAGTGGTC GGCCGCGAAC GTCGACAGAA ACCCGCAGAC CCCGGTCACG CTCAACTCGG GTGGCGCGAT GGGGCTCTCG CCCGACCGCA GCTCGGTCTA CTACAACTGC AGAGCGCCGC TCTCGTCGTT CTACGCCAAC TGCGACCTCG ACGCGCTCTA CGTGCAGGCG CGCGGCGAGG CCGATCCGGA GAGACGCGCG CAGCTGTACG CGAGAGCGGC GCAGATCCTC AACAGAGACG TGCCGCAGGC CGCGCTGTGG CAGACCGCGA ACTTCCACGC CTACAGCGAC AAGCTCGGCG GGACGTTCGC GATCTTCCCG AACGACCGCG ACAGCGCGTT CGAGATCGCC GGCTGGACGC TCGGCTAG
|
Protein sequence | MGSDALRRSR RVLAPLAAVL AALLVLAGCG GGGSDLPDGV TQSGDASTAA AADGGAGGGS AGRLAFTRLG INTPGFGPWN QSTGNDAIVN SLLFSNLVKV RSDERTLAPD LAESWEASSD QRTFTFRLRD DVSWSDGTPF TARDVVFTAT QAAQFGPEAY VGYQPTQWRD IEGGAEIEGT SRPLRGIRAL DEHTVEIRLA KPNAEYVRNL TDAVYSIMPE HLLADATAAD VRRTAFATSR PVGTGPYTLT RIAPNQYYEF AANDGYFGGA PKIGTLFFKL DVKPESAVAQ LESGELQLVI NASPNDESRL TRVDGLRNEY VVSPAVQMLQ FRTDHPQARD ARVRQAIYSA IDRRAMLRSL FGDHGEIRWV LPGFDQEDPA LDRYEHDPQK ARALLAEAGF DGDAPFKIAY ATDVDPLWRQ MTPVIQKNLQ DVGINAVLEP LDAAKWSAAN VDRNPQTPVT LNSGGAMGLS PDRSSVYYNC RAPLSSFYAN CDLDALYVQA RGEADPERRA QLYARAAQIL NRDVPQAALW QTANFHAYSD KLGGTFAIFP NDRDSAFEIA GWTLG
|
| |