Gene Information |
Locus tag | Cwoe_1930 |
Symbol | |
ID | 8732371 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Conexibacter woesei DSM 14684 |
Kingdom | Bacteria |
Replicon accession | NC_013739 |
Strand | + |
Start bp | 2030379 |
End bp | 2031878 |
Gene Length | 1500 bp |
Protein Length | 499 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 646502547 |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_003393731 |
Protein GI | 284043391 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
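The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the CDS ends with a stop codon that is not counted in the protein length. Neither convention is stated in the record, though both are consistent with its values. The function names are illustrative only.

```python
# Values copied from the Gene Information fields above.
START_BP = 2030379
END_BP = 2031878
GENE_LENGTH_BP = 1500
PROTEIN_LENGTH_AA = 499


def span_length(start: int, end: int) -> int:
    """Length of an interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons minus one, assuming an unspliced CDS that
    ends with a single stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions the interval 2030379..2031878 spans 1500 bp, and 1500 / 3 = 500 codons, one of which is the stop codon, leaving 499 residues.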
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.15113 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGCTGC TCGTCACGGT GGCGGGCTGC GGAGACGGCT CGTCGTCCTC GACGTCGTCG TCGCGTTCCG GGGAGAAGGT GCTGCGCTAC GGGGTGACGT CGAAGCTCGA CACGTACAAC CCGGCGAGAG ACTCCGTCAC CGGCGCGACC AACATCCGCT ACCTGACGAC GGAGACGATC CTCGAGAAGG ATCCCGAGAC CGGCCAGTAC GGGCCCGGCC TCGCCACGGA GTTCGGCTTC GCCGGCAGAG GCAACACCGC GTACGAGTTC ACGCTGCGCG AGGACGCGAG CTTCTCCGAC GGCACGCCGC TGGACGCCGC GGCAGTGAAG AAGTGGCTGG AGTACTTCTC CAGAGCCGGC GGGCCGTGGG TCGGTCTCGT CGCGCTCAGA TCGATCGAGA CGCCCGGCAG ATACACGGTA CGGCTGAACT TCAGAGTGCC GAGCCCCAAC ATCGAGTACT TCCTCGCCGG CGGCAACAAC TGGGGCTTCG TCTCCAGCCC CAGAGGGGTC GACGACCCCA AGCTGCTGGC GCAGGACATG ATGGGCGTCG GCCCGTACGT GATGGACCCC GGCGAGAGCG TCGCGGGCGA CCACTACACG TTCACGCCCA ACGACCGCTA CTACGACCAG TCGAAGGTCA AGTGGGACAA GATCGTCGTG AAGGTGATCA CCGATCCCTC GACGATGGTC AAGGCGCTGC AGGCGGGCGA GCTCGACGCC TCCCAGGGCG ACTTCTCGAC CGTCGGCACG GCCGAGAGAG TCCCCGGACT GAAGGTCCAC TGGGGACAGG GCGGCTGGGA TCCGATCCTG CTGCTGACCA AGCACTCCGA GCCGCTGCGC GACGTGCGCG TCCGTCAGGC CCTCAACTAC GCGATCGATC GCGAGGCGAT CACCCAGGCG ATGCTGGGCA AGTACGGCGA GCCGACCTCC GAGTGGGTCA CGACCGACGG CTTCGACCCC GAGTACCAGG ACTACTACGA GTACGACCCG GAGAAGGCGA AGCGGCTGCT CGAGGCGGCC GGATACGGGG ACGGGCTGAC GCTGAAGGTG GTCGACCAGG GCTACTACGG CAACCTCGGC GACCGAATGG TCCAGATCGT CGCCGACTAC ATGAGCAGAG TCGGCGTCAC GTTCCAGGTC ACCAAGGCGA CGTCCGCCTC CGAGCATCTG GAGAAGGGCT TGTCGGGCGC GTTCGACGCC TGGCAGTTCG CGGTCGGCAG CGTGCCGACG GCCACCTTCC TGGACTTCTT CGGCGAAGGG TTCGGCGTCA TCCCCGACCC CGAGCTGGAC GAGATCGCGG CGCGCGCGTC CGTCGCGCCC AAGGAGGAGA TGCCGGAGAT CTGGAAGGAG TTCTCGCGGC GCACCGTCGA GCAGGCGTCG ATGCTGAACA TCTTCACGAC GCCGGTCCTG CTCTACGCCC GCGACGACAT CGAGGGCGTC GTCGCGACCC CCGCGTTCGG CGTGTCGCCG GCGCTGCTGC AGTGGGAGCC CGCGAGGTGA
|
Protein sequence | MALLVTVAGC GDGSSSSTSS SRSGEKVLRY GVTSKLDTYN PARDSVTGAT NIRYLTTETI LEKDPETGQY GPGLATEFGF AGRGNTAYEF TLREDASFSD GTPLDAAAVK KWLEYFSRAG GPWVGLVALR SIETPGRYTV RLNFRVPSPN IEYFLAGGNN WGFVSSPRGV DDPKLLAQDM MGVGPYVMDP GESVAGDHYT FTPNDRYYDQ SKVKWDKIVV KVITDPSTMV KALQAGELDA SQGDFSTVGT AERVPGLKVH WGQGGWDPIL LLTKHSEPLR DVRVRQALNY AIDREAITQA MLGKYGEPTS EWVTTDGFDP EYQDYYEYDP EKAKRLLEAA GYGDGLTLKV VDQGYYGNLG DRMVQIVADY MSRVGVTFQV TKATSASEHL EKGLSGAFDA WQFAVGSVPT ATFLDFFGEG FGVIPDPELD EIAARASVAP KEEMPEIWKE FSRRTVEQAS MLNIFTTPVL LYARDDIEGV VATPAFGVSP ALLQWEPAR
|
| |
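The sequence fields above are printed in space-separated blocks of ten characters. As a minimal sketch of how the GC content and protein fields relate to the gene sequence, the code below strips the spacing, computes a GC percentage, and translates codons. It assumes that the sense-codon assignments of translation table 11 match the standard genetic code, and it does not model alternative start codons. The helper names are hypothetical and are not part of any database API.

```python
from itertools import product

# Codon table in TCAG order, standard codon assignments
# (assumed here to be the same sense codons as translation table 11).
_BASES = "TCAG"
_AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(_BASES, repeat=3), _AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate complete codons from the first base, stopping at a stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First three blocks of the gene sequence from the record above.
    prefix = "ATGGCGCTGC TCGTCACGGT GGCGGGCTGC"
    print(translate(prefix))
    print(gc_percent("ATGGCGCTGC TCGTCACGGT"))
```

Applied to the first three blocks of the gene sequence, `translate` yields the first ten residues of the listed protein sequence, MALLVTVAGC. Passing the full Gene sequence value to `gc_percent` and `translate` would allow comparison against the GC content and Protein sequence fields.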