Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cwoe_2811 |
Symbol | |
ID | 8733254 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Conexibacter woesei DSM 14684 |
Kingdom | Bacteria |
Replicon accession | NC_013739 |
Strand | + |
Start bp | 2999485 |
End bp | 3001107 |
Gene Length | 1623 bp |
Protein Length | 540 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 646503423 |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_003394605 |
Protein GI | 284044265 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.490323 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTGATCG GCAAGTCTGC ACCTGCGGCG GCGATCGTCG GCGCGCTCTG CGCGTTCGCG ATCGCCGGCT GCGGCGGTGA CAGCGACTCG TCGTCGACGA CGGACACGAG CGCGAGCAAG GGCGGAGGCG GAGGCGCCGC CGCGACAGGC GCGCTCAGAA GCGGCGGCAC GCTGACCGCC GCGCTGACGG CCGAGCCCGA CTACCTCGAC CCAGCGAGAG CGCAGTCGGG GCAGAGCTGG CAGGCGCTGG TGCCGATCTG CGAGGGCCTC TACGCGCTCG ACGGCGGCGC GACGAAACCG CAGCTCGCCG TCGGCGAGCC GGTCCTCCGG GACGGCGGCA AGACGGTCAC GATCAGGCTG CGCAGAGGCG TCAGGTTCAA CGACGGCACG CCGTTCGACG CCGAAGCGGT AAAGACGTCG CTGCTGCGCA ACCACAGAAC GTCGGTGCTG TTCCAGGGCA TCCCGCTGGA GCAGGTGCTG ACGCCGTCGG CAGACACGAT CGTGCTGAAG CTGGCGGCGC CGTACGCGCC GCTGATCTCC GACCTCGCCG GCCCCGGCGG CATGATCGCC TCGCCCGCGC AGCTGGAGAA GCTGGGCGAC AGATTCGGCG ACGACCCGGT CTGCGTCGGG CCGTTCGAGT TCGTCAGCCG CAGAGCCGGC GACGCGATCA CGCTCAGACG CGCGCCCGGG TACTACGACG CCGACCGCGT CAAGCTCGAC GAGCTGGTCT TCAAGATCAT GCCGGACGAG GACGCGCGCA GCACGAGCCT GCGCTCCGGC ACGATCGACG TCGCGCTGGA CCCGGCCGAC GCGGAAGGGC TTGCGGGCGA CGACGCGCTG CGCGTCGCGG AGGTTCCCGG CGCGGGCTGG CACGGCGTCT ACTTCAACGT CGGCAACTCC GCCGGCATGG GCAAGCCGCT GGCGCCGCGC GACAGCGCGC TCGGCAGATC GGCCGCCGTG CGCAGAGCGT TCGAGCGCAC GCTCGACCGC GAGGCGCTGC TGTCGCTCGG CCACGACGCC GGCGCGGACG TCTCGTGCAG CATCATCTCG ACGACCAGCT CGCTGCGCTC CGACGTCCCG TGCGAGGCGA ACGCCGATCC CGAGGCGGCG CGACAGCTGC TGGAGGAGGC CGGGGTGGAG ACGCCGGTCA AGGCCCAGCT GAACGTCTCG GCGTCGCCGG ACCTGCTGCG TGAGGCGCAG GCGATCCAGG CGATGGCGAG AGAGGGCGGG TTCGAGGTCG AGATCGACCA GTGCGACGTC GCGAGCTGCA TCAAGCGGCT GATCGGCGGC GACTTCGACC TGGCGCTCGG CGGCTTCTCC GGCTTCCCCG ACCCCGATCC GAGCATCAGC CCGTTCGTCT CGACCAGAGG CGGGTTCAAC TTCGTCGGCA TGTCGGACCC GGAGCTCGAC AGACTGCTGG AGCAGGCCCG CGCCGCGTCC GGCGACGAGG CGGAGCGCAG ACAGCTCTAC GCACGCGCTC TGGAGATCGT CAGCGAGCAG CTGCCGCTGG CGGTCATCGG CAACCCCGGC GTGACCGTCG CGTCGCGCAG CGACGTCGGC GGCTTCGAGG TCTCCGCGAG CGAGATCGTC GACTTCACCG GCGCCGGGTT CACGCAGGGC TGA
|
Protein sequence | MLIGKSAPAA AIVGALCAFA IAGCGGDSDS SSTTDTSASK GGGGGAAATG ALRSGGTLTA ALTAEPDYLD PARAQSGQSW QALVPICEGL YALDGGATKP QLAVGEPVLR DGGKTVTIRL RRGVRFNDGT PFDAEAVKTS LLRNHRTSVL FQGIPLEQVL TPSADTIVLK LAAPYAPLIS DLAGPGGMIA SPAQLEKLGD RFGDDPVCVG PFEFVSRRAG DAITLRRAPG YYDADRVKLD ELVFKIMPDE DARSTSLRSG TIDVALDPAD AEGLAGDDAL RVAEVPGAGW HGVYFNVGNS AGMGKPLAPR DSALGRSAAV RRAFERTLDR EALLSLGHDA GADVSCSIIS TTSSLRSDVP CEANADPEAA RQLLEEAGVE TPVKAQLNVS ASPDLLREAQ AIQAMAREGG FEVEIDQCDV ASCIKRLIGG DFDLALGGFS GFPDPDPSIS PFVSTRGGFN FVGMSDPELD RLLEQARAAS GDEAERRQLY ARALEIVSEQ LPLAVIGNPG VTVASRSDVG GFEVSASEIV DFTGAGFTQG
|
| |