Gene Information |
Locus tag | Cwoe_4369 |
Symbol | |
ID | 8734831 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Conexibacter woesei DSM 14684 |
Kingdom | Bacteria |
Replicon accession | NC_013739 |
Strand | + |
Start bp | 4652484 |
End bp | 4654094 |
Gene Length | 1611 bp |
Protein Length | 536 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 646504995 |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_003396158 |
Protein GI | 284045818 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
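
The length fields in this record can be cross-checked against the coordinates. The sketch below assumes the start and end positions are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative only and do not belong to any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS ending in one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(4652484, 4654094)
print(length, protein_length_aa(length))  # prints "1611 536"
```

Both values agree with the Gene Length and Protein Length fields listed for Cwoe_4369.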
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCAGCA CGACCCGTGC CGTGGTTCTC GCAGGAGCGC TCGGCGTCTT GGCGCTCAGC GGCTGCGGAA GCGGCGGGAA CGACTCGCCG ACCGGCACCC AGCCCGCCGA CGGCTCGGTG CCCGCCACCA AGCCGGTGAG AGACGGTGGA ACGCTCCGCG TGGGGCTGAC CGCCGAGCCT GACTACATCG ACCCGGCCCG GATGCAGTCG CTCGACTCGT GGCAGGTGCT CACGGCGATG TGCGAGGGCC TCTACAAGAT CGGCGCGAGA GGGCAAGCGG TCCCGCAGCT CGCCGTCGGC GCACCGCGGG TGTCCAAGGA CGGCCTGACC GCGACGATCA AGCTGCGCGA CGGCGTGCAG TTCAACGACG GCACGCCGTT CGACGCGAGA GCGGTCAAGC TGTCGCTCGA GCGCAACGGC AGAACGTCGG TCCTGTTCCA GGGCAACGGC ATCGAGCGGA TCGACGCGCC CGCCGACGAC ACCGTCGTCC TGCACCTGGC CAGACCCTAT GCGCCGCTGG AAGGCGACCT CGCCGGCCCC GGCGGGATGA TCGGCTCGCC GAAGCAGATC GCTGCGCTCG GCGACAAGTT CGGCGATCGC CCGGTGTGCG TCGGCCCGTT CGAGTGGGTC AGCCGGCGCG GCGGCGACTC GATCAGGCTC AGACGGTCCG ACGTCTACTA CGACAAGGAG AACGTCCACC TCGACGGGCT CGACTTCAAG GTGATCCCGG ACACGAACGC CCGTGGCGTC AGCCTGCGCG CCGGTGAGAT CGACATCGCG GCCGAGCCGC CGGAGCCCGG GGCGCTCAAG TCCGACTCCA ACCTCGACGT CACGACGATC ACCGGCGCGG GCTGGAAGGG CTTGTACGTG AACGTCGGAA ACGTCGACGG CGCGGGCAAG CCGCCCAAGC CGCGCGACAC GCCGTTGTCG ACGTCGGCCG AGGCGCGCCA GGCGCTGTCG CTCGCGATCG ACCGCCAGGC GCTCATCAAC CTCACCAGCG GTGACGGGTC AGCGCCGGCG TGCAGCGCGA TCCCGCCCAG CAGCCCGTTC TACGACGACC CGCCGTGCCC GCAGAAGGCG GATCCCGACG CGGCGAGAGC GCTGCTCGAG AGAGCAGGCG TCAGAACGCC CGTCAAGGGC ACGATGGTCG TCGCGGGCAG CCCTGAGGAG ACACGCACCG CGCAGGCCGT GCAGGGCATG GCGCGCGACG CCGGCTTCGA CTTCGAGATC GAGACCTGCG ACGTCGCCAC CTGCATCAGA CGACTGCTCG CGGGCGACTT CGACGTCACG CTGGGCGGCT TCGACGGTGT CGTCGATCCA GACCAGAGCC TCAGTCCGTT CGTCGCGAGC ACCGGCGGCT TCAACTTCGT CGGCGAGTCC GACGCGGAGC TCGACCGGCT GCTGGCAAGC GCGCGAGCCG AGTCGACCGA CGTCGATGCG CGGCGCAAGC TGTACAGACA GGCGCTCGAC CGCATCCGCG AGCGCGCCGC GCTGATCGTC TTCTACAACA CGGGCAGCTC GGCGGCGGCA CGCAAGAACG TCAGCGGATA CGTGCTGACG CCCTCGGTCC TGCTGGACTA CAAGCAAGCC GGCTTCACCA CCGGTCCATG A
|
Protein sequence | MRSTTRAVVL AGALGVLALS GCGSGGNDSP TGTQPADGSV PATKPVRDGG TLRVGLTAEP DYIDPARMQS LDSWQVLTAM CEGLYKIGAR GQAVPQLAVG APRVSKDGLT ATIKLRDGVQ FNDGTPFDAR AVKLSLERNG RTSVLFQGNG IERIDAPADD TVVLHLARPY APLEGDLAGP GGMIGSPKQI AALGDKFGDR PVCVGPFEWV SRRGGDSIRL RRSDVYYDKE NVHLDGLDFK VIPDTNARGV SLRAGEIDIA AEPPEPGALK SDSNLDVTTI TGAGWKGLYV NVGNVDGAGK PPKPRDTPLS TSAEARQALS LAIDRQALIN LTSGDGSAPA CSAIPPSSPF YDDPPCPQKA DPDAARALLE RAGVRTPVKG TMVVAGSPEE TRTAQAVQGM ARDAGFDFEI ETCDVATCIR RLLAGDFDVT LGGFDGVVDP DQSLSPFVAS TGGFNFVGES DAELDRLLAS ARAESTDVDA RRKLYRQALD RIRERAALIV FYNTGSSAAA RKNVSGYVLT PSVLLDYKQA GFTTGP
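
The sequences above are displayed in space-separated blocks of ten characters, so the grouping has to be removed before any downstream analysis. The following is a minimal sketch of that cleanup plus a simple GC fraction calculation. The helper names are hypothetical, and the sketch makes no claim about how the database derives its own GC content field.

```python
def clean_sequence(display_text: str) -> str:
    """Remove the spaces and line breaks used to group residues for display."""
    return "".join(display_text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C in a cleaned DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


# First three display blocks of the gene sequence in this record.
example = "ATGCGCAGCA CGACCCGTGC CGTGGTTCTC"
dna = clean_sequence(example)
print(len(dna), round(gc_fraction(dna), 2))
```

Applying the same two helpers to the full gene sequence gives a value that can be compared with the GC content field reported above.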