Gene Information |
Locus tag | EcDH1_0262 |
Symbol | |
ID | 0 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli DH1 |
Kingdom | Bacteria |
Replicon accession | CP001637 |
Strand | + |
Start bp | 291438 |
End bp | 292754 |
Gene Length | 1317 bp |
Protein Length | 438 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | |
Product | extracellular solute-binding protein family 1 |
Protein accession | ACX37953 |
Protein GI | 260447531 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
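The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical helpers introduced for illustration; they are not part of the source database.

```python
# Sketch: check that the record's coordinates and lengths agree.
# Assumptions (not stated in the record): coordinates are 1-based and
# inclusive, and the final codon of the CDS is a stop codon.

def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids encoded by a CDS that ends in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(291438, 292754)
print(length, protein_length(length))  # prints "1317 438"
```

Under these assumptions, the computed values match the Gene Length (1317 bp) and Protein Length (438 aa) fields listed in the record.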
Plasmid Coverage information |
Num covering plasmid clones | 58 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAACCGT TACATTATAC AGCTTCAGCA CTGGCGCTCG GACTGGCGTT AATGGGGAAT GCACAGGCAG TGACGACCAT TCCGTTCTGG CATTCTATGG AAGGGGAACT GGGTAAAGAG GTGGATTCTC TGGCCCAACG TTTTAACGCC GAAAACCCGG ATTACAAAAT TGTACCGACC TATAAAGGCA ACTACGAACA GAATTTAAGC GCGGGGATTG CCGCATTTCG TACCGGCAAC GCGCCGGCTA TTTTGCAGGT TTATGAAGTT GGCACCGCCA CCATGATGGC GTCGAAAGCC ATTAAACCGG TGTATGACGT GTTTAAAGAG GCAGGGATTC AGTTCGATGA GTCGCAGTTT GTGCCGACGG TTTCAGGTTA CTACTCCGAC AGCAAAACGG GCCACTTACT CTCCCAGCCA TTCAACAGCT CGACCCCCGT TCTCTATTAC AACAAAGACG CCTTCAAGAA AGCAGGATTA GACCCGGAAC AGCCGCCGAA AACCTGGCAG GATCTGGCGG ACTATGCCGC GAAACTGAAA GCCTCCGGCA TGAAGTGCGG CTACGCCAGC GGCTGGCAGG GCTGGATCCA ACTGGAAAAC TTTAGCGCCT GGAACGGTCT GCCGTTTGCC AGCAAAAACA ACGGCTTTGA CGGCACGGAC GCGGTGCTGG AGTTCAATAA GCCGGAGCAG GTGAAACACA TCGCCATGCT CGAGGAGATG AACAAGAAGG GCGACTTCAG CTACGTCGGT CGTAAGGATG AATCCACCGA GAAGTTCTAT AACGGTGATT GCGCGATGAC CACCGCCTCT TCCGGTTCTC TTGCCAACAT TCGCGAGTAC GCCAAATTTA ACTACGGCGT AGGCATGATG CCTTACGACG CCGATGCGAA AGATGCGCCA CAAAACGCCA TTATCGGCGG AGCCAGCCTG TGGGTGATGC AGGGTAAAGA TAAAGAAACG TATACCGGTG TGGCGAAGTT CCTCGATTTC CTCGCGAAGC CAGAAAACGC TGCCGAGTGG CATCAGAAAA CCGGTTATCT GCCAATCACC AAAGCAGCGT ATGACCTGAC CCGTGAGCAG GGCTTTTATG AGAAAAACCC AGGGGCGGAT ACCGCGACGC GTCAGATGCT GAATAAGCCG CCGTTGCCGT TCACCAAAGG GCTGCGTCTG GGCAACATGC CGCAGATCCG CGTGATTGTG GATGAAGAGC TGGAGAGCGT GTGGACCGGT AAGAAGACAC CACAGCAGGC ACTGGATACC GCCGTTGAGC GTGGAAATCA GTTGCTGCGC CGCTTTGAGA AATCGACGAA GTCTTAA
|
Protein sequence | MKPLHYTASA LALGLALMGN AQAVTTIPFW HSMEGELGKE VDSLAQRFNA ENPDYKIVPT YKGNYEQNLS AGIAAFRTGN APAILQVYEV GTATMMASKA IKPVYDVFKE AGIQFDESQF VPTVSGYYSD SKTGHLLSQP FNSSTPVLYY NKDAFKKAGL DPEQPPKTWQ DLADYAAKLK ASGMKCGYAS GWQGWIQLEN FSAWNGLPFA SKNNGFDGTD AVLEFNKPEQ VKHIAMLEEM NKKGDFSYVG RKDESTEKFY NGDCAMTTAS SGSLANIREY AKFNYGVGMM PYDADAKDAP QNAIIGGASL WVMQGKDKET YTGVAKFLDF LAKPENAAEW HQKTGYLPIT KAAYDLTREQ GFYEKNPGAD TATRQMLNKP PLPFTKGLRL GNMPQIRVIV DEELESVWTG KKTPQQALDT AVERGNQLLR RFEKSTKS
|
| |
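The sequences above are printed in space-separated blocks of 10 characters. As a rough sketch of how the Gene sequence field could be related to the GC content and Protein sequence fields, the code below strips the spacing, computes the GC fraction, and translates the CDS. The helper names `clean_sequence`, `gc_fraction`, and `translate` are hypothetical. The codon table is written out by hand; translation table 11 assigns the same amino acids to codons as the standard code and differs only in the permitted start codons, so the sketch assumes the CDS begins with ATG, as this record does.

```python
# Sketch: relate the Gene sequence field to GC content and Protein sequence.
# All function names here are illustrative, not part of the source database.

BASES = "TCAG"
# Amino acids for all 64 codons in NCBI order (first base varies slowest).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: aa
    for (a, b, c), aa in zip(
        ((a, b, c) for a in BASES for b in BASES for c in BASES), AMINO
    )
}


def clean_sequence(raw: str) -> str:
    """Remove the spacing used for display and upper-case the sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First five codons of the gene sequence above, plus its stop codon.
fragment = clean_sequence("ATGAAACCGT TACAT TAA")
print(translate(fragment))  # prints "MKPLH"
```

Passing the full Gene sequence field through `clean_sequence` and then `translate` should reproduce the Protein sequence field once its display spacing is removed, and `gc_fraction` gives a value to compare with the reported GC content.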