Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcDH1_2524 |
Symbol | |
ID | 0 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli DH1 |
Kingdom | Bacteria |
Replicon accession | CP001637 |
Strand | + |
Start bp | 2696910 |
End bp | 2697956 |
Gene Length | 1047 bp |
Protein Length | 348 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | |
Product | extracellular solute-binding protein family 1 |
Protein accession | ACX40160 |
Protein GI | 260449738 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.00059946 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAAAT GGTCACGCCA CCTGCTCGCG GCGGGTGCTC TGGCACTGGG CATGAGCGCC GCTCACGCCG ATGACAACAA CACGCTGTAT TTCTACAACT GGACCGAGTA CGTGCCGCCA GGACTGCTTG AACAGTTCAC CAAAGAAACC GGTATTAAGG TTATCTATTC GACTTACGAG TCGAACGAAA CCATGTACGC CAAGCTGAAA ACATACAAAG ACGGTGCCTA TGATCTGGTG GTTCCTTCAA CCTATTACGT TGATAAAATG CGTAAAGAAG GGATGATCCA GAAGATCGAC AAGTCGAAGT TAACAAACTT CAGCAATCTC GATCCAGACA TGCTCAACAA GCCTTTTGAC CCGAATAACG ACTACTCCAT TCCGTATATC TGGGGTGCGA CGGCGATTGG TGTTAACGGT GATGCGGTGG ATCCGAAATC TGTCACCAGC TGGGCCGATC TGTGGAAGCC AGAGTACAAA GGCAGCCTGC TGTTGACCGA CGATGCCCGT GAAGTGTTCC AGATGGCGCT GCGTAAGCTG GGCTACTCCG GTAACACCAC CGATCCGAAA GAGATTGAAG CTGCATATAA CGAGCTGAAA AAACTGATGC CAAACGTGGC AGCCTTTAAC TCCGATAACC CGGCGAACCC GTACATGGAA GGCGAAGTTA ACCTCGGCAT GATCTGGAAC GGTTCTGCTT TCGTTGCACG CCAGGCGGGT ACGCCAATTG ACGTGGTGTG GCCGAAAGAA GGCGGCATTT TCTGGATGGA CAGCCTGGCG ATCCCGGCAA ATGCCAAAAA CAAAGAAGGC GCGCTGAAAT TGATCAACTT CCTGCTGCGC CCGGATGTGG CAAAACAGGT TGCTGAAACT ATCGGTTATC CAACGCCAAA CCTTGCGGCG CGTAAGCTGT TAAGTCCAGA AGTGGCGAAC GATAAAACAC TCTACCCGGA TGCTGAAACC ATTAAAAATG GCGAATGGCA GAATGACGTT GGCGCAGCCA GCAGCATTTA TGAAGAGTAT TATCAGAAGC TGAAAGCAGG ACGTTAA
|
Protein sequence | MKKWSRHLLA AGALALGMSA AHADDNNTLY FYNWTEYVPP GLLEQFTKET GIKVIYSTYE SNETMYAKLK TYKDGAYDLV VPSTYYVDKM RKEGMIQKID KSKLTNFSNL DPDMLNKPFD PNNDYSIPYI WGATAIGVNG DAVDPKSVTS WADLWKPEYK GSLLLTDDAR EVFQMALRKL GYSGNTTDPK EIEAAYNELK KLMPNVAAFN SDNPANPYME GEVNLGMIWN GSAFVARQAG TPIDVVWPKE GGIFWMDSLA IPANAKNKEG ALKLINFLLR PDVAKQVAET IGYPTPNLAA RKLLSPEVAN DKTLYPDAET IKNGEWQNDV GAASSIYEEY YQKLKAGR
|
| |