Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcDH1_3164 |
Symbol | |
ID | 0 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli DH1 |
Kingdom | Bacteria |
Replicon accession | CP001637 |
Strand | + |
Start bp | 3403643 |
End bp | 3405343 |
Gene Length | 1701 bp |
Protein Length | 566 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | |
Product | extracellular solute-binding protein family 5 |
Protein accession | ACX40790 |
Protein GI | 260450368 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 33 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGATTGC TCAACCGTCT TAACCAGTAT CAACGTCTGT GGCAACCTTC CGCCGGAAAG CCGCAAACCG TCACCGTCAG CGAACTGGCC GAACGCTGTT TTTGCAGCGA ACGCCATGTT CGTACGCTGT TGCGTCAGGC ACAGGAGGCG GGATGGCTGG AGTGGCAGGC GCAGTCAGGA CGCGGAAAGC GCGGACAATT ACGCTTTCTG GTCACGCCGG AATCGCTACG CAATGCGATG ATGGAACAGG CACTGGAAAC CGGAAAGCAG CAAGATGTGC TGGAGCTGGC GCAACTGGCC CCAGGTGAGC TGCGCACTCT GTTACAGCCG TTTATGGGCG GACAATGGCA AAACGATACA CCCACGTTGC GTATTCCCTA CTATCGCCCG CTCGAACCGC TACAACCAGG CTTTTTGCCC GGCCGTGCCG AGCAGCATCT CGCCGGGCAG ATATTTTCCG GCCTGACCCG CTTCGATAAT AATACTCAGC GCCCGATTGG CGATTTAGCG CATCACTGGG AAACCTCTAC TGACGGGTTA CGCTGGGACT TTTATCTTCG TTCAACCCTA CACTGGCATA ACGGCGATGC AGTAAAAGCC TCACACTTAC ACCAGCGATT ATTGATGCTG TTACAACTGC CAGCACTGGA TCAATTATTT ATTAGCGTGA AGCGTATTGA AGTCACCCAT CCGCAGTGTC TGACCTTCTT TTTACATCGC CCTGATTACT GGCTTGCGCA CCGGCTGGCG AGCTATTGCA GCCATCTGGC GCATCCGCAA TTCCCACTGA TCGGCACGGG TCCTTTTCGC TTAACACAAT TCACAGCAGA GCTGGTGCGC CTGGAAAGCC ATGATTATTA CCATTTACGT CATCCGCTGC TTAAAGCGGT TGAGTACTGG ATAACTCCGC CGCTTTTCGA AAAAGATTTG GGAACCAGTT GTCGGCATCC CGTGCAAATC ACCATCGGCA AACCGGAGGA GCTGCAACGG GTCAGCCAGG TCAGTAGCGG CATCAGTTTA GGTTTTTGCT ATTTGACGTT GCGCAAAAGT CCCCGACTCT CCCTCTGGCA GGCGCGAAAA GTGATCTCCA TTATTCATCA ATCCGGTTTA TTACAAACGT TAGAAGTCGG AGAAAACCTG ATCACCGCCA GTCATGCATT ACTGTCAGGC TGGACTATTC CGCATTGGCA GGTACCGGAT GAAGTCAAAC TACCGAAAAC CTTGACGCTG GTTTATCACC TACCGATAGA ACTTCATACC ATGGCAGAAC GCCTACAGGC GACACTGGCA GCGGAAGGCT GTGAACTCAC AATTATTTTT CATAACGCAA AAAACTGGGA CGACACGACC CTACAGGCAC ACGCAGACCT CATGATGGGC GACAGATTAA TTGGCGAAGC ACCGGAATAT ACTCTGGAGC AATGGCTGCG CTGCGATCCG CTGTGGCCAC ATGTTTTCGA CGCTCCAGCA TACGCACATC TACAATCGAC ACTGGATGCC GTGCAAATAA TGCCTGATGA AGAAAACCGA TTTAATGCCC TGAAAGCGGT TTTTAGCCAG TTAATGACAG ATGCGACGCT GACGCCGCTG TTCAACTATC ACTATCGCAT TAGTGCCCCT CCCGGCGTGA ACGGTGTGCG ACTGACACCG CGCGGCTGGT TTGAATTTAC CGAAGCCTGG CTTCCCGCGC CATCGCAATG A
|
Protein sequence | MRLLNRLNQY QRLWQPSAGK PQTVTVSELA ERCFCSERHV RTLLRQAQEA GWLEWQAQSG RGKRGQLRFL VTPESLRNAM MEQALETGKQ QDVLELAQLA PGELRTLLQP FMGGQWQNDT PTLRIPYYRP LEPLQPGFLP GRAEQHLAGQ IFSGLTRFDN NTQRPIGDLA HHWETSTDGL RWDFYLRSTL HWHNGDAVKA SHLHQRLLML LQLPALDQLF ISVKRIEVTH PQCLTFFLHR PDYWLAHRLA SYCSHLAHPQ FPLIGTGPFR LTQFTAELVR LESHDYYHLR HPLLKAVEYW ITPPLFEKDL GTSCRHPVQI TIGKPEELQR VSQVSSGISL GFCYLTLRKS PRLSLWQARK VISIIHQSGL LQTLEVGENL ITASHALLSG WTIPHWQVPD EVKLPKTLTL VYHLPIELHT MAERLQATLA AEGCELTIIF HNAKNWDDTT LQAHADLMMG DRLIGEAPEY TLEQWLRCDP LWPHVFDAPA YAHLQSTLDA VQIMPDEENR FNALKAVFSQ LMTDATLTPL FNYHYRISAP PGVNGVRLTP RGWFEFTEAW LPAPSQ
|
| |