Gene Information |
Locus tag | EcDH1_4245 |
Symbol | |
ID | 0 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli DH1 |
Kingdom | Bacteria |
Replicon accession | CP001637 |
Strand | + |
Start bp | 4608872 |
End bp | 4610749 |
Gene Length | 1878 bp |
Protein Length | 625 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | |
Product | PTS system, beta-glucoside-specific IIABC subunit |
Protein accession | ACX41843 |
Protein GI | 260451421 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
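The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene span includes the stop codon (the listed gene sequence ends in TAA). The function names are hypothetical and not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose span includes the stop codon (assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # the stop codon does not encode a residue


# Values copied from the record for EcDH1_4245
start_bp, end_bp = 4608872, 4610749
gene_len = gene_length_bp(start_bp, end_bp)
prot_len = protein_length_aa(gene_len)
print(gene_len, prot_len)  # 1878 625
```

Under these assumptions the computed values agree with the Gene Length (1878 bp) and Protein Length (625 aa) fields.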
Plasmid Coverage information |
Num covering plasmid clones | 63 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGGAGT TAGCCAGAAA AATAGTCGCA GGAGTCGGGG GCGCAGATAA CATTGTGAGT CTGATGCATT GCGCAACGCG ATTACGTTTT AAATTAAAGG ATGAAAGCAA AGCGCAAGCA GAGGTACTGA AAAAGACCCC CGGTATTATT ATGGTGGTGG AAAGCGGTGG CCAGTTTCAG GTGGTCATAG GTAACCATGT GGCCGATGTC TTCCTGGCGG TTAACAGTGT GGCAGGCCTT GACGAAAAAG CGCAACAGGC ACCGGAAAAT GATGATAAAG GTAATCTGCT AAACCGCTTT GTTTATGTTA TTTCAGGTAT TTTTACGCCT CTGATCGGTT TGATGGCGGC AACCGGGATC TTGAAAGGTA TGCTGGCTCT GGCGCTCACT TTTCAGTGGA CGACCGAACA AAGTGGTACT TATTTAATTT TATTCAGCGC CAGTGATGCC TTGTTTTGGT TCTTCCCGAT AATCCTGGGA TACACCGCGG GGAAACGCTT CGGCGGTAAT CCATTTACTG CCATGGTGAT TGGTGGAGCG TTAGTGCATC CATTAATTCT GACTGCTTTC GAGAACGGGC AAAAAGCGGA TGCGCTGGGG CTGGATTTCC TGGGTATTCC GGTCACATTG TTGAATTACT CGTCATCGGT TATTCCCATT ATTTTTTCTG CCTGGTTGTG CAGCATTCTG GAACGCCGAC TTAATGCGTG GTTACCGTCG GCAATCAAAA ATTTCTTCAC ACCATTGCTA TGTCTGATGG TTATCACACC CGTCACCTTT CTGCTGGTGG GGCCGCTATC AACCTGGATA AGCGAACTGA TTGCCGCCGG TTATCTCTGG CTTTATCAGG CGGTTCCTGC ATTTGCGGGC GCGGTAATGG GCGGCTTCTG GCAAATCTTC GTCATGTTCG GACTGCACTG GGGCCTGGTG CCGCTGTGTA TCAATAACTT CACCGTGCTG GGCTACGACA CCATGATCCC GCTGTTAATG CCCGCCATTA TGGCGCAGGT CGGGGCGGCG CTCGGCGTCT TCCTCTGCGA ACGCGATGCG CAGAAAAAAG TGGTGGCGGG ATCAGCGGCG TTGACGAGTC TGTTTGGTAT CACCGAACCA GCGGTATATG GCGTCAACCT GCCGCGTAAG TACCCCTTTG TTATCGCCTG TATCAGTGGG GCTTTGGGGG CCACCATTAT TGGCTACGCG CAAACGAAAG TCTACTCCTT TGGTTTGCCA AGTATTTTCA CCTTCATGCA AACCATCCCG TCAACGGGAA TTGATTTCAC CGTCTGGGCC AGCGTTATTG GCGGTGTCAT TGCCATCGGT TGCGCATTTG TCGGTACGGT GATGCTTCAT TTCATCACCG CTAAACGTCA GCCAGCGCAG GGTGCCCCGC AAGAGAAAAC ACCAGAGGTT ATTACACCAC CTGAGCAGGG CGGTATCTGT TCACCGATGA CGGGAGAGAT TGTGCCGCTC ATTCACGTCG CTGATACCAC GTTTGCCAGT GGCCTGTTGG GTAAAGGTAT TGCCATTCTG CCCTCGGTTG GTGAAGTGCG TTCTCCGGTT GCGGGTCGAA TTGCTTCGTT GTTCGCCACA TTACACGCCA TTGGCATTGA GTCAGATGAT GGTGTGGAGA TCCTGATTCA TGTCGGTATC GACACCGTAA AACTGGACGG CAAATTCTTT TCCGCTCACG TCAACGTGGG TGACAAGGTC AATACAGGCG ATCGGCTGAT TTCTTTTGAT ATCCCTGCTA TTCGCGAGGC CGGATTTGAT CTGACGACGC CGGTATTAAT CAGTAATAGC 
GATGATTTTA CGGACGTATT ACCCCACGGC ACGGCGCAGA TAAGCGCAGG TGAACCGCTG TTATCCATCA TTCGCTAA
|
Protein sequence | MTELARKIVA GVGGADNIVS LMHCATRLRF KLKDESKAQA EVLKKTPGII MVVESGGQFQ VVIGNHVADV FLAVNSVAGL DEKAQQAPEN DDKGNLLNRF VYVISGIFTP LIGLMAATGI LKGMLALALT FQWTTEQSGT YLILFSASDA LFWFFPIILG YTAGKRFGGN PFTAMVIGGA LVHPLILTAF ENGQKADALG LDFLGIPVTL LNYSSSVIPI IFSAWLCSIL ERRLNAWLPS AIKNFFTPLL CLMVITPVTF LLVGPLSTWI SELIAAGYLW LYQAVPAFAG AVMGGFWQIF VMFGLHWGLV PLCINNFTVL GYDTMIPLLM PAIMAQVGAA LGVFLCERDA QKKVVAGSAA LTSLFGITEP AVYGVNLPRK YPFVIACISG ALGATIIGYA QTKVYSFGLP SIFTFMQTIP STGIDFTVWA SVIGGVIAIG CAFVGTVMLH FITAKRQPAQ GAPQEKTPEV ITPPEQGGIC SPMTGEIVPL IHVADTTFAS GLLGKGIAIL PSVGEVRSPV AGRIASLFAT LHAIGIESDD GVEILIHVGI DTVKLDGKFF SAHVNVGDKV NTGDRLISFD IPAIREAGFD LTTPVLISNS DDFTDVLPHG TAQISAGEPL LSIIR
|
| |
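The sequences above are displayed in space-separated blocks of 10 characters. A minimal sketch of how such a field could be cleaned and checked is given below. The helper names are hypothetical, the GC fraction is a plain count of G and C bases, and the stop codons TAA, TAG and TGA are those of translation table 11. The example runs on the first two blocks of the gene sequence only, not on the full gene.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons in translation table 11


def clean_sequence(grouped: str) -> str:
    """Remove the spaces and line breaks that split a sequence into display blocks."""
    return "".join(grouped.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


def ends_with_stop_codon(dna: str) -> bool:
    """True if the length is a multiple of 3 and the last codon is a stop codon."""
    return len(dna) % 3 == 0 and dna[-3:] in STOP_CODONS


# First two display blocks of the gene sequence (a fragment, for illustration)
fragment = clean_sequence("ATGACGGAGT TAGCCAGAAA")
print(len(fragment), gc_fraction(fragment))  # 20 0.45
```

Applying `clean_sequence` to the complete Gene sequence field and passing the result to `gc_fraction` would give a value to compare with the GC content field.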