Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcDH1_2118 |
Symbol | |
ID | 0 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli DH1 |
Kingdom | Bacteria |
Replicon accession | CP001637 |
Strand | - |
Start bp | 2260260 |
End bp | 2261450 |
Gene Length | 1191 bp |
Protein Length | 396 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | |
Product | major facilitator superfamily MFS_1 |
Protein accession | ACX39773 |
Protein GI | 260449351 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.00060856 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACAACAA ACACTGTTTC CCGCAAAGTG GCGTGGCTAC GGGTCGTTAC GCTGGCAGTC GCCGCCTTCA TCTTCAACAC CACCGAATTT GTCCCTGTTG GCCTGCTCTC TGACATTGCG CAAAGTTTTC ACATGCAAAC CGCTCAGGTC GGCATCATGT TGACCATTTA CGCATGGGTA GTAGCGCTAA TGTCATTGCC TTTTATGTTA ATGACCAGTC AGGTTGAACG GCGCAAATTA CTGATCTGCC TGTTTGTGGT GTTTATTGCC AGCCACGTAC TGTCGTTTTT GTCGTGGAGC TTTACCGTTC TGGTGATCAG TCGCATTGGT GTGGCTTTTG CACATGCGAT TTTCTGGTCG ATTACGGCGT CTCTGGCGAT CCGTATGGCT CCGGCCGGGA AGCGAGCACA GGCATTGAGT TTAATTGCCA CCGGTACAGC ACTGGCGATG GTCTTAGGTT TACCTCTCGG GCGCATTGTG GGCCAGTATT TCGGTTGGCG AATGACCTTC TTCGCGATTG GTATTGGGGC GCTTATCACC CTTTTGTGCC TGATTAAGTT ACTTCCCTTA CTGCCCAGTG AGCATTCCGG TTCACTGAAA AGCCTCCCGC TATTGTTCCG CCGCCCGGCA TTGATGAGCA TTTATTTGTT AACTGTGGTG GTTGTCACCG CCCATTACAC GGCATACAGC TATATCGAGC CTTTTGTACA AAACATTGCG GGATTCAGCG CCAACTTTGC CACGGCATTA CTGTTATTAC TCGGTGGTGC GGGCATTATT GGCAGCGTGA TTTTCGGTAA ACTGGGTAAT CAGTATGCGT CTGCGTTGGT GAGTACGGCG ATTGCGCTGT TGCTGGTGTG CCTGGCATTG CTGTTACCTG CGGCGAACAG TGAAATACAC CTCGGGGTGC TGAGTATTTT CTGGGGGATC GCGATGATGA TCATCGGGCT TGGTATGCAG GTTAAAGTGC TGGCGCTGGC ACCAGATGCT ACCGACGTCG CGATGGCGCT ATTCTCCGGC ATATTTAATA TTGGAATCGG GGCGGGTGCG TTGGTAGGTA ATCAGGTGAG TTTGCACTGG TCAATGTCGA TGATTGGTTA TGTGGGCGCG GTGCCTGCTT TTGCCGCGTT AATTTGGTCA ATCATTATAT TTCGCCGCTG GCCAGTGACA CTCGAAGAAC AGACGCAATA G
|
Protein sequence | MTTNTVSRKV AWLRVVTLAV AAFIFNTTEF VPVGLLSDIA QSFHMQTAQV GIMLTIYAWV VALMSLPFML MTSQVERRKL LICLFVVFIA SHVLSFLSWS FTVLVISRIG VAFAHAIFWS ITASLAIRMA PAGKRAQALS LIATGTALAM VLGLPLGRIV GQYFGWRMTF FAIGIGALIT LLCLIKLLPL LPSEHSGSLK SLPLLFRRPA LMSIYLLTVV VVTAHYTAYS YIEPFVQNIA GFSANFATAL LLLLGGAGII GSVIFGKLGN QYASALVSTA IALLLVCLAL LLPAANSEIH LGVLSIFWGI AMMIIGLGMQ VKVLALAPDA TDVAMALFSG IFNIGIGAGA LVGNQVSLHW SMSMIGYVGA VPAFAALIWS IIIFRRWPVT LEEQTQ
|
| |