Gene Information |
Locus tag | EcDH1_1873 |
Symbol | |
ID | 0 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli DH1 |
Kingdom | Bacteria |
Replicon accession | CP001637 |
Strand | + |
Start bp | 2024499 |
End bp | 2025857 |
Gene Length | 1359 bp |
Protein Length | 452 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | |
Product | major facilitator superfamily MFS_1 |
Protein accession | ACX39531 |
Protein GI | 260449109 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
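The coordinates and lengths in the table above can be cross-checked with a few lines of arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (a common convention, but not stated in this record) and that the CDS ends with a single untranslated stop codon. The variable names are illustrative only.

```python
# Values copied from the Gene Information table above.
start_bp = 2024499
end_bp = 2025857

# Assumption: coordinates are 1-based and inclusive on the replicon.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, protein_length)  # 1359 452
```

Under these assumptions the result agrees with the reported gene length of 1359 bp and protein length of 452 aa.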
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.0070319 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAACAAT ATGATCAAAT TGGCGCAAGA CTGGACCGCT TGCCTTTGGC CCGGTTTCAT TATCGTATAT TTGGTATTAT AAGCTTTAGT CTGTTATTAA CAGGGTTTTT GAGTTACTCC GGTAATGTCG TCTTAGCAAA GCTGGTAAGC AATGGATGGT CAAATAATTT CCTCAATGCC GCCTTTACCT CGGCATTAAT GTTTGGTTAT TTCATCGGCT CACTTACTGG TGGGTTTATT GGTGACTACT TTGGGCGGCG CAGGGCGTTT CGCATAAATC TTCTCATCGT CGGTATTGCT GCAACAGGGG CCGCTTTTGT CCCTGATATG TACTGGCTCA TCTTCTTTCG CTTCCTGATG GGAACAGGAA TGGGGGCGCT GATTATGGTT GGCTATGCCT CATTTACGGA GTTTATCCCC GCGACGGTGC GTGGAAAATG GTCCGCGCGG CTCTCATTTG TTGGTAACTG GTCGCCCATG CTGTCTGCGG CGATAGGCGT GGTGGTTATC GCTTTTTTTA GTTGGCGAAT AATGTTTCTG CTGGGTGGTA TTGGCATACT GTTAGCCTGG TTTCTCTCAG GTAAATACTT TATCGAGTCG CCACGATGGC TGGCAGGGAA AGGGCAAATC GCAGGTGCAG AATGCCAACT TCGTGAAGTA GAGCAGCAAA TTGAAAGAGA GAAGAGTATT CGTTTACCCC CGCTTACTTC GTATCAGAGC AACAGCAAGG TTAAAGTAAT CAAGGGTACT TTCTGGCTCC TGTTTAAAGG TGAAATGTTA CGACGTACAT TAGTCGCGAT TACTGTTTTA ATTGCAATGA ACATTTCGCT TTATACCATC ACCGTATGGA TACCGACCAT ATTTGTTAAC TCCGGCATTG ATGTCGATAA ATCAATATTA ATGACCGCTG TTATTATGAT TGGCGCTCCG GTAGGAATAT TTATTGCGGC ATTAATTATT GATCATTTTC CTCGTCGGTT ATTTGGCTCC ACCTTACTTA TTATTATTGC CGTGTTAGGC TATATCTATT CAATTCAGAC TACAGAGTGG GCGATTTTAA TCTATGGACT GGTGATGATC TTCTTTTTAT ACATGTATGT TTGCTTCGCG TCGGCGGTTT ATATCCCGGA GCTTTGGCCA ACGCATTTAC GCCTGCGCGG TTCGGGTTTC GTTAATGCCG TCGGACGGAT CGTCGCAGTC TTCACGCCCT ATGGCGTTGC GGCATTATTA ACACATTATG GGTCGATCAC GGTGTTTATG GTACTTGGTG TTATGTTATT GCTCTGTGCG CTGGTTCTCT CCATTTTTGG CATCGAAACG CGGAAGGTGT CGTTGGAAGA GATTTCTGAG GTGAATTAA
|
Protein sequence | MEQYDQIGAR LDRLPLARFH YRIFGIISFS LLLTGFLSYS GNVVLAKLVS NGWSNNFLNA AFTSALMFGY FIGSLTGGFI GDYFGRRRAF RINLLIVGIA ATGAAFVPDM YWLIFFRFLM GTGMGALIMV GYASFTEFIP ATVRGKWSAR LSFVGNWSPM LSAAIGVVVI AFFSWRIMFL LGGIGILLAW FLSGKYFIES PRWLAGKGQI AGAECQLREV EQQIEREKSI RLPPLTSYQS NSKVKVIKGT FWLLFKGEML RRTLVAITVL IAMNISLYTI TVWIPTIFVN SGIDVDKSIL MTAVIMIGAP VGIFIAALII DHFPRRLFGS TLLIIIAVLG YIYSIQTTEW AILIYGLVMI FFLYMYVCFA SAVYIPELWP THLRLRGSGF VNAVGRIVAV FTPYGVAALL THYGSITVFM VLGVMLLLCA LVLSIFGIET RKVSLEEISE VN
|
| |
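The relationship between the gene sequence and the protein sequence above can be illustrated with a minimal translation sketch. It builds the codon table from the compact NCBI-style amino acid string, whose codon assignments are the same for the standard code and for translation table 11. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical and not part of this database. The sketch does not handle alternative start codons, which table 11 allows; this gene begins with ATG, so that does not matter here.

```python
# Codon table in NCBI order (bases T, C, A, G); these amino acid
# assignments are shared by the standard code and translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the whitespace that separates the 10-base blocks."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[pos:pos + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq):
    """Fraction of bases that are G or C."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 and last 9 bases of the gene sequence in this record.
gene_start = "ATGGAACAAT ATGATCAAAT TGGCGCAAGA"
gene_end = "GTGAATTAA"

print(translate(gene_start))  # MEQYDQIGAR
print(translate(gene_end))    # VN
```

The two fragments translate to the first ten and last two residues of the protein sequence listed above. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content reported in the record.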