Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcE24377A_0975 |
Symbol | |
ID | 5586123 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli E24377A |
Kingdom | Bacteria |
Replicon accession | NC_009801 |
Strand | + |
Start bp | 995396 |
End bp | 996544 |
Gene Length | 1149 bp |
Protein Length | 382 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 640924684 |
Product | putative MFS family transporter protein |
Protein accession | YP_001462098 |
Protein GI | 157158970 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCCACGT ATACCCGGCC TGTCATGCTT TTGCTGTCTG GCCTGCTTTT GTTGACTCTG GCGATCGCGG TGTTAAATAC ACTCGTGCCG CTTTGGCTCG CCCAGGAACA CATGTCCACA TGGCAGGTAG GCGTTGTCAG CTCATCCTAT TTTACCGGCA ACCTTGTCGG TACATTGCTG ACAGGGTATG TCATTAAACG CATTGGCTTT AACCGCAGCT ATTATCTGGC CTCCTTCATT TTTGCCGCTG GCTGTGCCGG CCTTGGCCTG ATGATTGGTT TCTGGAGCTG GTTGGCGTGG CGTTTTGTCG CTGGCGTCGG CTGTGCCATG ATTTGGGTGG TTGTTGAGAG CGCGCTGATG TGCAGTGGGA CGTCACGTAA CCGTGGGCGT TTGCTTGCTG CATACATGAT GGTTTATTAC GTGGGAACGT TTTTAGGCCA GTTACTGGTC AGCAAGGTCT CAACCGAGCT GATGAGCGTA TTGCCGTGGG TTACAGGTTT GACATTGGCA GGGATCTTAC CGCTGTTGTT TACGCGTGTG CTGAATCAGC AGGCTGAAAA CCATGATTCG ACGTCAATTA CGGCAATGCT AAAACTCCGT CAGGCGCGGC TTGGCGTGAA TGGCTGCATT ATCTCAGGAA TCGTTCTGGG ATCTCTATAT GGCCTGATGC CGTTGTACCT CAATCACAAA GGGGTGAGCA ATGCCAGTAT TGGTTTCTGG ATGGCGGTAC TGGTCAGTGC GGGTATCCTC GGACAATGGC CGATTGGACG TCTGGCGGAT AAGTTTGGTC GACTGCTGGT GTTGCGTGTT CAGGTCTTTG TCGTCATTCT CGGCAGTATC GCGATGCTTA GCCAGGCGGC GATGGCCCCT GCGTTATTCA TCCTCGGTGC CGCTGGCTTT ACGCTATATC CGGTGGCGAT GGCCTGGGCT TGTGAGAAAG TTGAACATCA TCAACTGGTG GCGATGAACC AGGCCTTACT GTTGAGCTAT ACCGTGGGAA GTCTGCTTGG CCCGTCATTT ACCGCTATGC TAATGCAGAA TTTCTCCGAT AATTTATTGT TTATCATGAT CGCCAGCGTA TCGTTTATCT ATTTGCTGAT GCTGCTGCGC AACGCCGGTC ATACGCCGAA ACCCGTTGCT CACGTGTAA
|
Protein sequence | MSTYTRPVML LLSGLLLLTL AIAVLNTLVP LWLAQEHMST WQVGVVSSSY FTGNLVGTLL TGYVIKRIGF NRSYYLASFI FAAGCAGLGL MIGFWSWLAW RFVAGVGCAM IWVVVESALM CSGTSRNRGR LLAAYMMVYY VGTFLGQLLV SKVSTELMSV LPWVTGLTLA GILPLLFTRV LNQQAENHDS TSITAMLKLR QARLGVNGCI ISGIVLGSLY GLMPLYLNHK GVSNASIGFW MAVLVSAGIL GQWPIGRLAD KFGRLLVLRV QVFVVILGSI AMLSQAAMAP ALFILGAAGF TLYPVAMAWA CEKVEHHQLV AMNQALLLSY TVGSLLGPSF TAMLMQNFSD NLLFIMIASV SFIYLLMLLR NAGHTPKPVA HV
|
| |