Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_2389 |
Symbol | |
ID | 6142736 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 2436995 |
End bp | 2438185 |
Gene Length | 1191 bp |
Protein Length | 396 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 641617262 |
Product | major facilitator transporter |
Protein accession | YP_001744434 |
Protein GI | 170683695 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 35 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 59 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAGAGA AGTTATGGAC GAAGGATTTT TGGGCAATAA CCATCATCAG TTTTATTATT TTCTTCGTCT TTTATGTTTT ACTAACATTG TTGCCAATTT ATATCTCAGA CCGCTTGCAT GCCTCTCCTG ATAAAGCAGG TTTGTTGGTG ACTTTATTTT TAATTGCCGC CATTGTTATT CGCCCCTTTG CCGGGCAATG GGTGGGTAAA TATTCGAATA AAACTATTCT GGGGCTCTCT TCTCTGGCCT TCTTGGTGGT CACTGCGCTG TATCCTTTTT GCCACTCAAT TGAATCACTG CTTTTTATTC GGGTGCTTCA TGGTATTACC TTCGGGGTTA TCACTACGGT AAAAGGAACG ATTTCCGCGC GGCTGATCCC GGCCTCCCGA CGTGGGGAGG GCATCAGTTT TTTCTCTCTG GCAATGGGGC TGGCAATGGT GGTCGGGCCG TGGATTGGCC TGAATATGGC GCGCTGGGAG GCCTTTAATA TGGCTTTCTG GTTATGCACC GGCGTGGCAG CGGTGGGGAT TATTCTGTCG CTGATTATGA CAGTGCCGCC GGTTATCAGC CATGCTGATG GTTCAACGCC GAAAATGGGC TTCGCCGCCA TGTTCGATCG CGCGGCGTTG CCATTTGCGA TGGTCACCTT TTTTATGACC TTTTCGTATG CCGGGGTTTC TGCTTTTCTG GCGCTTTATG CACGCGAACT TAACCTGATG TCGGCAGCCA GTAATTTCCT GCTTTGCTAC GCTATCTTCC TGATGATCTG CCGTACCTTT ACCGGCAATG TCTGCGACAA AAAAGGACCG AAATATGTGG TTTACCCTTG CCTTGTGTTC TTTACGGTTG GACTGGTGGT TCTCGGCTAC ACCCAGGGAA GCATAATGAT GGTTGTTTCT GGCGCGTTGA TTGGTATCGG GTATGGTTCC GTGACGCCAG TTTTTCAGAC GCAGATTATC AGTTCAGTGG AACCGCATAA AATCGGTGTC GCAAACTCCC TCTTCTTCAA TGCGATGGAT GCAGGCCTGG CGCTGGGAGC CTGTGTGATG GGGATGATGG TTGCACATAC TGGCTACCGA ATGATTTATC TGCTGGGTGC ACTATTAGTG GTAGTGGCTG GTGGAGTCTA TGAGCTGCAA ATGAAGGGCA AAAGCGGTGT CATGCTAGTA GCGGCAAAAG AAATTCATTA A
|
Protein sequence | MKEKLWTKDF WAITIISFII FFVFYVLLTL LPIYISDRLH ASPDKAGLLV TLFLIAAIVI RPFAGQWVGK YSNKTILGLS SLAFLVVTAL YPFCHSIESL LFIRVLHGIT FGVITTVKGT ISARLIPASR RGEGISFFSL AMGLAMVVGP WIGLNMARWE AFNMAFWLCT GVAAVGIILS LIMTVPPVIS HADGSTPKMG FAAMFDRAAL PFAMVTFFMT FSYAGVSAFL ALYARELNLM SAASNFLLCY AIFLMICRTF TGNVCDKKGP KYVVYPCLVF FTVGLVVLGY TQGSIMMVVS GALIGIGYGS VTPVFQTQII SSVEPHKIGV ANSLFFNAMD AGLALGACVM GMMVAHTGYR MIYLLGALLV VVAGGVYELQ MKGKSGVMLV AAKEIH
|
| |