Gene Information |
Locus tag | EcSMS35_2803 |
Symbol | |
ID | 6146823 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 2883803 |
End bp | 2884987 |
Gene Length | 1185 bp |
Protein Length | 394 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 641617672 |
Product | major facilitator family transporter |
Protein accession | YP_001744832 |
Protein GI | 170682932 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
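
The coordinate and length fields above agree with one another, and the arithmetic can be checked with a short sketch. It assumes the Start bp and End bp values are 1-based and inclusive, and that the protein length excludes the terminal stop codon; the record does not state either convention. The function names are illustrative and not part of any database API.

```python
# Values copied from the record above.
START_BP = 2883803
END_BP = 2884987


def gene_length(start: int, end: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residues encoded by an unspliced CDS, not counting the stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1185 394
```

Under these assumptions the result matches the Gene Length (1185 bp) and Protein Length (394 aa) fields.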
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.188499 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 32 |
Fosmid unclonability p-value | 0.0180883 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACTAAAC CTAATCATGA GCTTAGCCCG GCACTGATCG TGCTGATGTC TATCGCCACC GGTCTGGCGG TAGCCAGTAA CTATTACGCC CAGCCATTAC TCGACACCAT CGCGCGTAAC TTTTCCCTTT CCGCCAGTTC GGCGGGCTTT ATTGTTACCG CCGCGCAGTT GGGCTATGCC GCAGGTCTGC TGTTTCTTGT TCCCCTCGGT GATATGTTTG AACGCCGCCG CCTGATTGTC TCGATGACCT TACTGGCAGC GGGCGGTATG TTGATTACCG CCAGCAGTCA GTCGCTGGCG ATGATGATCC TCGGTACGGC ATTAACCGGT TTATTCTCAG TCGTGGCACA AATTCTGGTT CCGCTGGCAG CGACGCTGGC TTCACCGGAC AAACGCGGCA AAGTGGTCGG CACCATTATG AGCGGGCTGC TGTTGGGGAT CTTGCTGGCA CGGACCGTTG CCGGATTGCT GGCGAGTCTC GGTGGCTGGC GAACTGTCTT TTGGGTCGCT TCGGTATTAA TGGCACTGAT GGCGCTGGCG TTATGGCGTG GTCTGCCACA AATGAAATCA GAAACCCACC TCAACTACCC ACAGTTGCTA GGTTCTGTTT TCAGCATGTT TATCAGCGAT AAAATCCTGC GCACCCGCGC GTTGCTGGGC TGCCTGACCT TTGCCAACTT CAGCATTCTC TGGACCTCAA TGGCCTTTTT GCTTGCCGCT CCACCTTTTA ACTACAGCGA TGGCGTAATT GGTCTGTTCG GACTTGCAGG AGCTGCCGGG GCGTTAGGCG CTCGTCCGGC GGGCGGTTTT GCCGATAAGG GCAAATCACA CCTCACCACA ACTTTCGGCC TGCTGCTGCT GTTACTTTCA TGGCTGGCTA TCTGGTTTGG ACACACTTCT GTACTGGCGC TGGTTATCGG CATCCTGGTG CTGGACCTCA CCGTGCAGGG CGTGCATATC ACTAACCAGA CGGTAATTTA TCGAATACAC CCTGATGCGC GTAATCGCCT GACCGCAGGT TACATGACCA GCTACTTTAT TGGCGGTGCC GCCGGTTCGC TAATTTCAGC CTCAGCCTGG CAACATGGCG GTTGGGCTGG CGTTTGTCTG GCTGGCGCGA CGATTGCCCT GGTTAACTTA CTGGTCTGGT GGCGAGGTTT TCATCGTCAG GAAGCCGCAA ATTAA
|
Protein sequence | MTKPNHELSP ALIVLMSIAT GLAVASNYYA QPLLDTIARN FSLSASSAGF IVTAAQLGYA AGLLFLVPLG DMFERRRLIV SMTLLAAGGM LITASSQSLA MMILGTALTG LFSVVAQILV PLAATLASPD KRGKVVGTIM SGLLLGILLA RTVAGLLASL GGWRTVFWVA SVLMALMALA LWRGLPQMKS ETHLNYPQLL GSVFSMFISD KILRTRALLG CLTFANFSIL WTSMAFLLAA PPFNYSDGVI GLFGLAGAAG ALGARPAGGF ADKGKSHLTT TFGLLLLLLS WLAIWFGHTS VLALVIGILV LDLTVQGVHI TNQTVIYRIH PDARNRLTAG YMTSYFIGGA AGSLISASAW QHGGWAGVCL AGATIALVNL LVWWRGFHRQ EAAN
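
As a rough consistency check between the two sequences, the sketch below translates nucleotide blocks copied from the gene sequence and computes a GC fraction. It is an illustration, not the pipeline that produced this record. It assumes the space-separated 10-character blocks are display formatting only, and it uses the codon-to-amino-acid assignments of translation table 11, which match the standard code. Alternative start codons are not handled, which does not matter here because the gene starts with ATG. All names are hypothetical.

```python
# Codon table built from the usual TCAG ordering of the genetic code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Drop the whitespace between 10-character blocks and upper-case."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at a stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three blocks and last 15 bases of the gene sequence above.
FIRST_BLOCKS = "ATGACTAAAC CTAATCATGA GCTTAGCCCG"
LAST_BASES = "GAAGCCGCAA ATTAA"

print(translate(FIRST_BLOCKS))  # MTKPNHELSP
print(translate(LAST_BASES))    # EAAN
```

The two outputs match the first and last residues of the protein sequence listed above. Passing the full gene sequence to `gc_fraction` would give the value to compare against the GC content field.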
|
| |