Gene EcSMS35_2803 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_2803 
Symbol 
ID6146823 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp2883803 
End bp2884987 
Gene Length1185 bp 
Protein Length394 aa 
Translation table11 
GC content55% 
IMG OID641617672 
Productmajor facilitator family transporter 
Protein accessionYP_001744832 
Protein GI170682932 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.188499 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value0.0180883 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTAAAC CTAATCATGA GCTTAGCCCG GCACTGATCG TGCTGATGTC TATCGCCACC 
GGTCTGGCGG TAGCCAGTAA CTATTACGCC CAGCCATTAC TCGACACCAT CGCGCGTAAC
TTTTCCCTTT CCGCCAGTTC GGCGGGCTTT ATTGTTACCG CCGCGCAGTT GGGCTATGCC
GCAGGTCTGC TGTTTCTTGT TCCCCTCGGT GATATGTTTG AACGCCGCCG CCTGATTGTC
TCGATGACCT TACTGGCAGC GGGCGGTATG TTGATTACCG CCAGCAGTCA GTCGCTGGCG
ATGATGATCC TCGGTACGGC ATTAACCGGT TTATTCTCAG TCGTGGCACA AATTCTGGTT
CCGCTGGCAG CGACGCTGGC TTCACCGGAC AAACGCGGCA AAGTGGTCGG CACCATTATG
AGCGGGCTGC TGTTGGGGAT CTTGCTGGCA CGGACCGTTG CCGGATTGCT GGCGAGTCTC
GGTGGCTGGC GAACTGTCTT TTGGGTCGCT TCGGTATTAA TGGCACTGAT GGCGCTGGCG
TTATGGCGTG GTCTGCCACA AATGAAATCA GAAACCCACC TCAACTACCC ACAGTTGCTA
GGTTCTGTTT TCAGCATGTT TATCAGCGAT AAAATCCTGC GCACCCGCGC GTTGCTGGGC
TGCCTGACCT TTGCCAACTT CAGCATTCTC TGGACCTCAA TGGCCTTTTT GCTTGCCGCT
CCACCTTTTA ACTACAGCGA TGGCGTAATT GGTCTGTTCG GACTTGCAGG AGCTGCCGGG
GCGTTAGGCG CTCGTCCGGC GGGCGGTTTT GCCGATAAGG GCAAATCACA CCTCACCACA
ACTTTCGGCC TGCTGCTGCT GTTACTTTCA TGGCTGGCTA TCTGGTTTGG ACACACTTCT
GTACTGGCGC TGGTTATCGG CATCCTGGTG CTGGACCTCA CCGTGCAGGG CGTGCATATC
ACTAACCAGA CGGTAATTTA TCGAATACAC CCTGATGCGC GTAATCGCCT GACCGCAGGT
TACATGACCA GCTACTTTAT TGGCGGTGCC GCCGGTTCGC TAATTTCAGC CTCAGCCTGG
CAACATGGCG GTTGGGCTGG CGTTTGTCTG GCTGGCGCGA CGATTGCCCT GGTTAACTTA
CTGGTCTGGT GGCGAGGTTT TCATCGTCAG GAAGCCGCAA ATTAA
 
Protein sequence
MTKPNHELSP ALIVLMSIAT GLAVASNYYA QPLLDTIARN FSLSASSAGF IVTAAQLGYA 
AGLLFLVPLG DMFERRRLIV SMTLLAAGGM LITASSQSLA MMILGTALTG LFSVVAQILV
PLAATLASPD KRGKVVGTIM SGLLLGILLA RTVAGLLASL GGWRTVFWVA SVLMALMALA
LWRGLPQMKS ETHLNYPQLL GSVFSMFISD KILRTRALLG CLTFANFSIL WTSMAFLLAA
PPFNYSDGVI GLFGLAGAAG ALGARPAGGF ADKGKSHLTT TFGLLLLLLS WLAIWFGHTS
VLALVIGILV LDLTVQGVHI TNQTVIYRIH PDARNRLTAG YMTSYFIGGA AGSLISASAW
QHGGWAGVCL AGATIALVNL LVWWRGFHRQ EAAN