Gene EcSMS35_0463 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_0463 
Symbol 
ID6144885 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp471333 
End bp472697 
Gene Length1365 bp 
Protein Length454 aa 
Translation table11 
GC content53% 
IMG OID641615357 
Productmajor facilitator transporter 
Protein accessionYP_001742564 
Protein GI170679618 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones50 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACGATT ATAAAATGAC GCCAGGTGAG CGGCGCGCGA CCTGGGGTTT AGGCACCGTT 
TTCTCGTTGC GCATGCTGGG CATGTTCATG GTTCTGCCGG TTCTGACCAC GTACGGCATG
GCTCTGCAAG GTGCCAGCGA AGCATTAATC GGTATTGCCA TTGGTATTTA TGGTCTGACT
CAGGCCGTTT TTCAGATTCC GTTTGGCCTG CTTTCAGACC GCATTGGTCG CAAACCATTA
ATTGTCGGTG GGCTGGCGGT GTTTGCCGCC GGTAGCGTTA TCGCTGCGCT CTCTGACTCC
ATCTGGGGAA TTATTCTGGG CCGGGCGCTA CAAGGCTCCG GTGCGATTGC CGCTGCCGTT
ATGGCGCTGC TTTCCGATCT TACGCGCGAA CAAAACCGCA CCAAAGCGAT GGCGTTTATC
GGCGTGAGCT TTGGCATTAC CTTTGCCATT GCGATGGTGC TTGGCCCGAT CATCACTCAC
AAACTTGGGC TGCACGCGCT GTTCTGGATG ATCGCTATTC TGGCAACAAC CGGCATTGCG
TTGACCATTT GGGTTGTGCC CAACAGTAGC ACTCACGTAC TTAATCGTGA GTCCGGAATG
GTGAAAGGCA GTTTCAGTAA AGTGCTGGCG GAACCGCGGT TGCTGAAACT CAACTTTGGC
ATTATGTGTC TACACATGCT TCTGATGTCT ACGTTTGTTG CCCTGCCCGG ACAGCTGGCT
GATGCGGGGT TCCCGGCGGC TGAACACTGG AAGGTCTATC TGGCGACGAT GCTAATCGCC
TTTGGCTCGG TCGTGCCTTT CATTATCTAC GCTGAAGTTA AGCGCAAAAT GAAGCAAGTC
TTTGTCTTCT GCGTCGGATT GATCGTGGTT GCGGAAATTG TGTTGTGGAA CGCACAAACG
CAGTTCTGGC AACTGGTGGT CGGCGTGCAG CTTTTCTTTG TAGCGTTTAA TTTGATGGAA
GCCCTTCTGC CTTCACTTAT CAGTAAAGAG TCGCCTGCAG GTTACAAAGG TACGGCGATG
GGTGTTTACT CCACCAGCCA GTTTCTTGGC GTGGCGATTG GCGGTTCGCT GGGCGGCTGG
ATTGACGGCA TGTTTGACGG TCAGGGCGTA TTTCTCGCTG GCGCAATGCT GGCCGCAGTG
TGGCTGGCAG TCGCCAGTAC CATGAAAGAA CCGCCGTATG TCAGCAGCTT GCGCATTGAA
ATCCCGGCGG ACATTGCCGC AAACGAAGCG TTAAAAGTAC GTTTGCTGGA AACTGAAGGC
GTCAAAGAAG TGTTGATTGC AGAAGAAGAA CATTCAGCGT ATGTGAAAAT CGACAGCAAA
GTGACGAATC GCTTTGAGGT TGAACAGGCA ATTCGCCAGG CATAA
 
Protein sequence
MNDYKMTPGE RRATWGLGTV FSLRMLGMFM VLPVLTTYGM ALQGASEALI GIAIGIYGLT 
QAVFQIPFGL LSDRIGRKPL IVGGLAVFAA GSVIAALSDS IWGIILGRAL QGSGAIAAAV
MALLSDLTRE QNRTKAMAFI GVSFGITFAI AMVLGPIITH KLGLHALFWM IAILATTGIA
LTIWVVPNSS THVLNRESGM VKGSFSKVLA EPRLLKLNFG IMCLHMLLMS TFVALPGQLA
DAGFPAAEHW KVYLATMLIA FGSVVPFIIY AEVKRKMKQV FVFCVGLIVV AEIVLWNAQT
QFWQLVVGVQ LFFVAFNLME ALLPSLISKE SPAGYKGTAM GVYSTSQFLG VAIGGSLGGW
IDGMFDGQGV FLAGAMLAAV WLAVASTMKE PPYVSSLRIE IPADIAANEA LKVRLLETEG
VKEVLIAEEE HSAYVKIDSK VTNRFEVEQA IRQA