Gene EcSMS35_3868 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_3868 
Symbol 
ID6146825 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp3936947 
End bp3938155 
Gene Length1209 bp 
Protein Length402 aa 
Translation table11 
GC content50% 
IMG OID641618694 
Productmajor facilitator family transporter 
Protein accessionYP_001745833 
Protein GI170682261 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID[TIGR00890] Oxalate/Formate Antiporter 


Plasmid Coverage information

Num covering plasmid clones40 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones69 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACACCTT CAAATTATCA GCGTACTCGC TGGCTGACAC TCATCGGTAC TATCATTACC 
CAGTTTGCTC TGGGGTCGGT TTATACCTGG AGTCTGTTTA ACGGTGCGCT TTCTGCCAAG
CTGGGGGCTC CTGTAAGCCA GGTTGCTTTC TCTTTCGGCT TGTTAAGTCT GGGGCTGGCA
ATTTCATCTT CTGTTGCGGG CAAATTGCAG GAACGTTTTG GTGTTAAACG CGTCACTATG
GCTTCCGGCA TTTTGCTGGG ATTAGGTTTC TTCCTGACAG CACATTCCAA CAACCTGATG
ATGCTGTGGT TAAGCGCCGG TGTGTTGGTT GGTCTGGCGG ATGGTGCGGG TTACCTGTTG
ACGCTCTCTA ACTGCGTGAA GTGGTTTCCG GAGCGTAAGG GACTTATCTC TGCTTTTGCT
ATCGGTTCTT ATGGTCTGGG CAGCCTGGGT TTTAAATTTA TCGACACGCA CCTGCTCGAA
ACGGTGGGTC TGGAAAAAAC CTTTGTTATT TGGGGCGCGA TTGTACTGGT GATGATTGTC
TTTGGCGCAA CGTTAATGAA AGATGCGCCG AAGCAGGAAG TGAAAACCAG CAATGGTGTG
GTGGAGAAGG ACTACACCCT GGCAGAGTCG ATGCGTAAAC CGCAGTACTG GATGTTAGCG
GTTATGTTCC TGACTGCGTG CATGAGTGGT CTGTATGTGA TTGGTGTAGC GAAAGATATC
GCTCAAAGTC TGGCGCATCT TGATGCAATT TCCGCAGCCA ATGCTGTGAC GGTTATTTCC
ATCGCCAACC TTTCTGGTCG TCTGGTGCTG GGCATTCTGT CTGACAAAAT CGCCCGTATC
CGTGTTATCA CCATTGGTCA GGTGATATCG CTGGTGGGTA TGGCGGCCCT GCTGTTTGCA
CCATTGAATG CAGTGACGTT CTTTGCAGCG ATTGCCTGCG TGGCGTTTAA CTTTGGCGGC
ACTATCACGG TGTTCCCGTC ACTGGTCAGT GAGTTCTTCG GCCTCAATAA CCTGGCGAAA
AACTACGGTG TGATTTATCT CGGTTTCGGT ATCGGCAGCA TTTGTGGGTC GATTATCGCC
TCACTGTTTG GCGGCTTCTA TGTGACTTTC TACGTCATTT TTGCCCTGCT GATTCTGTCT
CTGGCGCTTT CAACGACCAT TCGCCAGCCA GAGCAGAAAG TATTGCGTGA AGCGCATGGC
TCCCTTTAA
 
Protein sequence
MTPSNYQRTR WLTLIGTIIT QFALGSVYTW SLFNGALSAK LGAPVSQVAF SFGLLSLGLA 
ISSSVAGKLQ ERFGVKRVTM ASGILLGLGF FLTAHSNNLM MLWLSAGVLV GLADGAGYLL
TLSNCVKWFP ERKGLISAFA IGSYGLGSLG FKFIDTHLLE TVGLEKTFVI WGAIVLVMIV
FGATLMKDAP KQEVKTSNGV VEKDYTLAES MRKPQYWMLA VMFLTACMSG LYVIGVAKDI
AQSLAHLDAI SAANAVTVIS IANLSGRLVL GILSDKIARI RVITIGQVIS LVGMAALLFA
PLNAVTFFAA IACVAFNFGG TITVFPSLVS EFFGLNNLAK NYGVIYLGFG IGSICGSIIA
SLFGGFYVTF YVIFALLILS LALSTTIRQP EQKVLREAHG SL