Gene EcSMS35_0425 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_0425 
SymbolaraJ 
ID6146639 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp433575 
End bp434843 
Gene Length1269 bp 
Protein Length422 aa 
Translation table11 
GC content51% 
IMG OID641615321 
ProductMFS transport protein AraJ 
Protein accessionYP_001742528 
Protein GI170683106 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID[TIGR00880] Multidrug resistance protein 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.87857 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones52 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGGCCTCGC TGGTCGTTAT CCTGCAAGCT ATCACTTTAT TGGCTACGGT GATTAGTAGC 
CGTTCTGGTG GTTGTGATGG TGGTATGAAA AAAGTCATTT TATCTTTGGC TCTGGGCACG
TTTGGTTTGG GGATGGCCGA ATTTGGCATT ATGGGCGTGC TCACGGAGCT GGCGCATAAC
GTAGGAATTT CGATTCCTGC TGCCGGGCAT ATGATCTCGT ATTATGCGCT GGGGGTGGTG
GTCGGTGCGC CAATCATCGC ACTCTTTTCC AGCCGCTACT CACTCAAACA TATCTTATTG
TTTCTGGTGG CGTTGTGCGT CATTGGCAAC GCCATGTTCA CGCTCTCTTC GTCTTACCTG
ATGCTCGCCA TTGGTCGGCT GGTATCCGGC TTTCCGCATG GCGCATTTTT TGGCGTCGGC
GCGATCGTGT TATCAAAAAT TATCAAACCC GGAAAAGTCA CCGCCGCCGT GGCGGGGATG
GTTTCCGGGA TGACAGTCGC CAATTTGCTG GGCATTCCGC TGGGAACGTA TTTAAGTCAG
GAATTTAGCT GGCGTTACAC CTTTTTATTG ATCGCTGTTT TTAATATTGT GGTGATGGCA
TCGGTCTATT TTTGGGTGCC GGATATTCGC GACGAAGCGA AAGGAAAGCT GCGCGAACAA
TTTCACTTTT TACGCAGCCC GGCCCCGTGG TTAATTTTCG CCGCCACCAT GTTTGGCAAC
GCAGGTGTAT TTGCCTGGTT CAGCTACGTA AAGCCATACA TGATGTTTAT TTCCGGTTTT
TCGGAAACGG CGATGACCTT TATTATGATG TTAGTGGGGC TAGGGATGGT GCTGGGGAAT
GTGCTAAGTG GCCGAATTTC AGGACGTTAT TCACCACTGC GCATTGCAGC AGTGACTGAC
TTTATCATTG TACTGGCACT GCTGATGCTC TTTTTCTGCG GCGGCATGAA AATAACGTCG
CTTATTTTTG CTTTTATTTG TTGCGCGGGA TTATTTGCCC TTTCAGCACC TCTGCAAATA
TTGTTACTGC AAAACGCCAA AGGCGGAGAG TTATTAGGTG CCGCAGGTGG GCAAATAGCG
TTTAACCTCG GTAGCGCCGT CGGCGCATAT TGCGGTGGTA TGATGCTGAC GCTGGGGCTG
GCATATAATT ACGTGGCGCT GCCTGCCGCC CTGCTTTCGT TTGCTGCGAT GTCGTCGTTG
CTGCTGTATG GTCGCTATAA GCGCCAGCAA GCGGCGGATA GTCCGGTGCT GGCGAAACCA
CTGGGGTAG
 
Protein sequence
MASLVVILQA ITLLATVISS RSGGCDGGMK KVILSLALGT FGLGMAEFGI MGVLTELAHN 
VGISIPAAGH MISYYALGVV VGAPIIALFS SRYSLKHILL FLVALCVIGN AMFTLSSSYL
MLAIGRLVSG FPHGAFFGVG AIVLSKIIKP GKVTAAVAGM VSGMTVANLL GIPLGTYLSQ
EFSWRYTFLL IAVFNIVVMA SVYFWVPDIR DEAKGKLREQ FHFLRSPAPW LIFAATMFGN
AGVFAWFSYV KPYMMFISGF SETAMTFIMM LVGLGMVLGN VLSGRISGRY SPLRIAAVTD
FIIVLALLML FFCGGMKITS LIFAFICCAG LFALSAPLQI LLLQNAKGGE LLGAAGGQIA
FNLGSAVGAY CGGMMLTLGL AYNYVALPAA LLSFAAMSSL LLYGRYKRQQ AADSPVLAKP
LG