Gene EcHS_A3672 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A3672 
Symbol 
ID5594384 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp3656786 
End bp3658045 
Gene Length1260 bp 
Protein Length419 aa 
Translation table11 
GC content56% 
IMG OID640922788 
Productmajor facilitator superfamily transporter 
Protein accessionYP_001460268 
Protein GI157162950 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones69 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGGTAAAAA TGAAACACTG TTGTAAAAAT GTGGTGATCC TCATGCCCGA ACCCGTAGCC 
GAACCCGCGC TAAACGGATT GCGCCTGAAT TTGCGCATTG TCTCTATAGT CATGTTTAAC
TTCGCCAGCT ACCTCACCAT CGGGTTGCCG CTCGCTGTAT TACCGGGCTA TGTCCATGAT
GTGATGGGCT TTAGCGCCTT CTGGGCAGGA TTGGTTATCA GCCTGCAATA TTTCGCCACC
TTGCTGAGCC GCCCTCATGC CGGACGTTAC GCCGATTCGC TGGGACCCAA AAAGATTGTC
GTCCTCGGTT TATGCGGCTG CTTTTTGAGC GGTCTGGGGT ATCTGACGGC AGGATTAACC
GCCAGTCTGC CTGTCATCAG CCTGTTATTA CTTTGCCTGG GGCGCGTCAT CCTTGGGATT
GGGCAAAGTT TTGCCGGAAC GGGATCGACC CTATGGGGCG TTGGCGTGGT TGGCTCGCTG
CATATCGGGC GGGTGATTTC GTGGAACGGC ATTGTCACTT ACGGGGCGAT GGCGATGGGT
GCGCCGTTAG GCGTCGTGTT TTATCACTGG GGCGGCTTGC AGGCGTTAGC GTTAATCATT
ATGGGCGTGG CGCTGGTGGC CATTTTGTTG GCGATCCCGC GTCCGACGGT AAAAGCCAGT
AAAGGCAAAC CGCTGCCGTT TCGCGCGGTG CTTGGGCGCG TCTGGCTGTA CGGTATGGCG
CTGGCACTGG CTTCCGCCGG ATTTGGCGTC ATCGCCACCT TTATCACGCT GTTTTATGAC
GCTAAAGGTT GGGACGGTGC GGCTTTCGCG CTGACGCTGT TTAGCTGTGC GTTTGTCGGT
ACGCGTTTGT TATTCCCTAA CGGCATTAAC CGTATCGGTG GCTTAAACGT AGCGATGATT
TGCTTTAGCG TTGAGATAAT CGGCCTGCTA CTGGTTGGCG TGGCGACTAT GCCGTGGATG
GCGAAAATCG GCGTCTTACT GGCGGGGGCC GGGTTTTCGC TGGTGTTCCC GGCATTGGGT
GTAGTGGCGG TAAAAGCGGT TCCGCAGCAA AATCAGGGGG CGGCGCTGGC AACTTACACC
GTATTTATGG ATTTATCGCT TGGCGTGACT GGACCACTGG CTGGGCTGGT GATGAGCTGG
GCGGGCGTAC CGGTGATTTA TCTGGCGGCG GCGGGACTGG TCGCAATCGC GTTATTACTG
ACGTGGCGAT TAAAAAAACG GCCTCCGGAA CACGTCCCTG AGGCCGCCTC ATCATCTTAA
 
Protein sequence
MVKMKHCCKN VVILMPEPVA EPALNGLRLN LRIVSIVMFN FASYLTIGLP LAVLPGYVHD 
VMGFSAFWAG LVISLQYFAT LLSRPHAGRY ADSLGPKKIV VLGLCGCFLS GLGYLTAGLT
ASLPVISLLL LCLGRVILGI GQSFAGTGST LWGVGVVGSL HIGRVISWNG IVTYGAMAMG
APLGVVFYHW GGLQALALII MGVALVAILL AIPRPTVKAS KGKPLPFRAV LGRVWLYGMA
LALASAGFGV IATFITLFYD AKGWDGAAFA LTLFSCAFVG TRLLFPNGIN RIGGLNVAMI
CFSVEIIGLL LVGVATMPWM AKIGVLLAGA GFSLVFPALG VVAVKAVPQQ NQGAALATYT
VFMDLSLGVT GPLAGLVMSW AGVPVIYLAA AGLVAIALLL TWRLKKRPPE HVPEAASSS