Gene EcolC_0243 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_0243 
Symbol 
ID6067757 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp281685 
End bp282944 
Gene Length1260 bp 
Protein Length419 aa 
Translation table11 
GC content56% 
IMG OID641599642 
Productmajor facilitator superfamily transporter 
Protein accessionYP_001723249 
Protein GI170018295 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTAAAAA TGAAACACTG TTGTAAAAAT GTGGTGATCC TCATGCCCGA ACCCGTAGCC 
GAACCCGCGC TAAACGGATT GCGCCTGAAT TTGCGCATTG TCTCCATTGT CATGTTTAAC
TTCGCCAGCT ACCTCACCAT CGGGTTGCCG CTCGCTGTAT TACCGGGCTA TGTCCATGAT
GTGATGGGCT TTAGCGCTTT CTGGGCAGGA TTGGTTATCA GCCTGCAATA TTTCGCCACC
TTGCTGAGCC GCCCTCATGC CGGACGTTAT GCCGATTTGC TGGGACCCAA AAAGATTGTC
GTCTTCGGTT TATGCGGCTG CTTTTTGAGC GGTCTGGGAT ATCTGACGGC AGGATTAACC
GCCAGTCTGC CCGTCATCAG CCTGTTATTA CTTTGCCTGG GACGCGTGAT CCTTGGGATT
GGGCAAAGTT TTGCCGGAAC GGGATCGACC CTGTGGGGCG TTGGCGTGGT TGGCTCGCTG
CATATCGGGC GGGTGATTTC GTGGAACGGC ATTGTCACTT ACGGGGCGAT GGCGATGGGT
GCGCCGTTAG GCGTCGTGTT TTATCACTGG GGCGGCTTGC AGGCGTTAGC GTTAATCATT
ATGGGCGTGG CGCTGGTGGC CATTTTGTTG GCGCTCCCGC GTCCGACGGT AAAAGCCAGT
AAAGGCAAAC CGCTGCCGTT TCGCGCGGTG CTTGGGCGCG TCTGGCTGTA CGGTATGGCA
CTGGCACTGG CTTCCGCCGG ATTTGGCGTT ATCGCCACCT TTATCACGCT GTTTTATGAC
GCTAAAGGTT GGGACGGTGC GGCTTTCGCG CTGACGCTGT TTAGCTGTGC GTTTGTCGGT
ACGCGTTTGT TATTCCCTAA CGGCATTAAC CGTATCGGCG GCTTAAACGT GGCGATGATT
TGCTTTAGCG TTGAGATAAT CGGCCTGCTA CTGGTTGGCG TGGCGACTAT GCCGTGGATG
GCGAAAATCG GCGTCTTACT GGCGGGGGCA GGGTTTTCGC TGGTGTTCCC GGCATTGGGT
GTAGTGGCGG TAAAAGCGGT TCCGCAGCAA AATCAGGGGG CGGCGCTGGC AACTTACACC
GTATTTATGG ATTTATCGCT TGGCGTGACT GGACCACTGG CTGGGCTGGT GATGAGCTGG
GCGGGCGTAC CGGTGATTTA TCTGGCGGCG GCGGGACTGG TCGCAATCGC GTTATTACTG
ACGTGGCGAT TAAAAAAACG GCCTCCGGAA CACGTCCCTG AGGCCGCCTC ATCATCTTAA
 
Protein sequence
MVKMKHCCKN VVILMPEPVA EPALNGLRLN LRIVSIVMFN FASYLTIGLP LAVLPGYVHD 
VMGFSAFWAG LVISLQYFAT LLSRPHAGRY ADLLGPKKIV VFGLCGCFLS GLGYLTAGLT
ASLPVISLLL LCLGRVILGI GQSFAGTGST LWGVGVVGSL HIGRVISWNG IVTYGAMAMG
APLGVVFYHW GGLQALALII MGVALVAILL ALPRPTVKAS KGKPLPFRAV LGRVWLYGMA
LALASAGFGV IATFITLFYD AKGWDGAAFA LTLFSCAFVG TRLLFPNGIN RIGGLNVAMI
CFSVEIIGLL LVGVATMPWM AKIGVLLAGA GFSLVFPALG VVAVKAVPQQ NQGAALATYT
VFMDLSLGVT GPLAGLVMSW AGVPVIYLAA AGLVAIALLL TWRLKKRPPE HVPEAASSS