Gene EcE24377A_4042 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcE24377A_4042 
Symbol 
ID5588265 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli E24377A 
KingdomBacteria 
Replicon accessionNC_009801 
Strand
Start bp4024909 
End bp4026117 
Gene Length1209 bp 
Protein Length402 aa 
Translation table11 
GC content51% 
IMG OID640927662 
Productmajor facilitator family transporter 
Protein accessionYP_001465022 
Protein GI157156915 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID[TIGR00890] Oxalate/Formate Antiporter 


Plasmid Coverage information

Num covering plasmid clones35 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACACCTT CAAATTATCA GCGTACCCGC TGGCTGACAC TCATCGGTAC TATCATTACC 
CAGTTTGCAC TGGGGTCGGT TTATACCTGG AGCCTGTTTA ATGGCGCGCT TTCTGCCAAG
CTGGATGCGC CGGTAAGCCA GGTCGCTTTC TCTTTCGGCT TGTTAAGTCT GGGGCTGGCA
ATTTCGTCTT CTGTTGCGGG CAAATTGCAG GAACGTTTTG GCGTTAAACG CGTCACCATG
GCTTCCGGCA TTTTGCTGGG ATTAGGCTTC TTCCTGACCG CGCATTCCAA CAACCTGATG
ATGCTGTGGT TAAGCGCCGG TGTGCTGGTC GGTCTGGCGG ATGGCGCGGG TTACCTGCTG
ACGCTCTCTA ACTGCGTGAA GTGGTTCCCG GAGCGTAAAG GTCTGATCTC CGCGTTCGCT
ATCGGTTCTT ATGGTCTGGG TAGCCTGGGT TTCAAATTTA TCGACACGCA GCTGCTCGAA
ACGGTGGGTT TGGAAAAAAC ATTTGTGATT TGGGGAGCGA TTGCGCTGGT GATGATTGTT
TTCGGCGCAA CGTTAATGAA AGACGCACCA AAACAGGAAG TGAAAACCAG CAATGGTGTG
GTGGAGAAAG ATTACACGCT GGCAGAGTCG ATGCGTAAAC CGCAGTACTG GATGTTAGCG
GTAATGTTCC TGACTGCCTG CATGAGCGGC CTGTACGTGA TTGGGGTAGC GAAAGATATC
GCCCAAAGTC TGGCACACCT TGATGTGGTT TCCGCAGCCA ATGCAGTAAC GGTTATTTCC
ATCGCCAACC TTTCAGGTCG TCTGGTGCTG GGGATACTGT CTGACAAAAT TGCCCGTATC
CGTGTTATCA CCATTGGTCA GGTGATTTCG CTGGTGGGTA TGGCGGCCCT GCTGTTTGCA
CCATTAAATG CAGTGACGTT CTTTGCAGCG ATTGCCTGCG TGGCATTTAA CTTTGGCGGC
ACTATTACCG TCTTTCCGTC ACTGGTCAGT GAGTTCTTCG GCCTCAATAA CCTGGCGAAA
AACTACGGTG TGATTTACCT CGGTTTCGGT ATCGGTAGCA TTTTTGGTTC GATTATCGCC
TCACTGTTTG GCGGTTTCTA TGTGACCTTC TACGTGATTT TCGCCCTGCT GATTCTGTCA
TTGGCGCTTT CTACGACGAT TCGTCAGCCA GAGCAGAAAA TGTTGCGTGA GGCGCATGGC
TCCCTTTAA
 
Protein sequence
MTPSNYQRTR WLTLIGTIIT QFALGSVYTW SLFNGALSAK LDAPVSQVAF SFGLLSLGLA 
ISSSVAGKLQ ERFGVKRVTM ASGILLGLGF FLTAHSNNLM MLWLSAGVLV GLADGAGYLL
TLSNCVKWFP ERKGLISAFA IGSYGLGSLG FKFIDTQLLE TVGLEKTFVI WGAIALVMIV
FGATLMKDAP KQEVKTSNGV VEKDYTLAES MRKPQYWMLA VMFLTACMSG LYVIGVAKDI
AQSLAHLDVV SAANAVTVIS IANLSGRLVL GILSDKIARI RVITIGQVIS LVGMAALLFA
PLNAVTFFAA IACVAFNFGG TITVFPSLVS EFFGLNNLAK NYGVIYLGFG IGSIFGSIIA
SLFGGFYVTF YVIFALLILS LALSTTIRQP EQKMLREAHG SL