Gene EcE24377A_0073 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcE24377A_0073 
SymbolsetA 
ID5590260 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli E24377A 
KingdomBacteria 
Replicon accessionNC_009801 
Strand
Start bp79311 
End bp80489 
Gene Length1179 bp 
Protein Length392 aa 
Translation table11 
GC content49% 
IMG OID640923804 
Productmajor facilitator superfamily sugar transporter 
Protein accessionYP_001461241 
Protein GI157156264 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID[TIGR00899] sugar efflux transporter 


Plasmid Coverage information

Num covering plasmid clones43 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATCTGGA TAATGACGAT GGCTCGCCGT ATGAACGGTG TTTACGCGGC ATTTATGCTG 
GTCGCTTTTA TGATGGGGGT GGCCGGGGCG CTACAGGCTC CTACATTGAG CTTATTTCTG
AGTCGTGAGG TTGGCGCGCA ACCTTTCTGG ATCGGACTCT TTTATACGGT GAATGCTATT
GCTGGGATCG GCGTAAGCCT CTGGTTGGCA AAACGTTCTG ACAGTCAGGG CGATCGGCGA
AAACTGATTA TATTTTGCTG TTTGATGGCT ATCGGCAATG CGCTATTGTT TGCATTTAAT
CGTCATTATC TGACGCTTAT CACCTGTGGT GTGCTTCTGG CATCTCTGGC CAATACGGCA
ATGCCACAGT TATTTGCTCT GGCGCGGGAA TATGCGGATA ACTCGGCGCG AGAAGTGGTG
ATGTTTAGCT CGGTGATGCG TGCGCAGCTT TCTCTGGCAT GGGTTATCGG TCCACCGTTG
GCCTTTATGC TGGCGTTGAA TTACGGCTTT ACGGTGATGT TTTCGATTGC CGCCGGGATA
TTCACACTCA GTCTGGTATT GATTGCATTT ATGCTTCCGT CTGTGGCGCG GGTAGAACTG
CCGTCGGAAA ATGCTTTATC AATGCAAGGT GGCTGGCAGG ATAGTAACGT ACGGATGTTA
TTTGTCGCCT CGACGTTAAT GTGGACCTGC AACACCATGT ACATTATTGA TATGCCGTTG
TGGATCAGTA GCGAGTTAGG ATTGCCAGAC AAACTGGCGG GTTTCCTGAT GGGGACGGCA
GCTGGACTGG AAATACCAGC AATGATTCTG GCTGGCTACT ATGTCAAACG TTATGGTAAG
CGGCGAATGA TGGTCATAGC AGTGGCGGCA GGAGTACTGT TTTACACCGG ATTGATTTTC
TTTCATAGCC GTATGGCGTT GATGACGCTG CAACTTTTTA ACGCTGTATT TATCGGCATT
GTTGCGGGTA TTGGGATGCT ATGGTTTCAG GATTTAATGC CTGGAAGAGC GGGGGCAGCT
ACCACCTTAT TTACTAACAG TATTTCTACC GGGGTAATTC TGGCTGGCGT TATTCAGGGA
GCAATTGCAC AAAGTTGGGG GCACTTTGCT GTCTACTGGG TAATTGCGGT TATTTCTGTT
GTCGCATTAT TTTTAACCGC AAAGGTTAAA GACGTTTGA
 
Protein sequence
MIWIMTMARR MNGVYAAFML VAFMMGVAGA LQAPTLSLFL SREVGAQPFW IGLFYTVNAI 
AGIGVSLWLA KRSDSQGDRR KLIIFCCLMA IGNALLFAFN RHYLTLITCG VLLASLANTA
MPQLFALARE YADNSAREVV MFSSVMRAQL SLAWVIGPPL AFMLALNYGF TVMFSIAAGI
FTLSLVLIAF MLPSVARVEL PSENALSMQG GWQDSNVRML FVASTLMWTC NTMYIIDMPL
WISSELGLPD KLAGFLMGTA AGLEIPAMIL AGYYVKRYGK RRMMVIAVAA GVLFYTGLIF
FHSRMALMTL QLFNAVFIGI VAGIGMLWFQ DLMPGRAGAA TTLFTNSIST GVILAGVIQG
AIAQSWGHFA VYWVIAVISV VALFLTAKVK DV