Gene EcE24377A_3840 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcE24377A_3840 
SymbolfrlA 
ID5590330 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli E24377A 
KingdomBacteria 
Replicon accessionNC_009801 
Strand
Start bp3814567 
End bp3815955 
Gene Length1389 bp 
Protein Length462 aa 
Translation table11 
GC content54% 
IMG OID640927464 
Productputative fructoselysine transporter 
Protein accessionYP_001464825 
Protein GI157157958 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0531] Amino acid transporters 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACTGCAA ACTCTCCCCT ACAACGTATT GGACAAGAAA AAGGTATCGC TATGGGAAGC 
CAGGAACTCC AACGCAAGCT CGGATTTTGG GCCGTTCTTG CAATCGCCGT CGGGACAACC
GTCGGCTCCG GTATTTTTGT ATCTGTGGGT GAAGTGGCAA AAGCAGCGGG CACGCCGTGG
CTTACGGTGC TCGCGTTTGT CATTGGCGGG TTAATTGTGA TCCCGCAAAT GTGCGTCTAT
GCGGAACTAT CCACCGCTTA TCCGGAAAAT GGCGCAGATT ATGTTTATCT GAAAAATGCC
GGAAGCCGAC CGCTGGCTTT CCTCTCCGGC TGGGCCAGCT TCTGGGCCAA CGATGCGCCG
TCATTGTCGA TTATGGCGCT GGCGATTGTC AGCAATCTTG GCTTTTTAAC GCCTATCGAT
CCGTTGCTCG GTAAATTTAT CGCCGCCGGA TTAATTATCG CCTTTATGTT GCTACACCTG
CGCTCCGTTG AAGGCGGCGC AGCGTTTCAG ACGCTAATTA CCATCGCCAA AATTATCCCG
TTCACTATCG TCATTGGCCT TGGGATCTTC TGGTTTAAAG CGGAGAATTT TGCCGCCCCT
ACCACCACTG CGATTGGCGC AACGGGCAGC TTTATGGCGC TGCTGGCGGG GATCTCTGCC
ACCAGTTGGT CGTATACCGG CATGGCCTCT ATCTGTTATA TGACCGGCGA AATTAAAAAC
CCCGGAAAAA CCATGCCACG AGCGCTGATT GGTTCCTGTC TGCTGGTTCT GGTGCTCTAC
ACCCTGCTGG CGCTGGTGAT TTCCGGCCTG ATGCCCTTCG ACAAACTCGC CAATTCTGAA
ACGCCGATTT CCGACGCCCT GACCTGGATC CCCGCACTCG GCAGCACCGC TGGGATCTTT
GTTGCCATCA CGGCGATGAT CGTCATTCTT GGTTCGCTTT CCAGCTGCGT GATGTACCAG
CCGCGGCTGG AATACGCGAT GGCGAAAGAC AACCTGTTCT TTAAATGCTT CGGCCATGTG
CATCCGAAAT ACAACACGCC GGATGTCTCC ATCATCCTGC AAGGGGCGCT GGGGATCTTC
TTCATCTTCG TTTCCGATCT CACCAGCCTG CTGGGTTATT TCACCCTGGT GATGTGTTTC
AAAAATACCC TCACCTTCGG CTCCATCATC TGGTGTCGTA AACGCGACGA TTACAAACCG
CTGTGGCGTA CTCCGGCTTT CGGGCTGATG ACCACCCTCG CCATTGCGTC AAGCCTCATT
CTGGTCGCCT CAACCTTTGT CTGGGCACCG ATTCCCGGCC TTATCTGCGC CGTCATCGTT
ATTGCTACTG GTCTGCCTGC TTACGCCTTC TGGGCGAAGC GTAGCCGCCA GCTCAACGCT
TTGTCGTAA
 
Protein sequence
MTANSPLQRI GQEKGIAMGS QELQRKLGFW AVLAIAVGTT VGSGIFVSVG EVAKAAGTPW 
LTVLAFVIGG LIVIPQMCVY AELSTAYPEN GADYVYLKNA GSRPLAFLSG WASFWANDAP
SLSIMALAIV SNLGFLTPID PLLGKFIAAG LIIAFMLLHL RSVEGGAAFQ TLITIAKIIP
FTIVIGLGIF WFKAENFAAP TTTAIGATGS FMALLAGISA TSWSYTGMAS ICYMTGEIKN
PGKTMPRALI GSCLLVLVLY TLLALVISGL MPFDKLANSE TPISDALTWI PALGSTAGIF
VAITAMIVIL GSLSSCVMYQ PRLEYAMAKD NLFFKCFGHV HPKYNTPDVS IILQGALGIF
FIFVSDLTSL LGYFTLVMCF KNTLTFGSII WCRKRDDYKP LWRTPAFGLM TTLAIASSLI
LVASTFVWAP IPGLICAVIV IATGLPAYAF WAKRSRQLNA LS