Gene ECH74115_3303 details

Gene Information

Locus tag: ECH74115_3303
Symbol: fruA
ID: 6969714
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 3036010
End bp: 3037701
Gene Length: 1692 bp
Protein Length: 563 aa
Translation table: 11
GC content: 56%
IMG OID: 643387115
Product: PTS system fructose-specific transporter subunits IIBC
Protein accession: YP_002271579
Protein GI: 209399216
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1299] Phosphotransferase system, fructose-specific IIC component
TIGRFAM ID: [TIGR00829] PTS system, fructose-specific, IIB component
            [TIGR01427] PTS system, fructose subfamily, IIC component
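
The start and end coordinates, gene length, and protein length listed above agree with each other, which can be checked with a little arithmetic. The sketch below assumes 1-based, inclusive coordinates (the record does not state its convention) and an unspliced CDS whose last codon is a stop codon. The function names are illustrative only.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose final codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1  # drop the stop codon


gene_len = span_length(3036010, 3037701)
print(gene_len)                  # 1692
print(protein_length(gene_len))  # 563
```

Both values match the Gene Length and Protein Length fields of this record.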


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00000185462
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 40
Fosmid unclonability p-value: 0.0412647
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAACGC TGCTGATTAT TGACGCTAAT CTCGGTCAGG CACGCGCCTA TATGGCGAAG 
ACCCTGCTGG GCGCGGCGGC GCGAAAAGCA AAACTGGAAA TCATCGACAA TCCGAACGAC
GCAGAAATGG CGATTGTTCT CGGTGATTCC ATCCCGAATG ACAGCGCGCT GAACGGTAAA
AATGTCTGGC TGGGCGATAT TTCCCGGGCA GTTGCGCACC CTGAGCTGTT CCTGAGTGAA
GCCAAAGGCC ATGCGAAACC TTACACTGCG CCGGTCGCTG CGACAGCACC GGTTGCCGCC
AGCGGTCCGA AACGCGTAGT TGCGGTGACT GCTTGCCCGA CTGGCGTAGC ACACACCTTT
ATGGCGGCTG AAGCCATTGA AACCGAAGCG AAAAAACGTG GCTGGTGGGT GAAAGTTGAA
ACCCGTGGTT CTGTTGGCGC GGGTAATGCA ATCACTCCCG AAGAAGTAGC CGCAGCGGAT
CTGGTGATTG TGGCGGCAGA TATCGAAGTG GATCTGGCGA AATTTGCTGG TAAACCGATG
TATCGCACCT CTACCGGTCT GGCGCTGAAG AAAACCGCGC AGGAACTGGA TAAAGCGGTT
GCTGAAGCAA CGCCGTATGA ACCGGCGGGC AAAGCTCAAA CGGCGACCTC TGAAGGTAAG
AACGAGAGTG CAGGCGCATA CCGTCACTTG CTGACGGGCG TCTCTTACAT GCTGCCGATG
GTCGTTGCAG GTGGTCTGTG TATCGCGCTT TCTTTTGCTT TTGGTATCGA AGCGTTTAAA
GAGCCGGGTA CGTTGGCAGC GGCGCTGATG CAGATTGGTG GTGGTTCAGC CTTTGCGCTG
ATGGTGCCGG TACTGGCAGG TTATATTGCC TTCTCCATTG CCGATCGTCC GGGTCTCACG
CCGGGTCTGA TTGGCGGTAT GCTGGCGGTC AGCACCGGTT CTGGCTTCAT TGGCGGTATT
ATTGCGGGCT TCCTGGCTGG TTACATTGCG AAGTTAATCA GTACGCAACT GAAACTGCCA
CAGAGTATGG AGGCGCTGAA ACCGATCCTG ATCATTCCGC TAATTTCCAG TCTGGTGGTC
GGTCTGGCGA TGATCTACCT GATCGGTAAA CCGGTTGCTG GCATTCTCGA AGGGTTGACT
CACTGGCTGC AGACCATGGG GACTGCGAAT GCGGTTCTGC TGGGGGCGAT CCTCGGTGGC
ATGATGTGTA CTGACATGGG CGGTCCGGTA AATAAAGCAG CGTACGCATT CGGTGTGGGT
CTGCTGAGTA CTCAAACCTA TGGCCCGATG GCGGCGATTA TGGCGGCAGG TATGGTGCCA
CCGCTGGCAA TGGGTCTGGC AACAATGGTG GCGCGTCGCA AATTTGACAA AGCGCAGCAG
GAAGGTGGCA AAGCCGCTCT GGTTCTTGGT CTGTGCTTTA TTTCGGAAGG TGCAATTCCT
TTTGCTGCTC GTGATCCGAT GCGTGTGCTG CCGTGTTGTA TCGTTGGTGG GGCGCTGACT
GGCGCAATCT CAATGGCGAT TGGTGCGAAA CTGATGGCAC CGCACGGTGG TCTGTTTGTT
CTGCTGATCC CTGGCGCTAT TACGCCGGTA CTGGGTTACC TGGTAGCAAT TATTGCCGGT
ACGCTGGTGG CGGGTTTGGC CTATGCCTTC CTGAAACGTC CGGAAGTGGA CGCAGTAGCG
AAAGCCGCGT AA
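
The gene sequence is printed in blocks of ten bases, so it needs to be cleaned before any statistic such as GC content can be computed. The following is a minimal sketch of that step. The helper names are hypothetical, and only the first row of the listing is used as sample input, so its GC percentage need not equal the 56% reported for the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First row of the gene sequence, copied as printed (blocks of 10 bases).
first_row = "ATGAAAACGC TGCTGATTAT TGACGCTAAT CTCGGTCAGG CACGCGCCTA TATGGCGAAG"
seq = clean_sequence(first_row)
print(len(seq))  # 60
print(round(gc_percent(seq), 1))
```

Passing the full listing through `clean_sequence` and then `gc_percent` gives the figure to compare against the GC content field.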
 
Protein sequence
MKTLLIIDAN LGQARAYMAK TLLGAAARKA KLEIIDNPND AEMAIVLGDS IPNDSALNGK 
NVWLGDISRA VAHPELFLSE AKGHAKPYTA PVAATAPVAA SGPKRVVAVT ACPTGVAHTF
MAAEAIETEA KKRGWWVKVE TRGSVGAGNA ITPEEVAAAD LVIVAADIEV DLAKFAGKPM
YRTSTGLALK KTAQELDKAV AEATPYEPAG KAQTATSEGK NESAGAYRHL LTGVSYMLPM
VVAGGLCIAL SFAFGIEAFK EPGTLAAALM QIGGGSAFAL MVPVLAGYIA FSIADRPGLT
PGLIGGMLAV STGSGFIGGI IAGFLAGYIA KLISTQLKLP QSMEALKPIL IIPLISSLVV
GLAMIYLIGK PVAGILEGLT HWLQTMGTAN AVLLGAILGG MMCTDMGGPV NKAAYAFGVG
LLSTQTYGPM AAIMAAGMVP PLAMGLATMV ARRKFDKAQQ EGGKAALVLG LCFISEGAIP
FAARDPMRVL PCCIVGGALT GAISMAIGAK LMAPHGGLFV LLIPGAITPV LGYLVAIIAG
TLVAGLAYAF LKRPEVDAVA KAA
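
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below illustrates this on the first and last blocks of the gene. It assumes the codon-to-amino-acid assignments of the standard genetic code, which translation table 11 shares (table 11 differs in which codons may act as start codons; this gene begins with ATG, so that difference does not matter here). The `translate` helper is illustrative, not the tool that produced this record.

```python
from itertools import product

# Standard codon assignments, in TCAG order (also used by translation table 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(blocked_dna: str) -> str:
    """Translate a blocked DNA listing in frame 1, stopping at the first stop codon."""
    seq = "".join(blocked_dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGAAAACGC TGCTGATTAT TGACGCTAAT"))  # MKTLLIIDAN
print(translate("AAAGCCGCGT AA"))                     # KAA
```

The two outputs match the first ten and last three residues of the protein sequence above.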