Gene EcSMS35_2314 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_2314 
SymbolfruA 
ID6144771 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp2345377 
End bp2347068 
Gene Length1692 bp 
Protein Length563 aa 
Translation table11 
GC content56% 
IMG OID641617188 
ProductPTS system fructose-specific transporter subunits IIBC 
Protein accessionYP_001744361 
Protein GI170681268 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1299] Phosphotransferase system, fructose-specific IIC component
[COG1445] Phosphotransferase system fructose-specific component IIB 
TIGRFAM ID[TIGR00829] PTS system, fructose-specific, IIB component
[TIGR01427] PTS system, fructose subfamily, IIC component 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00422844 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value0.00031102 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAAAACGC TGCTGATTAT TGACGCTAAT CTCGGTCAGG CACGCGCCTA TATGGCGAAG 
ACCCTGCTGG GCGCGGCGGC GCGAAAAGCA AAACTGGAAA TCATCGACAA TCCGAACGAC
GCAGAAATGG CGATTGTTCT CGGTGATTCC ATCCCGAATG ACAGCGCGCT GAACGGTAAA
AATGTCTGGC TGGGCGATAT TTCCCGGGCA GTTGCGCACC CTGAGCTGTT CCTGAGTGAA
GCCAAAGGCC ATGCGAAACC TTACACTGCG CCGGTCGCTG CGACAGCACC GGTTGCCGCC
AGCGGTCCGA AACGCGTAGT TGCGGTGACT GCTTGCCCGA CTGGCGTAGC ACACACCTTT
ATGGCGGCTG AAGCCATTGA AACCGAAGCG AAAAAACGTG GCTGGTGGGT GAAAGTTGAA
ACCCGTGGTT CTGTTGGCGC GGGTAATGCA ATCACCCCTG AAGAAGTCGC AGCAGCGGAT
CTGGTGATTG TGGCGGCAGA TATCGAAGTG GATCTGGCGA AATTTGCTGG TAAACCGATG
TATCGCACCT CTACCGGTCT GGCGCTGAAG AAAACTGCGC AGGAACTGGA TAAAGCGGTT
GCTGAAGCAA CGCCGTATGA ACCAGCGGGC AAAGCTCAAA CAGCGACCAC TGAAGGTAAG
AAAGAGAGTG CAGGCGCTTA CCGTCACTTG CTGACGGGCG TCTCTTACAT GCTGCCGATG
GTCGTTGCAG GTGGTCTGTG TATCGCGCTT TCTTTTGCTT TTGGTATCGA AGCGTTTAAA
GAGCCGGGTA CGTTGGCTGC GGCGCTGATG CAGATTGGTG GTGGTTCAGC CTTTGCGTTG
ATGGTGCCGG TACTGGCAGG TTATATTGCC TTTTCCATTG CCGATCGTCC GGGTCTCACG
CCGGGTCTGA TTGGCGGTAT GCTGGCGGTC AGCACCGGTT CTGGCTTCAT TGGCGGTATT
ATTGCGGGCT TCCTGGCTGG TTACATTGCG AAGTTAATCA GTACGCAATT GAAACTGCCA
CAGAGTATGG AGGCGCTGAA ACCGATTCTG ATCATTCCGC TAATTTCCAG TCTGGTGGTC
GGTCTGGCGA TGATCTACCT GATCGGTAAA CCGGTTGCTG GCATTCTCGA AGGGTTGACT
CACTGGCTGC AGACCATGGG GACTGCGAAT GCGGTTCTGC TGGGGGCGAT CCTCGGTGGC
ATGATGTGTA CTGACATGGG CGGTCCGGTA AACAAAGCAG CGTACGCATT CGGTGTGGGT
CTGCTGAGTA CTCAAACCTA TGGCCCGATG GCGGCGATTA TGGCGGCAGG TATGGTGCCA
CCGCTGGCAA TGGGCCTGGC AACAATGGTG GCGCGTCGCA AATTCGACAA GGCGCAGCAG
GAAGGGGGCA AAGCTGCTCT GGTACTGGGA CTGTGCTTTA TTTCGGAAGG TGCAATTCCG
TTTGCTGCCC GTGATCCGAT GCGTGTGCTG CCGTGCTGTA TCGTGGGTGG GGCGCTGACT
GGCGCAATCT CAATGGCGAT TGGTGCGAAA CTGATGGCAC CGCACGGTGG TCTGTTTGTT
CTGCTGATCC CTGGCGCGAT TACGCCGGTA TTGGGTTACC TGGTAGCCAT TATTGCCGGT
ACGCTGGTGG CGGGGTTGGC CTATGCCTTC CTGAAACGTC CGGAAGTGGA CGCAGTAGCG
AAAGCAGCGT AA
 
Protein sequence
MKTLLIIDAN LGQARAYMAK TLLGAAARKA KLEIIDNPND AEMAIVLGDS IPNDSALNGK 
NVWLGDISRA VAHPELFLSE AKGHAKPYTA PVAATAPVAA SGPKRVVAVT ACPTGVAHTF
MAAEAIETEA KKRGWWVKVE TRGSVGAGNA ITPEEVAAAD LVIVAADIEV DLAKFAGKPM
YRTSTGLALK KTAQELDKAV AEATPYEPAG KAQTATTEGK KESAGAYRHL LTGVSYMLPM
VVAGGLCIAL SFAFGIEAFK EPGTLAAALM QIGGGSAFAL MVPVLAGYIA FSIADRPGLT
PGLIGGMLAV STGSGFIGGI IAGFLAGYIA KLISTQLKLP QSMEALKPIL IIPLISSLVV
GLAMIYLIGK PVAGILEGLT HWLQTMGTAN AVLLGAILGG MMCTDMGGPV NKAAYAFGVG
LLSTQTYGPM AAIMAAGMVP PLAMGLATMV ARRKFDKAQQ EGGKAALVLG LCFISEGAIP
FAARDPMRVL PCCIVGGALT GAISMAIGAK LMAPHGGLFV LLIPGAITPV LGYLVAIIAG
TLVAGLAYAF LKRPEVDAVA KAA