Gene EcSMS35_2316 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_2316 
SymbolfruB 
ID6145993 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp2348023 
End bp2349153 
Gene Length1131 bp 
Protein Length376 aa 
Translation table11 
GC content56% 
IMG OID641617190 
Productbifunctional PTS system fructose-specific transporter subunit IIA/HPr protein 
Protein accessionYP_001744363 
Protein GI170683388 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1925] Phosphotransferase system, HPr-related proteins
[COG4668] Mannitol/fructose-specific phosphotransferase system, IIA domain 
TIGRFAM ID[TIGR01003] Phosphotransferase System HPr (HPr) Family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000000446962 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value0.000655033 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGTTCCAGT TATCCGTACA GGACATCCAT CCGGGCGAAA AGGCCGGAGA CAAAGAAGAG 
GCGATTCGCC AGGTCGCTGC GGCGCTGGTG CAGGCCGGTA ATGTAGCTGA AGGCTACGTC
AATGGCATGC TGGCGCGCGA ACAGCAAACC TCTACGTTCC TCGGCAATGG TATTGCTATT
CCACACGGCA CTACCGACAC CCGCGATCAG GTGCTGAAAA CCGGCGTTCA GGTATTTCAG
TTCCCGGAAG GCGTCACCTG GGGTGACGGC CAGGTAGCGT ACGTGGCGAT CGGTATTGCT
GCCAGCTCGG ATGAACATCT GGGTCTGTTA CGCCAGCTGA CCCACGTACT AAGCGATGAT
TCCGTTGCTG AACAGCTGAA GTCAGCAACG ACAGCAGAAG AACTTCGCGC ATTGCTGATG
GGCGAAAAGC AGAGTGAGCA GCTGAAGCTC GATAACGAAA TGCTGACGCT GGATATTGTC
GCCAGCGATC TGCTGACGCT TCAGGCGCTT AACGCTGCGC GTCTGAAAGA AGCAGGGGCA
GTTGACGCCA CCTTCGTCAC TAAAGCCATC AATGAACAAC CGCTGAACCT CGGACAGGGC
ATCTGGCTGA GCGATAGCGC TGAAGGCAAT CTGCGTAGCG CGATTGCGGT AAGCCGTGCG
GCAAATGCTT TTGATGTGGA CGGCGAAACG GCAGCCATGC TGGTGAGTGT GGCGATGAAT
GACGATCAGC CCATTGCGGT TCTTAAGCGT CTCGCTGATT TGTTGCTCGA TAATAAAGCT
GACCGCTTGC TGAAAGCGGA TGCGGCAACG TTGCTGGCGC TGCTGACCAG CGATGATGCG
CCGACCGACG ACGTGTTAAG CGCGGAGTTT GTGGTGCGCA ATGAACACGG CCTGCATGCT
CGTCCAGGTA CCATGCTGGT CAATACCATT AAACAATTTA ACAGTGATAT TACCGTGACA
AACCTTGATG GCACCGGCAA ACCGGCAAAC GGACGTAGTC TGATGAAAGT TGTGGCACTT
GGCGTTAAGA AAGGTCATCG CCTACGCTTT ACCGCCCAGG GTGCAGATGC TGAACAGGCG
CTGAAAGCCA TTGGCGACGC TATCGCTGCT GGTCTTGGGG AGGGCGCATA A
 
Protein sequence
MFQLSVQDIH PGEKAGDKEE AIRQVAAALV QAGNVAEGYV NGMLAREQQT STFLGNGIAI 
PHGTTDTRDQ VLKTGVQVFQ FPEGVTWGDG QVAYVAIGIA ASSDEHLGLL RQLTHVLSDD
SVAEQLKSAT TAEELRALLM GEKQSEQLKL DNEMLTLDIV ASDLLTLQAL NAARLKEAGA
VDATFVTKAI NEQPLNLGQG IWLSDSAEGN LRSAIAVSRA ANAFDVDGET AAMLVSVAMN
DDQPIAVLKR LADLLLDNKA DRLLKADAAT LLALLTSDDA PTDDVLSAEF VVRNEHGLHA
RPGTMLVNTI KQFNSDITVT NLDGTGKPAN GRSLMKVVAL GVKKGHRLRF TAQGADAEQA
LKAIGDAIAA GLGEGA