Gene EcolC_1479 details

Gene Information

Locus tag: EcolC_1479
Symbol:
ID: 6067187
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli ATCC 8739
Kingdom: Bacteria
Replicon accession: NC_010468
Strand:
Start bp: 1631679
End bp: 1632809
Gene Length: 1131 bp
Protein Length: 376 aa
Translation table: 11
GC content: 56%
IMG OID: 641600899
Product: bifunctional PTS system fructose-specific transporter subunit IIA/HPr protein
Protein accession: YP_001724469
Protein GI: 170019515
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1925] Phosphotransferase system, HPr-related proteins
        [COG4668] Mannitol/fructose-specific phosphotransferase system, IIA domain
TIGRFAM ID: [TIGR01003] Phosphotransferase System HPr (HPr) Family
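
The coordinate and length fields above are consistent with each other, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length counts the stop codon; the function names are hypothetical and not part of any database API.

```python
def cds_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count of an unspliced CDS whose length includes one stop codon (assumed)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_length_bp // 3 - 1


gene_len = cds_length(1631679, 1632809)
print(gene_len)                  # 1131
print(protein_length(gene_len))  # 376
```

Both values match the Gene Length and Protein Length fields of this record.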


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000150472
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000000341922
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGTTCCAGT TATCCGTACA GGACATCCAT CCGGGCGAAA AGGCCGGAGA CAAAGAAGAG 
GCGATTCGCC AGGTCGCTGC GGCGCTGGTG CATGCCGGTA ATGTAGCAGA AGGCTACGTC
AATGGCATGC TGGCGCGCGA GCAGCAAACC TCAACGTTCC TCGGCAATGG TATTGCTATT
CCACACGGCA CCACCGACAC CCGCGATCAG GTGCTGAAAA CGGGCGTTCA GGTATTTCAG
TTCCCGGAAG GCGTCACCTG GGGTGACGGT CAGGTAGCGT ACGTGGCGAT CGGTATTGCT
GCCAGCTCGG ATGAACATCT GGGCCTGCTA CGCCAGCTGA CCCACGTACT GAGCGATGAT
TCCGTTGCTG AACAACTGAA GTCAGCAACA ACAGCAGAAG AACTTCGCGC ATTGCTGATG
GGCGAAAAGC AGAGTGAGCA GCTGAAGCTC GACAACGAAA TGCTGACGCT GGATATCGTC
GCCAGCGATC TGCTGACTCT TCAGGCGCTG AACGCTGCGC GTCTGAAAGA GGCGGGGGCA
GTTGACGCCA CTTTCGTCAC CAAAGCCATC AATGAACAAC CGCTGAACCT CGGACAGGGT
ATCTGGCTGA GCGATAGCGC CGAAGGCAAT CTGCGTAGCG CGATTGCGGT AAGCCGTGCG
GCAAATGCTT TTGATGTGGA CGGCGAAACG GCAGCCATGC TGGTGAGTGT GGCGATGAAT
GACGATCAGC CCCTCGCGGT TCTTAAGCGT CTCGCTGATT TGTTGCTCGA CAATAAAGCT
GACCGCTTGC TGAAAGCGGA TGCGGCAACG TTGCTGGCGC TGCTGACCAG CGATGATGCG
CCGACCGACG ACGTGTTAAG CGCGGAGTTT GTGGTGCGCA ATGAACACGG CCTGCATGCT
CGTCCAGGTA CCATGCTGGT CAATACCATT AAACAATTTA ACAGTGATAT TACCGTGACA
AACCTTGATG GCACCGGCAA ACCGGCAAAC GGACGTAGTC TGATGAAAGT TGTGGCACTT
GGCGTTAAGA AAGGTCATCG CCTACGCTTT ACCGCCCAGG GTGCAGATGC TGAACAGGCG
CTGAAAGCAA TCGGCGACGC TATCGCTGCT GGTCTTGGGG AGGGCGCATA A
 
Protein sequence
MFQLSVQDIH PGEKAGDKEE AIRQVAAALV HAGNVAEGYV NGMLAREQQT STFLGNGIAI 
PHGTTDTRDQ VLKTGVQVFQ FPEGVTWGDG QVAYVAIGIA ASSDEHLGLL RQLTHVLSDD
SVAEQLKSAT TAEELRALLM GEKQSEQLKL DNEMLTLDIV ASDLLTLQAL NAARLKEAGA
VDATFVTKAI NEQPLNLGQG IWLSDSAEGN LRSAIAVSRA ANAFDVDGET AAMLVSVAMN
DDQPLAVLKR LADLLLDNKA DRLLKADAAT LLALLTSDDA PTDDVLSAEF VVRNEHGLHA
RPGTMLVNTI KQFNSDITVT NLDGTGKPAN GRSLMKVVAL GVKKGHRLRF TAQGADAEQA
LKAIGDAIAA GLGEGA
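
The protein sequence can be reproduced from the gene sequence by codon-by-codon translation. The sketch below is an illustration only: it builds the standard codon assignments (which translation table 11 shares for elongation codons), strips the spaces used to group the sequence in blocks of ten, and stops at the first stop codon. The names `CODON_TABLE` and `translate` are hypothetical, and alternative start codons are not handled since this gene begins with ATG.

```python
# Standard codon assignments, enumerated in TCAG order for each codon position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, ignoring whitespace, up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":  # stop codon ends the protein
            break
        residues.append(amino_acid)
    return "".join(residues)


# First three blocks of the gene sequence listed above.
print(translate("ATGTTCCAGT TATCCGTACA GGACATCCAT"))  # MFQLSVQDIH
```

The result matches the first block of the protein sequence listed above.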