Gene EcHS_A4183 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A4183 
Symbol 
ID5594452 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp4172442 
End bp4173521 
Gene Length1080 bp 
Protein Length359 aa 
Translation table11 
GC content54% 
IMG OID640923285 
Productputative fructose-like permease EIIC subunit 2 
Protein accessionYP_001460744 
Protein GI157163426 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1299] Phosphotransferase system, fructose-specific IIC component 
TIGRFAM ID[TIGR01427] PTS system, fructose subfamily, IIC component 


Plasmid Coverage information

Num covering plasmid clones72 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATGAGT TGGTGCAGAT CCTGAAAAAT ACCCGTCAGC ATTTAATGAC GGGCGTTTCA 
CACATGATTC CCTTCGTGGT ATCGGGCGGT ATTTTGCTGG CGGTTTCCGT CATGTTGTAT
GGCAAAGGCG CAGTGCCGGA TGCCGTAGCC GATCCGAATC TGAAAAAACT GTTTGATATC
GGCGTTGCGG GTTTGACGCT GATGGTGCCT TTCCTCGCCG CTTACATTGG TTACTCCATT
GCAGAGCGTT CTGCGCTGGC TCCGTGCGCT ATCGGGGCCT GGGTTGGTAA CAGCTTTGGT
GCGGGCTTCT TTGGTGCGCT GATCGCCGGG ATTATCGGCG GCATCGTGGT GCATTACCTG
AAGAAAATTC CGGTGCATAA AGTTCTGCGC TCGGTGATGC CTATCTTCAT CATTCCTATC
GTCGGCACAC TGATTACCGC AGGCATCATG ATGTGGGGCT TGGGCGAGCC TGTAGGGGCG
TTGACCAACA GCCTGACTCA GTGGCTTCAG GGGATGCAGC AGGGCAGCAT TGTTATGCTG
GCGGTGATCA TGGGTCTGAT GCTGGCGTTC GATATGGGCG GTCCGGTTAA CAAAGTGGCC
TATGCCTTCA TGCTGATTTG CGTTGCTCAG GGTGTTTATA CCGTGGTGGC TATTGCCGCT
GTTGGGATTT GTGTTCCACC GCTGGGGATG GGGCTGGCGA CGCTGATTGG TCGTAAAAAT
TTCTCCGCAG AAGAGCGCGA AACTGGTAAA GCGGCGCTGG TGATGGGGTG CGTTGGGGTT
ACTGAAGGGG CGATTCCTTT CGCCGCTGCC GATCCGCTGC GTGTCATTCC TTCCATCATG
GTTGGTTCTG TTTGTGGTGC GGTAACTGCG GCGCTGGTCG GTGCGCAGTG CTATGCAGGC
TGGGGTGGTC TGATTGTACT GCCGGTGGTT GAAGGCAAGC TGGGTTATAT CGCCGCAGTG
GCTGTCGGAG CAGTGGTGAC GGCTGTTTGT GTGAACGTGC TGAAAAGTCT GGCGCGTAAA
AATGGGTCTT CGACTGATGA AAAAGAAGAC GACCTGGATT TGGATTTTGA AATTAATTAA
 
Protein sequence
MNELVQILKN TRQHLMTGVS HMIPFVVSGG ILLAVSVMLY GKGAVPDAVA DPNLKKLFDI 
GVAGLTLMVP FLAAYIGYSI AERSALAPCA IGAWVGNSFG AGFFGALIAG IIGGIVVHYL
KKIPVHKVLR SVMPIFIIPI VGTLITAGIM MWGLGEPVGA LTNSLTQWLQ GMQQGSIVML
AVIMGLMLAF DMGGPVNKVA YAFMLICVAQ GVYTVVAIAA VGICVPPLGM GLATLIGRKN
FSAEERETGK AALVMGCVGV TEGAIPFAAA DPLRVIPSIM VGSVCGAVTA ALVGAQCYAG
WGGLIVLPVV EGKLGYIAAV AVGAVVTAVC VNVLKSLARK NGSSTDEKED DLDLDFEIN