Gene EcHS_A2304 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A2304 
SymbolfruA 
ID5591230 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp2301897 
End bp2303588 
Gene Length1692 bp 
Protein Length563 aa 
Translation table11 
GC content56% 
IMG OID640921430 
ProductPTS system fructose-specific transporter subunits IIBC 
Protein accessionYP_001458966 
Protein GI157161648 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1299] Phosphotransferase system, fructose-specific IIC component
[COG1445] Phosphotransferase system fructose-specific component IIB 
TIGRFAM ID[TIGR00829] PTS system, fructose-specific, IIB component
[TIGR01427] PTS system, fructose subfamily, IIC component 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.000200462 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAACGC TGCTGATTAT TGACGCTAAT CTCGGTCAGG CACGCGCCTA TATGGCGAAG 
ACCCTGCTGG GCGCGGCGGC GCGAAAAGCA AAACTGGAAA TCATCGACAA TCCGAACGAC
GCTGAAATGG CGATTGTTCT CGGTGATTCC ATCCCGAATG ACAGCGCGCT GAACGGTAAA
AATGTCTGGC TGGGTGATAT TTCCCGGGCA GTTGCGCACC CTGAGCTGTT CCTGAGTGAA
GCCAAAGGCC ATGCGAAACC TTACACTGCG CTGGTCACTG CGACAGCACC GGTTGCCGCC
AGCGGTCCGA AACGCGTAGT TGCGGTGACT GCTTGCCCGA CTGGCGTAGC ACACACCTTT
ATGGCGGCTG AAGCCATTGA AACCGAAGCG AAAAAACGTG GCTGGTGGGT GAAAGTTGAA
ACCCGTGGTT CTGTTGGCGC GGGTAATGCA ATCACTCCCG AGGAAGTAGC CGCAGCGGAT
CTGGTGATTG TGGCGGCAGA TATCGAAGTG GATCTGGCGA AATTTGCTGG TAAACCGATG
TATCGTACCT CTACCGGTCT GGCGCTGAAG AAAACCGCGC AGGAACTGGA TAAAGCGGTT
GCTGAAGCAA CGCCGTATGA ACCGGCGGGC AAAGCTCAAA CGGCGACCAC TGAAGGTAAG
AAAGAGAGTG CAGGCGCTTA TCGTCACTTG CTAACGGGCG TCTCTTACAT GCTGCCGATG
GTCGTTGCTG GTGGTCTGTG TATCGCGCTT TCTTTTGCTT TTGGTATCGA AGCGTTTAAA
GAGCCGGGTA CGTTGGCAGC GGCGCTGATG CAGATTGGTG GTGGTTCAGC CTTTGCGCTG
ATGGTGCCGG TACTGGCAGG TTATATTGCC TTTTCCATTG CCGATCGTCC GGGCCTCACG
CCGGGTCTGA TTGGCGGTAT GCTGGCGGTC AGCACCGGTT CTGGCTTCAT TGGCGGTATT
ATTGCGGGCT TCCTGGCTGG TTACATTGCG AAGTTAATCA GTACGCAACT GAAACTGCCA
CAGAGTATGG AGGCGCTGAA ACCAATCCTG ATCATTCCGC TAATTTCCAG TCTGGTGGTC
GGTCTGGCGA TGATCTACCT GATCGGTAAA CCGGTTGCTG GCATTCTCGA AGGGCTGACT
CACTGGCTGC AGACCATGGG GACTGCGAAT GCGGTTCTGC TGGGGGCGAT CCTCGGTGGC
ATGATGTGTA CTGACATGGG CGGTCCGGTA AACAAAGCAG CGTACGCATT CGGTGTGGGT
CTGCTGAGTA CTCAAACCTA TGGCCCGATG GCGGCGATTA TGGCGGCAGG TATGGTGCCA
CCGCTGGCAA TGGGTCTGGC AACAATGGTG GCGCGTCGCA AATTCGACAA AGCGCAGCAG
GAAGGTGGCA AAGCCGCTCT GGTATTGGGA CTGTGCTTCA TTTCGGAAGG TGCAATTCCG
TTTGCTGCTC GTGATCCGAT GCGTGTGCTG CCGTGCTGTA TCGTGGGTGG TGCGCTGACT
GGCGCAATCT CAATGGCGAT TGGTGCGAAA CTGATGGCAC CACACGGTGG TCTGTTTGTT
CTGCTGATCC CTGGCGCTAT TACGCCGGTA CTGGGTTACC TGGTAGCAAT TATTGCCGGT
ACGCTGGTGG CGGGTTTGGC CTATGCCTTC CTGAAACGTC CGGAAGTGGA CGCAGTAGCG
AAAGCAGCGT AA
 
Protein sequence
MKTLLIIDAN LGQARAYMAK TLLGAAARKA KLEIIDNPND AEMAIVLGDS IPNDSALNGK 
NVWLGDISRA VAHPELFLSE AKGHAKPYTA LVTATAPVAA SGPKRVVAVT ACPTGVAHTF
MAAEAIETEA KKRGWWVKVE TRGSVGAGNA ITPEEVAAAD LVIVAADIEV DLAKFAGKPM
YRTSTGLALK KTAQELDKAV AEATPYEPAG KAQTATTEGK KESAGAYRHL LTGVSYMLPM
VVAGGLCIAL SFAFGIEAFK EPGTLAAALM QIGGGSAFAL MVPVLAGYIA FSIADRPGLT
PGLIGGMLAV STGSGFIGGI IAGFLAGYIA KLISTQLKLP QSMEALKPIL IIPLISSLVV
GLAMIYLIGK PVAGILEGLT HWLQTMGTAN AVLLGAILGG MMCTDMGGPV NKAAYAFGVG
LLSTQTYGPM AAIMAAGMVP PLAMGLATMV ARRKFDKAQQ EGGKAALVLG LCFISEGAIP
FAARDPMRVL PCCIVGGALT GAISMAIGAK LMAPHGGLFV LLIPGAITPV LGYLVAIIAG
TLVAGLAYAF LKRPEVDAVA KAA