Gene EcHS_A4494 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A4494 
SymboltreB 
ID5593313 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp4499974 
End bp4501392 
Gene Length1419 bp 
Protein Length472 aa 
Translation table11 
GC content55% 
IMG OID640923592 
ProductPTS system trehalose(maltose)-specific transporter subunits IIBC 
Protein accessionYP_001461033 
Protein GI157163715 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1263] Phosphotransferase system IIC components, glucose/maltose/N-acetylglucosamine-specific 
TIGRFAM ID[TIGR00826] PTS system, glucose-like IIB component
[TIGR00852] PTS system, maltose and glucose-specific subfamily, IIC component
[TIGR01992] PTS system, trehalose-specific IIBC component 


Plasmid Coverage information

Num covering plasmid clones54 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCAAAA TAAACCAAAC GGATATCGAT CGGTTGATTG AACTGGTCGG CGGGCGCGGC 
AATATTGCGA CGGTGAGCCA CTGTATTACT CGCCTGCGCT TTGTCCTCAA CCAACCGGCC
AATGCCAGAC CGAAAGAAAT TGAGCAACTC CCCATGGTGA AAGGCTGTTT CACCAATGCC
GGGCAATTTC AGGTGGTGAT TGGCACCAAC GTGGGTGATT ACTATCAAGC ACTAATAGCG
TCAACCGGAC AGGCGCAGGT TGATAAAGAG CAGGTAAAAA AAGCCGCCCG GCAGAATATG
AAATGGCATG AGCAGTTGAT CTCTCATTTC GCGGAGATCT TCTTCCCGTT GCTGCCCGCG
TTGATTAGCG GCGGTTTGAT CCTCGGTTTT CGCAATGTGA TCGGCGATTT GCCCATGAGC
AACGGTCAAA CGCTGGCGCA AATGTACCCT TCCCTGCAAA CGATCTACGA TTTTCTGTGG
TTGATCGGTG AAGCGATCTT CTTCTACCTG CCGGTCGGGA TTTGCTGGTC AGCGGTGAAA
AAAATGGGCG GCACGCCGAT CCTTGGTATC GTGCTTGGCG TGACACTGGT TTCCCCCCAG
CTGATGAACG CTTATCTGCT CGGGCAGCAG CTGCCGGAAG TGTGGGACTT TGGCATGTTC
AGCATCGCCA AAGTGGGCTA TCAGGCGCAG GTGATCCCGG CACTGTTAGC CGGGCTGGCG
CTGGGCGTTA TTGAAACTCG CCTTAAACGC ATCGTACCGG ATTACCTCTA TCTGGTGGTG
GTGCCCGTCT GTTCGCTGAT CCTCGCGGTG TTCCTCGCCC ATGCGCTGAT TGGTCCGTTT
GGTCGCATGA TTGGCGATGG CGTTGCCTTT GCGGTACGTC ACCTGATGAC CGGCAGCTTT
GCTCCGATTG GTGCGGCATT GTTTGGCTTC CTGTACGCGC CGCTGGTGAT CACCGGCGTA
CACCAGACCA CCCTTGCTAT TGATTTGCAG ATGATTCAAA GCATGGGTGG CACGCCAGTG
TGGCCGCTGA TTGCGCTGTC GAATATCGCT CAGGGCTCCG CCGTGATAGG CATTATCATT
TCCAGCCGCA AGCACAATGA ACGCGAGATC TCCGTGCCTG CCGCTATCTC CGCCTGGCTT
GGGGTCACTG AGCCTGCAAT GTACGGCATC AACCTGAAAT ATCGCTTCCC GATGCTGTGC
GCGATGATTG GTTCTGGTCT GGCAGGATTA CTATGCGGCC TGAACGGCGT TATGGCGAAT
GGTATCGGCG TAGGCGGCCT GCCGGGAATT CTCTCGATTC AACCGAGCTA CTGGCAGGTA
TTTGCGCTGG CAATGGTTAT CGCCATCATC ATCCCGATTG TACTCACCTC GTTTATCTAT
CAGCGGAAAT ACCGCCTGGG CACGCTGGAT ATTGTTTAA
 
Protein sequence
MSKINQTDID RLIELVGGRG NIATVSHCIT RLRFVLNQPA NARPKEIEQL PMVKGCFTNA 
GQFQVVIGTN VGDYYQALIA STGQAQVDKE QVKKAARQNM KWHEQLISHF AEIFFPLLPA
LISGGLILGF RNVIGDLPMS NGQTLAQMYP SLQTIYDFLW LIGEAIFFYL PVGICWSAVK
KMGGTPILGI VLGVTLVSPQ LMNAYLLGQQ LPEVWDFGMF SIAKVGYQAQ VIPALLAGLA
LGVIETRLKR IVPDYLYLVV VPVCSLILAV FLAHALIGPF GRMIGDGVAF AVRHLMTGSF
APIGAALFGF LYAPLVITGV HQTTLAIDLQ MIQSMGGTPV WPLIALSNIA QGSAVIGIII
SSRKHNEREI SVPAAISAWL GVTEPAMYGI NLKYRFPMLC AMIGSGLAGL LCGLNGVMAN
GIGVGGLPGI LSIQPSYWQV FALAMVIAII IPIVLTSFIY QRKYRLGTLD IV