Gene EcHS_A3894 details


Gene Information

Locus tag: EcHS_A3894
Symbol:
ID: 5595084
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli HS
Kingdom: Bacteria
Replicon accession: NC_009800
Strand:
Start bp: 3889158
End bp: 3891035
Gene Length: 1878 bp
Protein Length: 625 aa
Translation table: 11
GC content: 51%
IMG OID: 640923002
Product: PTS system, alpha-glucoside-specific IIBC component
Protein accession: YP_001460479
Protein GI: 157163161
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1263] Phosphotransferase system IIC components, glucose/maltose/N-acetylglucosamine-specific
        [COG1264] Phosphotransferase system IIB components
TIGRFAM ID: [TIGR00826] PTS system, glucose-like IIB component
            [TIGR00852] PTS system, maltose and glucose-specific subfamily, IIC component
            [TIGR02005] PTS system, alpha-glucoside-specific IIBC component
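
The coordinate and length fields in this record are related by simple arithmetic. The sketch below is an illustrative check, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon. The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


length = gene_length(3889158, 3891035)
print(length)                  # 1878
print(protein_length(length))  # 625
```

Under these assumptions the computed values agree with the Gene Length (1878 bp) and Protein Length (625 aa) fields of this record.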


Plasmid Coverage information

Num covering plasmid clones: 66
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a

Sequence

Gene sequence
ATGCGCCCAC AAGTTCTGCT TAATCGATTG AAAAAACATC TTTTTTTTAA AGATGTGTTC 
GATGCTGTGT ACCAGTGCTC ACAGATGTCT ACTTTTTCGC GAAAACGTAG ATCTCTACCG
CCCAACGAAA AGCATGAAAG CGATCACGAA TCCCATTGGG TCGTCATGTT CTGCCGATCG
CATCTTCCTA TCCTCGCTCC AGGCCTGCCG CATAACCAAT CAGGCTTCCT ACTTACAGAA
TTGAGAAAAG AGGATGTGGA AATGCTCAGT CAAATTCAAC GCTTTGGCGG CGCGATGTTC
ACGCCAGTGC TGCTGTTTCC CTTCGCCGGG ATTGTGGTGG GTCTTGCCAT CTTGCTGCAA
AACCCGATGT TTGTCGGGGA ATCACTGACC GATCCGAACA GTTTATTCGC GCAAATCGTA
CACATTATTG AAGAGGGCGG TTGGACGGTA TTCCGTAATA TGCCGCTGAT TTTTGCTGTC
GGTTTACCCA TTGGCCTTGC TAAGCAAGCG CAGGGGCGTG CTTGTCTGGC GGTGATGGTG
AGTTTCCTGA CCTGGAACTA TTTCATCAAC GCGATGGGAA TGACCTGGGG AAGCTACTTC
GGCGTCGATT TCACTCAGGA CGCGGTGGCA GGTAGCGGTC TGACAATGAT GGCCGGGATT
AAAACCCTCG ATACCAGCAT TATCGGCGCA ATTATCATTT CCGGCATTGT GACGGCGCTG
CATAACCGTC TGTTCGATAA AAAACTGCCG GTTTTTCTCG GCATTTTCCA GGGGACGTCT
TATGTGGTGA TTATCGCCTT CCTGGTGATG ATCCCCTGTG CCTGGCTGAC GTTGCTCGGC
TGGCCAAAAG TACAAATGGG GATTGAATCT CTGCAAGCGT TCCTGCGTTC GGCGGGTGCA
CTTGGGGTCT GGGTTTACAC CTTCCTCGAA CGTATTCTGA TCCCAACCGG TTTACACCAC
TTCATCTACG GACAGTTTAT CTTTGGTCCG GCAGCTGTTG AAGGCGGCAT TCAGATGTAC
TGGGCGCAGC ATCTGCAAGA GTTCAGTTTG AGCGCCGAGC CGCTGAAATC GTTGTTCCCG
GAAGGCGGTT TTGCCCTGCA CGGTAACTCA AAAATCTTTG GTGCCGTGGG CATTTCTTTA
GCGATGTACT TCACTGCCGC ACCGGAAAAT CGGGTAAAAG TGGCGGGCTT GCTGATTCCC
GCAACCTTAA CCGCCATGCT GGTGGGAATT ACCGAACCGC TGGAATTTAC CTTCCTGTTC
ATTTCACCGT TGCTGTTTGC GGTACACGCC GTGCTGGCGG CCTCAATGTC GACCGTAATG
TATCTCTTTG GTGTGGTGGG CAACATGGGC GGAGGTCTGA TTGACCAGGT TTTACCGCAA
AACTGGATCC CGATGTTCAG CAACCACGCG GATATGATGC TGACCCAAAT CGCCATTGGG
TTGTGCTTTA CCCTGCTGTA CTTCGTGGTT TTCCGCACAC TGATTCTGCA GTTCAACATG
TGCACGCCGG GACGTGAAGA TGCGGAAGTG AAACTCTACT CAAAAGCCGA ATACAAAGCC
TCGCGAGGCC AAACCACCGC GGCAGAGCCA AAAAAAGAGC TGGATCAGGC TGCCGGTATC
CTGCAAGCCC TGGGCGGGGT CGGCAATATC TCCAGCATTA ACAATTGCGC GACGCGTTTA
CGTATTGCAC TGCATGACAT GTCACAAACG CTGGATGACG AAGTCTTTAA AAAGCTGGGA
GCGCACGGCG TCTTCCGTAG TGGCGATGCC ATTCAGGTGA TCATTGGTCT GCATGTATCC
CAGCTGCGTG AACAGCTCGA TAGCTTAATT AATTCTCATC AATCAGCAGA AAATGTTGCC
ATTACGGAGG CAGTATAA
 
Protein sequence
MRPQVLLNRL KKHLFFKDVF DAVYQCSQMS TFSRKRRSLP PNEKHESDHE SHWVVMFCRS 
HLPILAPGLP HNQSGFLLTE LRKEDVEMLS QIQRFGGAMF TPVLLFPFAG IVVGLAILLQ
NPMFVGESLT DPNSLFAQIV HIIEEGGWTV FRNMPLIFAV GLPIGLAKQA QGRACLAVMV
SFLTWNYFIN AMGMTWGSYF GVDFTQDAVA GSGLTMMAGI KTLDTSIIGA IIISGIVTAL
HNRLFDKKLP VFLGIFQGTS YVVIIAFLVM IPCAWLTLLG WPKVQMGIES LQAFLRSAGA
LGVWVYTFLE RILIPTGLHH FIYGQFIFGP AAVEGGIQMY WAQHLQEFSL SAEPLKSLFP
EGGFALHGNS KIFGAVGISL AMYFTAAPEN RVKVAGLLIP ATLTAMLVGI TEPLEFTFLF
ISPLLFAVHA VLAASMSTVM YLFGVVGNMG GGLIDQVLPQ NWIPMFSNHA DMMLTQIAIG
LCFTLLYFVV FRTLILQFNM CTPGREDAEV KLYSKAEYKA SRGQTTAAEP KKELDQAAGI
LQALGGVGNI SSINNCATRL RIALHDMSQT LDDEVFKKLG AHGVFRSGDA IQVIIGLHVS
QLREQLDSLI NSHQSAENVA ITEAV
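
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The following is a minimal sketch, not the tool used to generate this record. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two differ in permitted start codons). The names `CODON_TABLE` and `translate` are hypothetical helpers introduced only for illustration.

```python
# Amino acids for all 64 codons in NCBI order (bases T, C, A, G at each position).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string, ignoring whitespace, up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


# First row of the gene sequence from this record (60 bp, 20 codons).
first_row = "ATGCGCCCAC AAGTTCTGCT TAATCGATTG AAAAAACATC TTTTTTTTAA AGATGTGTTC"
print(translate(first_row))  # MRPQVLLNRLKKHLFFKDVF
```

The output matches the first 20 residues of the protein sequence listed above. Passing the full gene sequence to `translate` works the same way, since whitespace and line breaks between the 10-base groups are removed before translation.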