Gene EcSMS35_1637 details


Gene Information

Locus tag: EcSMS35_1637
Symbol: celA
ID: 6144754
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 1627429
End bp: 1628868
Gene Length: 1440 bp
Protein Length: 479 aa
Translation table: 11
GC content: 45%
IMG OID: 641616513
Product: 6-phospho-beta-glucosidase
Protein accession: YP_001743691
Protein GI: 170683087
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2723] Beta-glucosidase/6-phospho-beta-glucosidase/beta-galactosidase
TIGRFAM ID:
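
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the record does not say so explicitly, but the numbers are consistent with that reading) and that the final codon of the CDS is a stop codon. The function names are hypothetical and are not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1627429
END_BP = 1628868
GENE_LENGTH_BP = 1440
PROTEIN_LENGTH_AA = 479


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count of an unspliced CDS whose last codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # subtract the stop codon


# Both reported lengths agree with the coordinates.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Both checks print `True`: 1628868 - 1627429 + 1 = 1440 bp, and 1440 / 3 - 1 = 479 aa.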


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 69
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCAGGAT TTAAAAAAGG TTTTTTATGG GGTGGCGCGG TAGCCGCGCA TCAGTTGGAA 
GGTGGCTGGA ATGAAGGAGG AAAAGGCATC AGTATCGCTG ATGTGATGAC TGCTGGCGCT
CACGGGGTGC CGCGTGAAGT GACAGAAGGC ATTATCGACG GGCTTAATTA TCCCAATCAT
GAAGCAATTG ATTTTTATCA TCGCTATAAA ACAGATATTC AGTTATTTGC CGAGATGGGA
TTCAAATGCT TTCGAACTTC CATTGCCTGG ACACGAATCT TTCCGCAAGG TGACGAACAG
GAGCCGAATG AAGAGGGTTT ACAATTTTAT GATGATCTGT TCGATGAATG CCTGAAGCAG
GGAATGGAGC CTGTGGTGAC GCTTTCGCAT TTTGAGATGC CTTATCATCT GGTGACAAAA
TATGGTGGCT GGCGACACCG TAAACTGATC GACTTTTTCA TCCGCTTCGC ATCAACGGTC
TTCACGCGCT ATAAAGAAAA AGTAAAGTAC TGGATGACGT TTAACGAAAT CAATAATCAG
GTGAATTTCA GCGAAAGCCT GTGTCCATTT ACTAATTCCG GTATCTTGTA TTCGCCAGAG
GAAGATATCA ATGAGCGCGA ACAAATAATG TACCAGGCGG TACATTACGA GTTAGTTGCC
AGTGCCCTGG CGGTACAGAC TGGAAAATCG ATCAATCCTG AATTTAATAT CGGCTGTATG
ATCGCCATGT GCCCCATCTA TCCTCTGACG TGTGCACCCA ACGATATGAT GATGGCCACG
AAAGCGATGC ATCGTCGTTA CTGGTTTACT GATGTTCATG CGCGTGGATA TTATCCGCAA
CATATGCTGA ATTACTTTGC CAGGAAAGGA TTCAACCTCG ATATCACACC AGAAGATAAC
ACGATTCTTG CCAGAGGTTG TGTCGACTTT ATCGGTTTTA GCTACTACAT GTCTTTTACT
ACGCAATTTT CGCCAGATAA CCCGCAACTG AATTATGTTG AACCACGAGA TTTGGTCAGC
AACCCTTATA TCGATACATC CGAATGGGGA TGGCAAATTG ATCCGGCAGG GCTACGTTAT
TCACTCAACT GGTTCTGGGA TCATTTCCAG TTGCCGCTGT TTATTGTCGA AAATGGATTG
GGTGCGGTTG ACCAGAGACA AGCTGACGGC ACGGTGAACG ATCACTATCG CATTGAGTAC
TTTGCTTCCC ATATTCGGGA AATGAAAAAA GCCGTTGTTG AAGATGGTGT TGACTTAATT
GGCTACACAC CGTGGGGCTG CATTGACCTG GTTTCTGCCG GAACAGGGGA AATGAAAAAA
CGCTACGGAA TGATTTATGT CGATAAAGAC AACGAAGGGA AGGGAACGCT GGAAAGGATA
CGTAAAGCGT CGTTTTACTG GTATCGGGAT CTCATCGCCA ACAATGGCGA AAATATTTGA
 
Protein sequence
MSGFKKGFLW GGAVAAHQLE GGWNEGGKGI SIADVMTAGA HGVPREVTEG IIDGLNYPNH 
EAIDFYHRYK TDIQLFAEMG FKCFRTSIAW TRIFPQGDEQ EPNEEGLQFY DDLFDECLKQ
GMEPVVTLSH FEMPYHLVTK YGGWRHRKLI DFFIRFASTV FTRYKEKVKY WMTFNEINNQ
VNFSESLCPF TNSGILYSPE EDINEREQIM YQAVHYELVA SALAVQTGKS INPEFNIGCM
IAMCPIYPLT CAPNDMMMAT KAMHRRYWFT DVHARGYYPQ HMLNYFARKG FNLDITPEDN
TILARGCVDF IGFSYYMSFT TQFSPDNPQL NYVEPRDLVS NPYIDTSEWG WQIDPAGLRY
SLNWFWDHFQ LPLFIVENGL GAVDQRQADG TVNDHYRIEY FASHIREMKK AVVEDGVDLI
GYTPWGCIDL VSAGTGEMKK RYGMIYVDKD NEGKGTLERI RKASFYWYRD LIANNGENI
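
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It assumes only the codon-to-amino-acid assignments of the standard genetic code, which translation table 11 shares (table 11 differs in which codons may act as start codons). The helper names `clean` and `translate` are hypothetical, and the whitespace stripping exists only to accept the 10-base blocks used in this record.

```python
from itertools import product

# Standard codon table, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to lay out the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence from this record (60 bases).
first_line = "ATGTCAGGAT TTAAAAAAGG TTTTTTATGG GGTGGCGCGG TAGCCGCGCA TCAGTTGGAA"
print(translate(first_line))  # MSGFKKGFLW GGAVAAHQLE without the spacing
```

The 20 residues produced from the first 60 bases match the first two blocks of the protein sequence listed above. Passing the full gene sequence to `translate` would be the natural way to compare against the complete 479-residue protein.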