Gene EcSMS35_2840 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_2840 
SymbolascB 
ID6146316 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp2916156 
End bp2917580 
Gene Length1425 bp 
Protein Length474 aa 
Translation table11 
GC content53% 
IMG OID641617709 
Productcryptic 6-phospho-beta-glucosidase 
Protein accessionYP_001744864 
Protein GI170680422 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2723] Beta-glucosidase/6-phospho-beta-glucosidase/beta-galactosidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones37 
Fosmid unclonability p-value0.106134 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCAGTAT TTCCAGAAGG TTTTTTATGG GGCGGCGCGC TTGCCGCCAA CCAGTCTGAA 
GGTGCGTTCC GTGAAGGTGG CAAAGGGCTG ACCACCGTCG ATATGATCCC ACACGGCGAG
CATCGAATGG CGGTGAAACT GGGGCTGGAA AAACGTTTTC AGTTGCGCGA TGACGAGTTT
TATCCCAGCC ATGAGGCGAC GGATTTTTAT CATCGTTATA AAGAAGATAT CGCCCTGATG
GCAGAGATGG GATTCAAAGT GTTCCGTACC TCAATTGCCT GGAGCCGCCT CTTCCCGCAG
GGCGATGAAC TGACGCCCAA CCAGCAGGGC ATTGCTTTTT ATCGCGCGGT ATTTGAAGAG
TGTAAAAAGT ACGGTATCGA ACCGCTGGTC ACGTTGTGCC ACTTCGATGT GCCGATGCAT
CTGGTCACCG AATATGGCTC CTGGCGTAAC CGCAAGCTGG TGGAGTTTTT CAGCCGCTAC
GCCCGGACCT GCTTTGAAGC ATTTGATGGT CTGGTGAAAT ACTGGCTTAC CTTCAATGAA
ATCAACATTA TGTTGCATAG CCCGTTCTCC GGCGCGGGTC TGGTGTTTGA AGAAGGGGAA
AATCAGGATC AGGTGAAATA TCAGGCCGCG CATCACCAGC TGGTTGCCAG TGCGCTAGCT
ACCAAAATCG CCCATGAGGT TAACCCGCAA AATCAGGTGG GCTGTATGCT GGCGGGCGGC
AACTTCTACC CTTACAGTTG CAAGCCGGAG GATGTCTGGG CGGCGTTGGA GAAAGACCGG
GAAAACCTGT TTTTTATCGA TGTGCAGGCG CGGGGCGCGT ATCCGGCTTA CTCTGCCCGC
GTATTCCGCG AAAAAGGGGT AACCATCAAC AAAGCACCGG GCGATGATGA AATCCTGAAA
AACACCGTCG ATTTTGTCTC ATTCAGCTAT TACGCCTCGC GCTGCGCCTC GGCGGAGATG
AACGCCAACA ACAGCAGTGC GGCGAATGTA GTGAAATCGC TGCGTAACCC GTATCTACAG
GTAAGCGACT GGGGCTGGGG TATTGATCCA CTCGGTCTGC GTATCACCAT GAACATGATG
TACGACCGCT ATCAGAAGCC GCTGTTTCTG GTGGAAAACG GCCTCGGCGC AAAAGATGAA
TTTGCTGCCA ACGGTGAGAT TAACGATGAC TATCGCATCA GCTATTTACG CGAACATATC
CGCGCAATGG GCGAAGCGAT TGCAGATGGC ATTCCGCTGA TGGGCTACAC CACCTGGGGC
TGTATTGATT TGGTTTCCGC CTCAACGGGT GAAATGAGCA AACGCTACGG TTTTGTCTAT
GTAGACCGTG ACGATGCAGG CAACGGCACG CTGGCGCGCA CGCGTAAGAA ATCGTTCTGG
TGGTATAAAA AAGTGATTGC CAGTAATGGT GAGGATTTAG AGTAA
 
Protein sequence
MSVFPEGFLW GGALAANQSE GAFREGGKGL TTVDMIPHGE HRMAVKLGLE KRFQLRDDEF 
YPSHEATDFY HRYKEDIALM AEMGFKVFRT SIAWSRLFPQ GDELTPNQQG IAFYRAVFEE
CKKYGIEPLV TLCHFDVPMH LVTEYGSWRN RKLVEFFSRY ARTCFEAFDG LVKYWLTFNE
INIMLHSPFS GAGLVFEEGE NQDQVKYQAA HHQLVASALA TKIAHEVNPQ NQVGCMLAGG
NFYPYSCKPE DVWAALEKDR ENLFFIDVQA RGAYPAYSAR VFREKGVTIN KAPGDDEILK
NTVDFVSFSY YASRCASAEM NANNSSAANV VKSLRNPYLQ VSDWGWGIDP LGLRITMNMM
YDRYQKPLFL VENGLGAKDE FAANGEINDD YRISYLREHI RAMGEAIADG IPLMGYTTWG
CIDLVSASTG EMSKRYGFVY VDRDDAGNGT LARTRKKSFW WYKKVIASNG EDLE