Gene EcE24377A_3001 details

Gene Information

Locus tag: EcE24377A_3001
Symbol: ascB
ID: 5590198
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli E24377A
Kingdom: Bacteria
Replicon accession: NC_009801
Strand:
Start bp: 3001659
End bp: 3003083
Gene Length: 1425 bp
Protein Length: 474 aa
Translation table: 11
GC content: 52%
IMG OID: 640926649
Product: cryptic 6-phospho-beta-glucosidase
Protein accession: YP_001464025
Protein GI: 157158253
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2723] Beta-glucosidase/6-phospho-beta-glucosidase/beta-galactosidase
TIGRFAM ID:
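
The coordinates and lengths reported above are consistent with one another, which a little arithmetic confirms. The following is a minimal Python sketch, assuming the start and end coordinates are 1-based and inclusive and that the gene length includes one stop codon. Both are assumptions, although they agree with the reported values. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(3001659, 3003083)
print(length_bp)                  # 1425
print(protein_length(length_bp))  # 474
```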


Plasmid Coverage information

Num covering plasmid clones: 40
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCAGTAT TTCCAGAAGG TTTTTTATGG GGCGGCGCGC TTGCCGCCAA CCAGTCTGAA 
GGTGCGTTCC GTGAAGGTGG CAAAGGACTG ACCACGGTCG ATATGATCCC GCACGGCGAG
CATCGAATGG CGGTGAAACT GGGGCTGGAA AAACGTTTTC AGTTGCGCGA TGACGAGTTT
TATCCCAGCC ATGAGGCGAC GGATTTTTAT CATCGTTATA AAGAAGATAT CGCCCTGATG
GCAGAGATGG GATTCAAGGT TTTCCGTACC TCAATTGCCT GGAGCCGCCT CTTTCCGCAG
GGCGATGAAA TCACGCCCAA TCAGCAGGGC ATTGCTTTTT ACCGTTCTGT CTTTGAAGAG
TGTAAAAAGT ACGGTATCGA ACCGCTGGTC ACGTTGTGCC ACTTCGATGT GCCGATGCAT
CTGGTCACCG AATATGGCTC CTGGCGTAAC CGCAAGCTGG TGGAGTTTTT CAGCCGCTAC
GCCAGAACCT GCTTTGAAGC ATTTGATGGT CTGGTGAAAT ACTGGCTAAC CTTCAATGAA
ATCAACATTA TGTTGCATAG CCCGTTCTCC GGCGCGGGTC TGGTGTTTGA AGAAGGTGAA
AATCAGGATC AGGTGAAATA TCAGGCCGCG CATCACCAGC TGGTTGCCAG TGCGCTAGCC
ACCAAAATCG CCCATGAGGT TAACCCGCAA AATCAGGTGG GCTGTATGCT GGCGGGCGGT
AACTTCTACC CTTACAGCTG CAAACCGGAA GATGTCTGGG CGGCGCTGGA GAAAGATCGG
GAAAACCTGT TTTTTATCGA TGTGCAGGCG CGCGGCGCGT ATCCGGCTTA CTCTGCCCGC
GTATTTCGCG AAAAAGGGGT AACCATCAAC AAAGCACCGG GCGATGATGA AATCCTGAAA
AACACCGTCG ATTTTGTCTC TTTCAGCTAT TACGCCTCGC GCTGCGCCTC GGCGGAGATG
AACGCCAACA ACAGCAGTGC GGCGAACGTG GTGAAATCGC TGCGTAATCC GTATCTACAG
GTGAGCGACT GGGGCTGGGG AATTGATCCA CTCGGTCTGC GTATCACCAT GAATATGATG
TACGATCGTT ATCAGAAGCC GCTGTTTCTG GTGGAAAACG GCCTCGGCGC AAAAGATGAA
CTTGCTGCCA ATGGCGAGAT TAACGATGAC TATCGCATCA GCTATTTACG CGAACATATC
CGCGCAATGG GCGAAGCGAT TGCAGATGGC ATTCCGCTGA TGGGCTACAC CACCTGGGGC
TGTATTGATT TAGTTTCCGC CTCTACGGGT GAAATGAGCA AACGCTACGG TTTTGTCTAC
GTAGACCGTG ACGACGCAGG CAACGGCACG CTGACGCGCA CGCGTAAGAA ATCGTTCTGG
TGGTATAAAA AAGTAATTGC CAGTAATGGG GAAGATTTAG AGTAA
 
Protein sequence
MSVFPEGFLW GGALAANQSE GAFREGGKGL TTVDMIPHGE HRMAVKLGLE KRFQLRDDEF 
YPSHEATDFY HRYKEDIALM AEMGFKVFRT SIAWSRLFPQ GDEITPNQQG IAFYRSVFEE
CKKYGIEPLV TLCHFDVPMH LVTEYGSWRN RKLVEFFSRY ARTCFEAFDG LVKYWLTFNE
INIMLHSPFS GAGLVFEEGE NQDQVKYQAA HHQLVASALA TKIAHEVNPQ NQVGCMLAGG
NFYPYSCKPE DVWAALEKDR ENLFFIDVQA RGAYPAYSAR VFREKGVTIN KAPGDDEILK
NTVDFVSFSY YASRCASAEM NANNSSAANV VKSLRNPYLQ VSDWGWGIDP LGLRITMNMM
YDRYQKPLFL VENGLGAKDE LAANGEINDD YRISYLREHI RAMGEAIADG IPLMGYTTWG
CIDLVSASTG EMSKRYGFVY VDRDDAGNGT LTRTRKKSFW WYKKVIASNG EDLE
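
The sequences above are printed in groups of ten residues, so the spacing has to be removed before they can be used in any calculation. The following is a minimal Python sketch with hypothetical helper names: one function strips the grouping whitespace and another computes GC percentage. Applied to the full gene sequence, the cleaned length should match the reported 1425 bp and the GC percentage should land near the reported 52%, if the record is internally consistent.

```python
def clean_sequence(block: str) -> str:
    """Remove the spaces and line breaks used to group residues in tens."""
    return "".join(block.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc = sum(1 for base in dna if base in "GC")
    return 100.0 * gc / len(dna)


# First line of the gene sequence above, used as a small example.
first_line = "ATGTCAGTAT TTCCAGAAGG TTTTTTATGG GGCGGCGCGC TTGCCGCCAA CCAGTCTGAA"
dna = clean_sequence(first_line)
print(len(dna))  # 60
print(round(gc_percent(dna), 1))
```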