Gene EcolC_0996 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_0996 
Symbol 
ID6067726 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp1081648 
End bp1083072 
Gene Length1425 bp 
Protein Length474 aa 
Translation table11 
GC content51% 
IMG OID641600404 
Productcryptic 6-phospho-beta-glucosidase 
Protein accessionYP_001723992 
Protein GI170019038 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2723] Beta-glucosidase/6-phospho-beta-glucosidase/beta-galactosidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0301517 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCAGTAT TTCCAGAAAG TTTTTTATGG GGCGGCGCGC TTGCCGCCAA CCAGTCTGAA 
GGTGCGTTCC GTGAAGGTGA CAAAGGTCTG ACCACTGTCG ATATGATCCC ACACGGCGAG
CATCGAATGG CGGTGAAACT GGGGCTGGAA AAACGTTTTC AGTTGCGAGA TGACGAGTTT
TATCCCAGCC ATGAGGCGAC GGATTTTTAT CATCGTTATA AAGAAGATAT CGCCCTGATG
GCAGAGATGG GATTCAAGGT TTTCCGTACC TCAATTGCCT GGAGCCGTCT CTTTCCGCAG
GGCGATGAAA TCACGCCCAA TCAGCAGGGC ATTGCTTTTT ATCGTTCTGT CTTTGAAGAG
TGTAAAAAGT ACGGTATCGA ACCGCTGGTC ACGTTGTGCC ACTTCGATGT GCCGATGCAT
CTGGTCACCG AATATGGCTC CTGGCGTAAC CGCAAGCTGG TGGAGTTTTT CAGCCGCTAC
GCCAGAACCT GCTTTGAAGC ATTTGATGGT CTGGTGAAAT ACTGGCTAAC CTTCAATGAA
ATCAACATTA TGTTGCATAG CCCGTTCTCC GGCGCGGGTC TGGTGTTTGA AGAAGGTGAA
AATCAGGATC AGGTGAAATA TCAGGCCGCG CATCACCAGC TGGTTGCCAG TGCGCTAGCC
ACCAAAATCG CCCATGAGGT TAACCCGCAA AATCAGGTGG GCTGTATGCT GGCGGGCGGT
AACTTCTACC CTTACAGTTG CAAGCCGGAA GATGTCTGGG CGGCGCTGGA GAAAGACCGG
GAAAACCTGT TTTTTATCGA TGTGCAGGCG CGGGGCACGT ATCCGGCTTA CTCTGCCCGC
GTATTCCGCG AAAAAGGGGT AACCATCAAC AAAGCACCGG GCGATGATGA AATCCTGAAA
AACACCGTCG ATTTTGTCTC TTTCAGCTAT TACGCCTCGC GCTGCGCCTC GGCGGAGATG
AACGCCAACA ACAGCAGTGC GGCGAACGTG GTGAAATCGC TGCGTAATCC GTATCTACAG
GTGAGCGACT GGGGCTGGGG AATTGATCCA CTCGGTCTGC GTATCACCAT GAATATGATG
TACGATCGTT ATCAGAAGCC GCTGTTTCTG GTGGAAAACG GCCTGGGCGC AAAAGATGAA
TTTGCTGCCA ATGGCGAGAT TAACGACGAC TATCGCATCA GCTACTTACG CGAACATATC
CGCGCAATGA GCGAAGCGAT TGCAGACGGC ATTCCGCTGA TGGGCTACAC CACATGGGGC
TGTATTGATT TAGTTTCCGC CTCTACGGGT GAAATGAGCA AACGCTACGG CTTTGTCTTT
GTTGACCGTG ACGACGCAGG CAACGGTACG CTGACGCGCA CGCGTAAGAA ATCATTCTGG
TGGTATAAAA AAGTGATTGC CAGTAATGGG GAAGATTTAG AGTAG
 
Protein sequence
MSVFPESFLW GGALAANQSE GAFREGDKGL TTVDMIPHGE HRMAVKLGLE KRFQLRDDEF 
YPSHEATDFY HRYKEDIALM AEMGFKVFRT SIAWSRLFPQ GDEITPNQQG IAFYRSVFEE
CKKYGIEPLV TLCHFDVPMH LVTEYGSWRN RKLVEFFSRY ARTCFEAFDG LVKYWLTFNE
INIMLHSPFS GAGLVFEEGE NQDQVKYQAA HHQLVASALA TKIAHEVNPQ NQVGCMLAGG
NFYPYSCKPE DVWAALEKDR ENLFFIDVQA RGTYPAYSAR VFREKGVTIN KAPGDDEILK
NTVDFVSFSY YASRCASAEM NANNSSAANV VKSLRNPYLQ VSDWGWGIDP LGLRITMNMM
YDRYQKPLFL VENGLGAKDE FAANGEINDD YRISYLREHI RAMSEAIADG IPLMGYTTWG
CIDLVSASTG EMSKRYGFVF VDRDDAGNGT LTRTRKKSFW WYKKVIASNG EDLE