Gene EcolC_0808 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_0808 
Symbol 
ID6065927 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp867643 
End bp869076 
Gene Length1434 bp 
Protein Length477 aa 
Translation table11 
GC content51% 
IMG OID641600213 
Productglycoside hydrolase family protein 
Protein accessionYP_001723807 
Protein GI170018853 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2723] Beta-glucosidase/6-phospho-beta-glucosidase/beta-galactosidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.785665 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones34 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAAAAAAC TCACCTTACC GAAAGATTTC TTATGGGGCG GCGCAGTTGC CGCTCATCAG 
GTCGAAGGCG GCTGGAACAA AGGCGGAAAA GGGCCGAGCA TTTGTGACGT TCTGACCGGT
GGCGCACACG GCGTGCCGCG CGAAATCACC AAAGAAGTCT TGCCAGGAAA ATACTATCCA
AACCATGAAG CCGTTGATTT TTATGGTCAC TATAAGGAAG ACATCAAGCT ATTTGCCGAA
ATGGGCTTCA AATGTTTTCG TACATCCATT GCCTGGACGC GCATTTTTCC AAAAGGCGAT
GAAGCTCAGC CAAACGAAGA AGGGCTGAAG TTCTACGATG ATATGTTCGA TGAACTGCTG
AAATACAACA TCGAACCGGT GATCACCCTC TCCCACTTTG AAATGCCGCT GCATCTGGTG
CAGCAATACG GTAGCTGGAC CAACCGTAAA GTGGTTGATT TCTTTGTACG TTTCGCGGAA
GTGGTATTTG AACGCTATAA GCACAAAGTC AAATACTGGA TGACCTTCAA CGAAATTAAC
AACCAGCGTA ACTGGCGTGC ACCGCTGTTC GGTTACTGCT GCTCCGGCGT GGTGTATACC
GAGCATGAAA ACCCGGAAGA GACGATGTAT CAGGTGCTGC ATCACCAGTT TGTCGCCAGC
GCCCTGGCGG TGAAAGCTGC GCGTCGCATT AACCCGGAGA TGAAAGTCGG CTGTATGCTG
GCGATGGTGC CGCTCTATCC TTACTCCTGT AACCCGGACG ATGTGATGTT CGCTCAGGAG
TCGATGCGCG AACGCTACGT CTTTACCGAT GTGCAGCTAC GCGGCTATTA CCCGTCCTAT
GTGTTGAACG AGTGGGAGCG TCGCGGATTT AACATCAAAA TGGAAGACGG CGATCTGGAT
GTGCTGCGTG AAGGCACCTG CGATTATCTT GGTTTCAGCT ATTACATGAC CAATGCAGTG
AAGGCCGAAG GCGGCACCGG CGATGCGATC TCTGGTTTTG AAGGCAGCGT ACCAAACCCG
TATGTTAAAG CATCTGACTG GGGCTGGCAG ATTGATCCAG TAGGTCTGCG CTATGCACTT
TGCGAACTGT ATGAGCGTTA TCAGAGGCCG CTGTTTATTG TCGAAAACGG TTTTGGCGCT
TACGACAAAG TGGAAGAAGA TGGCAGCATC AACGACGACT ACCGCATTGA CTACCTGCGC
GCCCATATCG AAGAGATGAA AAAAGCGGTG ACTTACGATG GCGTGGATCT GATGGGCTAC
ACACCGTGGG GCTGCATCGA CTGCGTGTCG TTCACCACCG GGCAGTACAG CAAACGCTAC
GGCTTTATCT ATGTGAATAA ACATGACGAC GGTACTGGCG ATATGTCGCG TTCACGTAAG
AAGAGCTTTA ACTGGTACAA AGAGGTGATT GCCAGCAACG GCGAGAAGCT TTAA
 
Protein sequence
MKKLTLPKDF LWGGAVAAHQ VEGGWNKGGK GPSICDVLTG GAHGVPREIT KEVLPGKYYP 
NHEAVDFYGH YKEDIKLFAE MGFKCFRTSI AWTRIFPKGD EAQPNEEGLK FYDDMFDELL
KYNIEPVITL SHFEMPLHLV QQYGSWTNRK VVDFFVRFAE VVFERYKHKV KYWMTFNEIN
NQRNWRAPLF GYCCSGVVYT EHENPEETMY QVLHHQFVAS ALAVKAARRI NPEMKVGCML
AMVPLYPYSC NPDDVMFAQE SMRERYVFTD VQLRGYYPSY VLNEWERRGF NIKMEDGDLD
VLREGTCDYL GFSYYMTNAV KAEGGTGDAI SGFEGSVPNP YVKASDWGWQ IDPVGLRYAL
CELYERYQRP LFIVENGFGA YDKVEEDGSI NDDYRIDYLR AHIEEMKKAV TYDGVDLMGY
TPWGCIDCVS FTTGQYSKRY GFIYVNKHDD GTGDMSRSRK KSFNWYKEVI ASNGEKL