Gene Ent638_1710 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEnt638_1710 
Symbol 
ID5112449 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEnterobacter sp. 638 
KingdomBacteria 
Replicon accessionNC_009436 
Strand
Start bp1859547 
End bp1860902 
Gene Length1356 bp 
Protein Length451 aa 
Translation table11 
GC content52% 
IMG OID640491899 
Product6-phospho-beta-glucosidase 
Protein accessionYP_001176440 
Protein GI146311366 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1486] Alpha-galactosidases/6-phospho-beta-glucosidases, family 4 of glycosyl hydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAACAGA AATTAAAAGT CGTAACGATT GGCGGCGGCA GCAGTTATAC CCCAGAATTG 
CTCGAAGGTT TTCTGAAACG CTATCACGAG TTGCCTGTGA GTGAATTATG GCTGGTGGAC
GTTGAAGAAG GGCAGGAGAA ACTCGATATT ATTTTCGACC TGTGCCAGCG CATGGTGAAA
AAAGCCGGCG TTCCGTTAAC CGTGCATAAA ACACTCGATC GCCGCCTGGC ATTAAAAGAC
GCTGATTTTG TCACAACGCA GCTGCGCGTT GGTCAGCTTA AAGCGCGTGA ACTCGACGAG
CGTATCCCGC TGAGCCACGG TTACTTAGGT CAGGAAACCA ACGGCGCGGG CGGTCTGTTT
AAAGGTCTGC GCACCATTCC GGTGATTTTC GACATCATTA AAGATGTGGA AGAGATTTGC
CCGCAGGCGT GGGTCATTAA CTTTACCAAC CCAGCCGGGA TGGTGACGGA AGCGGTGTAC
CGCCATACCG GTTTTAAACG TTTTATCGGC GTGTGCAACA TTCCGATCGG CATGAAGATG
TTTATTCGCG ATGTGCTGGT CCTGTCCGAC AGTGACGATC TTTCCATCGA GCTGTTTGGT
CTCAACCACA TGGTGTTTAT CAAAGACGTG CTGGTCAACG GTGAGTCGCG CTTCGATGAG
CTGCTGGACG GCGTTGCATC GGGTCGTCTG ACGGCCGGTT CAGTGAAAAA TATCTTCGAT
CTGCCGTTCA GTGAAGGGCT GATTCGTTCT CTGAATCTGC TGCCGTGCTC TTACTTGCTC
TATTACTTCA AGCAGAAAGA GATGCTGGCG ATTGAAATGG GCGAGTACTA CAAAGGCGGC
GCGCGTGCGC AGGTGGTGCA GAAGGTCGAG AAACAGCTGT TTGATTTGTA TAAAGATCCA
GAGTTGAACG TCAAACCTAA AGAGCTTGAG CTGCGCGGCG GGGCGTATTA TTCCGACGCC
GCCTGCGAAG TGATCAACGC GATCTACAAT GATAAGCAAG CTGAACATTA CGTGAACGTA
CCGCATCACG GCCATATCGA TAATATCCCG GCAGACTGGG CGGTGGAGAT GACCTGTATT
CTGGGACGCG GCGGCGCGAC GCCACACGCG CGTATCACCC ATTTTGATGA GAAAGTGATG
GGCTTGATCC ATACCATCAA GGGTTTCGAA GTGGCGGCCA GCCATGCGGC GCTGAGCGGT
GAGTTGAATG ATGTCTTACT GGCGCTGAAT CTCAGTCCGC TGGTGCATTC TGACCGTGAT
GCGGAATTGC TGGCGCGCGA AATGATCCTG GCGCATGAGA AATGGCTGCC TAATTTTGCG
GCGACGATAG AGAAACTCAA ACGCGAACAA CGCTAA
 
Protein sequence
MQQKLKVVTI GGGSSYTPEL LEGFLKRYHE LPVSELWLVD VEEGQEKLDI IFDLCQRMVK 
KAGVPLTVHK TLDRRLALKD ADFVTTQLRV GQLKARELDE RIPLSHGYLG QETNGAGGLF
KGLRTIPVIF DIIKDVEEIC PQAWVINFTN PAGMVTEAVY RHTGFKRFIG VCNIPIGMKM
FIRDVLVLSD SDDLSIELFG LNHMVFIKDV LVNGESRFDE LLDGVASGRL TAGSVKNIFD
LPFSEGLIRS LNLLPCSYLL YYFKQKEMLA IEMGEYYKGG ARAQVVQKVE KQLFDLYKDP
ELNVKPKELE LRGGAYYSDA ACEVINAIYN DKQAEHYVNV PHHGHIDNIP ADWAVEMTCI
LGRGGATPHA RITHFDEKVM GLIHTIKGFE VAASHAALSG ELNDVLLALN LSPLVHSDRD
AELLAREMIL AHEKWLPNFA ATIEKLKREQ R