Gene Ent638_2731 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEnt638_2731 
Symbol 
ID5114592 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEnterobacter sp. 638 
KingdomBacteria 
Replicon accessionNC_009436 
Strand
Start bp2947153 
End bp2949471 
Gene Length2319 bp 
Protein Length772 aa 
Translation table11 
GC content55% 
IMG OID640492918 
Productbeta-galactosidase 
Protein accessionYP_001177447 
Protein GI146312373 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1472] Beta-glucosidase-related glycosidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATGAGAG AGTACAACAA CATGAAATGG CTTTGTTCTG TAGCTGTCGC AGTAGGTCTG 
GCGCTGCAAC CCGCGCTCGC TGAAGATCTG TTTGGCAATC ACCCGTTAAC CCCGGAAGCA
CGCGATAAAT TCGTCAACGA ATTGCTCACG AAAATGACGG TCGATGAGAA AATTGGCCAG
CTGCGTTTGA TCAGCGTCGG ACCGGATAAC CCGAAAGAGG CGATCCGCGA CATGATTCAG
GAGAGTCAGG TCGGGGCGAT TTTTAACACC GTGACCCGTG AAGACATCCG CAAAATGCAG
GATCAGGTAA TGCAGCTAAG CCGCCTGAAA ATTCCTCTGT TCTTCGCCTA CGATGTGGTC
CACGGCCAGC GTACCGTTTT CCCCATCAGC CTCGGTTTAG CGTCCTCTTT CAATCTGGAT
GCGGTCAGAA CCGTTGGGCG TATTTCTGCC TATGAAGCGG CGGATGACGG TCTGAACATG
ACCTGGGCGC CAATGGTCGA CGTCTCCCGC GATCCGCGTT GGGGTCGTGC ATCAGAAGGT
TTTGGCGAAG ATACCTACCT CACCGCGACC TTGGGTAAAA CCATGGTAGA AGCGATGCAG
GGTAAAAGCC CGGCGGATCG CTATTCGGTA ATGACCAGCG TTAAACACTT TGCGGCGTAT
GGCGCAGTCG AAGGTGGTAA AGAGTACAAC ACCGTGGATA TGAGTCCGCA GCGTCTCTTC
AACGACTACA TGCCGCCGTA CAAAGCCGGG CTGGATGCCG GTAGCGGCGC GGTAATGGTG
GCGCTGAACT CTCTGAATGG CACACCGGCG ACCTCAGATT CCTGGCTGCT CAAAGATGTT
CTGCGCGATC AGTGGGGCTT TAAAGGCATC ACCGTTTCCG ATCACGGCGC GATCAAAGAG
TTGATTAAGC ATGGCGCGGC GTCCGACCCA GAAGACGCGG TACGCGTGGC GCTCAAAGCC
GGTATCAACA TGAGCATGAG CGACGAGTAT TACAGCAAAT ATCTGCCCGA TCTGGTGAAA
ACCGGCAAGG TCACGATGAC TGAGCTGGAT GACGCCACGC GTCATGTGCT GAATGTGAAA
TACGACATGG GCTTGTTTAA CGATCCGTAC AGCCATCTGG GACCGAAAGA TTCCGATCCG
GCAGATACCA ACGCGGAAAG TCGTTTGCAC CGCAAAGACG CACGTGAAGT GGCGCGCGAA
AGCCTGGTAC TGCTGAAAAA CCGTCTCGAC ACGCTGCCGC TGAAAAAATC CGGCACCATT
GCGGTCGTTG GTCCTCTGGC TGACAGCAAA CGCGACGTGA TGGGGAGCTG GTCCGCCGCC
GGTGTGGCCG ATCAATCCGT GACCGTGTTG ACGGGGATTA AAAACGCGCT GGGCGAAGAC
GGCAAAGTGG TTTATGCCAA AGGCGCGAAC GTCACCAATG ATAAAGACAT TGTGACGTTC
CTGAACCAGT ATGAAGAGGC GGTGAAAGTT GATCCGCGTT CTGCACAGGC GATGATCGAC
GAAGCCGTCA ACGCGGCGAA ACAGTCTGAC GTGGTGGTTG CAGTCGTCGG TGAAGCGCAA
GGCATGGCGC ACGAGGCGTC CAGCCGTACG GATATCACTA TTCCACAAAG TCAGCGCGAC
CTAATTACTG CGCTGAAAGC CACCGGCAAA CCGCTGGTGC TGGTGCTGAT GAACGGTCGT
CCGCTGGCGC TGGTCAAAGA AGATCAGCAG GCTGACGCGC TGCTGGAAAC CTGGTTTGCG
GGTACCGAAG GCGGTAACGC GATTGCTGAT GTGTTGTTTG GCGATTACAA CCCATCGGGC
AAACTGCCGA TGTCCTTCCC TCGCTCTGTC GGGCAGATCC CGGTGTACTA CAGCCATCTC
AATACCGGCC GTCCTTACAA TGCGGATAAG CCAAACAAAT ACACATCGCG CTACTTTGAC
GAAGCGAATG GCCCGCTGTA TCCGTTCGGC TATGGTCTGA GCTACACCAC CTTTAACGTT
TCTGACGTGA AAATGTCTGC ACCGTCTCTG AAGCGTGACG GAAAAGTGAC GGCCAGTGTG
GAAGTGACCA ACACCGGTAA GCGCGAAGGC GCGACGGTCA TCCAGATGTA CGTTCAGGAT
GTAACCGCGT CGATGAGCCG CCCAGTGAAA CAGCTGCGTG GCTTCGAAAA AGTGGACCTG
AAACCGGGGG AGACGAAAAC CGTCAGCTTC CCGATTGATG TGGACGCGCT GAAGTTCTGG
AATCAGCAGA TGAAGTATGA CGCTGAGGCT GGCAAGTTTA ACGTCTTTAT CGGGGTGGAC
TCCGCTCGCG TGAATAAAGG CGAGTTCGAA CTGCTGTAA
 
Protein sequence
MMREYNNMKW LCSVAVAVGL ALQPALAEDL FGNHPLTPEA RDKFVNELLT KMTVDEKIGQ 
LRLISVGPDN PKEAIRDMIQ ESQVGAIFNT VTREDIRKMQ DQVMQLSRLK IPLFFAYDVV
HGQRTVFPIS LGLASSFNLD AVRTVGRISA YEAADDGLNM TWAPMVDVSR DPRWGRASEG
FGEDTYLTAT LGKTMVEAMQ GKSPADRYSV MTSVKHFAAY GAVEGGKEYN TVDMSPQRLF
NDYMPPYKAG LDAGSGAVMV ALNSLNGTPA TSDSWLLKDV LRDQWGFKGI TVSDHGAIKE
LIKHGAASDP EDAVRVALKA GINMSMSDEY YSKYLPDLVK TGKVTMTELD DATRHVLNVK
YDMGLFNDPY SHLGPKDSDP ADTNAESRLH RKDAREVARE SLVLLKNRLD TLPLKKSGTI
AVVGPLADSK RDVMGSWSAA GVADQSVTVL TGIKNALGED GKVVYAKGAN VTNDKDIVTF
LNQYEEAVKV DPRSAQAMID EAVNAAKQSD VVVAVVGEAQ GMAHEASSRT DITIPQSQRD
LITALKATGK PLVLVLMNGR PLALVKEDQQ ADALLETWFA GTEGGNAIAD VLFGDYNPSG
KLPMSFPRSV GQIPVYYSHL NTGRPYNADK PNKYTSRYFD EANGPLYPFG YGLSYTTFNV
SDVKMSAPSL KRDGKVTASV EVTNTGKREG ATVIQMYVQD VTASMSRPVK QLRGFEKVDL
KPGETKTVSF PIDVDALKFW NQQMKYDAEA GKFNVFIGVD SARVNKGEFE LL