Gene Information |
Locus tag | Ent638_2731 |
Symbol | |
ID | 5114592 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Enterobacter sp. 638 |
Kingdom | Bacteria |
Replicon accession | NC_009436 |
Strand | - |
Start bp | 2947153 |
End bp | 2949471 |
Gene Length | 2319 bp |
Protein Length | 772 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 640492918 |
Product | beta-galactosidase |
Protein accession | YP_001177447 |
Protein GI | 146312373 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1472] Beta-glucosidase-related glycosidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATGAGAG AGTACAACAA CATGAAATGG CTTTGTTCTG TAGCTGTCGC AGTAGGTCTG GCGCTGCAAC CCGCGCTCGC TGAAGATCTG TTTGGCAATC ACCCGTTAAC CCCGGAAGCA CGCGATAAAT TCGTCAACGA ATTGCTCACG AAAATGACGG TCGATGAGAA AATTGGCCAG CTGCGTTTGA TCAGCGTCGG ACCGGATAAC CCGAAAGAGG CGATCCGCGA CATGATTCAG GAGAGTCAGG TCGGGGCGAT TTTTAACACC GTGACCCGTG AAGACATCCG CAAAATGCAG GATCAGGTAA TGCAGCTAAG CCGCCTGAAA ATTCCTCTGT TCTTCGCCTA CGATGTGGTC CACGGCCAGC GTACCGTTTT CCCCATCAGC CTCGGTTTAG CGTCCTCTTT CAATCTGGAT GCGGTCAGAA CCGTTGGGCG TATTTCTGCC TATGAAGCGG CGGATGACGG TCTGAACATG ACCTGGGCGC CAATGGTCGA CGTCTCCCGC GATCCGCGTT GGGGTCGTGC ATCAGAAGGT TTTGGCGAAG ATACCTACCT CACCGCGACC TTGGGTAAAA CCATGGTAGA AGCGATGCAG GGTAAAAGCC CGGCGGATCG CTATTCGGTA ATGACCAGCG TTAAACACTT TGCGGCGTAT GGCGCAGTCG AAGGTGGTAA AGAGTACAAC ACCGTGGATA TGAGTCCGCA GCGTCTCTTC AACGACTACA TGCCGCCGTA CAAAGCCGGG CTGGATGCCG GTAGCGGCGC GGTAATGGTG GCGCTGAACT CTCTGAATGG CACACCGGCG ACCTCAGATT CCTGGCTGCT CAAAGATGTT CTGCGCGATC AGTGGGGCTT TAAAGGCATC ACCGTTTCCG ATCACGGCGC GATCAAAGAG TTGATTAAGC ATGGCGCGGC GTCCGACCCA GAAGACGCGG TACGCGTGGC GCTCAAAGCC GGTATCAACA TGAGCATGAG CGACGAGTAT TACAGCAAAT ATCTGCCCGA TCTGGTGAAA ACCGGCAAGG TCACGATGAC TGAGCTGGAT GACGCCACGC GTCATGTGCT GAATGTGAAA TACGACATGG GCTTGTTTAA CGATCCGTAC AGCCATCTGG GACCGAAAGA TTCCGATCCG GCAGATACCA ACGCGGAAAG TCGTTTGCAC CGCAAAGACG CACGTGAAGT GGCGCGCGAA AGCCTGGTAC TGCTGAAAAA CCGTCTCGAC ACGCTGCCGC TGAAAAAATC CGGCACCATT GCGGTCGTTG GTCCTCTGGC TGACAGCAAA CGCGACGTGA TGGGGAGCTG GTCCGCCGCC GGTGTGGCCG ATCAATCCGT GACCGTGTTG ACGGGGATTA AAAACGCGCT GGGCGAAGAC GGCAAAGTGG TTTATGCCAA AGGCGCGAAC GTCACCAATG ATAAAGACAT TGTGACGTTC CTGAACCAGT ATGAAGAGGC GGTGAAAGTT GATCCGCGTT CTGCACAGGC GATGATCGAC GAAGCCGTCA ACGCGGCGAA ACAGTCTGAC GTGGTGGTTG CAGTCGTCGG TGAAGCGCAA GGCATGGCGC ACGAGGCGTC CAGCCGTACG GATATCACTA TTCCACAAAG TCAGCGCGAC CTAATTACTG CGCTGAAAGC CACCGGCAAA CCGCTGGTGC TGGTGCTGAT GAACGGTCGT CCGCTGGCGC TGGTCAAAGA AGATCAGCAG GCTGACGCGC TGCTGGAAAC CTGGTTTGCG GGTACCGAAG GCGGTAACGC GATTGCTGAT GTGTTGTTTG GCGATTACAA CCCATCGGGC 
AAACTGCCGA TGTCCTTCCC TCGCTCTGTC GGGCAGATCC CGGTGTACTA CAGCCATCTC AATACCGGCC GTCCTTACAA TGCGGATAAG CCAAACAAAT ACACATCGCG CTACTTTGAC GAAGCGAATG GCCCGCTGTA TCCGTTCGGC TATGGTCTGA GCTACACCAC CTTTAACGTT TCTGACGTGA AAATGTCTGC ACCGTCTCTG AAGCGTGACG GAAAAGTGAC GGCCAGTGTG GAAGTGACCA ACACCGGTAA GCGCGAAGGC GCGACGGTCA TCCAGATGTA CGTTCAGGAT GTAACCGCGT CGATGAGCCG CCCAGTGAAA CAGCTGCGTG GCTTCGAAAA AGTGGACCTG AAACCGGGGG AGACGAAAAC CGTCAGCTTC CCGATTGATG TGGACGCGCT GAAGTTCTGG AATCAGCAGA TGAAGTATGA CGCTGAGGCT GGCAAGTTTA ACGTCTTTAT CGGGGTGGAC TCCGCTCGCG TGAATAAAGG CGAGTTCGAA CTGCTGTAA
|
Protein sequence | MMREYNNMKW LCSVAVAVGL ALQPALAEDL FGNHPLTPEA RDKFVNELLT KMTVDEKIGQ LRLISVGPDN PKEAIRDMIQ ESQVGAIFNT VTREDIRKMQ DQVMQLSRLK IPLFFAYDVV HGQRTVFPIS LGLASSFNLD AVRTVGRISA YEAADDGLNM TWAPMVDVSR DPRWGRASEG FGEDTYLTAT LGKTMVEAMQ GKSPADRYSV MTSVKHFAAY GAVEGGKEYN TVDMSPQRLF NDYMPPYKAG LDAGSGAVMV ALNSLNGTPA TSDSWLLKDV LRDQWGFKGI TVSDHGAIKE LIKHGAASDP EDAVRVALKA GINMSMSDEY YSKYLPDLVK TGKVTMTELD DATRHVLNVK YDMGLFNDPY SHLGPKDSDP ADTNAESRLH RKDAREVARE SLVLLKNRLD TLPLKKSGTI AVVGPLADSK RDVMGSWSAA GVADQSVTVL TGIKNALGED GKVVYAKGAN VTNDKDIVTF LNQYEEAVKV DPRSAQAMID EAVNAAKQSD VVVAVVGEAQ GMAHEASSRT DITIPQSQRD LITALKATGK PLVLVLMNGR PLALVKEDQQ ADALLETWFA GTEGGNAIAD VLFGDYNPSG KLPMSFPRSV GQIPVYYSHL NTGRPYNADK PNKYTSRYFD EANGPLYPFG YGLSYTTFNV SDVKMSAPSL KRDGKVTASV EVTNTGKREG ATVIQMYVQD VTASMSRPVK QLRGFEKVDL KPGETKTVSF PIDVDALKFW NQQMKYDAEA GKFNVFIGVD SARVNKGEFE LL
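
The length fields in the record can be cross-checked against the coordinates. The sketch below is a minimal illustration, not part of the database record: it assumes that Start bp and End bp are 1-based inclusive positions and that the CDS ends in a stop codon (the gene sequence shown ends in TAA). The function names `gene_length`, `protein_length`, and `gc_percent` are hypothetical helpers introduced here for illustration.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 2947153
END_BP = 2949471
GENE_LENGTH_BP = 2319
PROTEIN_LENGTH_AA = 772


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS, assuming the last codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases; whitespace between blocks is ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


if __name__ == "__main__":
    # Both checks agree with the values listed in the record.
    print(gene_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Passing the full gene sequence string to `gc_percent` gives a value that can be compared with the GC content field; the space-separated 10-base blocks do not need to be removed first.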