Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_4390 |
Symbol | ebgA |
ID | 6970808 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 4065247 |
End bp | 4068339 |
Gene Length | 3093 bp |
Protein Length | 1030 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 643388112 |
Product | cryptic beta-D-galactosidase subunit alpha |
Protein accession | YP_002272549 |
Protein GI | 209400384 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3250] Beta-galactosidase/beta-glucuronidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 67 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATCGCT GGGAAAACAT TCAGCTCACC CACGAAAACC GACTTGCGCC GCGTGCGTAC TTTTTTTCAT ATGATTCTGT TGCGCAAGCG CGTACCTTTG CCCGCGAAAC GAGCAGCCTG TTTCTGCCCT TAAGCGGTCA GTGGAATTTC CACTTTTTTG ACCATCCGCT GCAAGTGCCA GAAGCCTTCA CCTCTGAGTT AATGGCTGAC TGGGGGCATA TTACCGTCCC CGCCATGTGG CAAATGGAAG GTCACGGCAA ACTGCAATAT ACCGACGAAG GTTTTCCGTT CCCCATCGAT GTGCCGTTTG TTCCCAGCGA TAACCCAACC GGTGCCTATC AACGTATTTT CACCCTCAGT GACGGCTGGC AGGGTAAACA GACGCTGATT AAATTTGACG GCGTCGAAAC CTATTTTGAA GTCTACGTTA ACGGTCAGTA TGTGGGTTTC AGCAAGGGCA GTCGCCTGAC CGCAGAGTTT GACATCAGCG CGATGGTTAA AACCGGCGAC AACCTGTTGT GTGTGCGCGT GATGCAGTGG GCGGACTCTA CCTACGTGGA AGACCAGGAT ATGTGGTGGT CAGCGGGGAT CTTCCGCGAT GTTTATCTGG TCGGAAAACA CCTAACGCAT ATTAACGATT TCACTGTGCG TACCGACTTT GACGAAGCCT ATTGCGATGC CACGCTTTCC TGCGAAGTGG TGCTGGAAAA TCTCGCCGCC TCCCCTGTCG TCACGACGCT GGAATATACC CTGTTTGATG GCGAACGCGT GGTGCACAGC AGCGCCATTG ATCATTTGGC AATTGAAAAA CTGACCAGCG CCAGCTTTGC TTTTACTGTC GAACAGCCGC AGCAATGGTC AGCAGAATCC CCTTATCTTT ACCATCTGGT CATGACGCTG AAAGACGCCA ACGGCAACGT TCTGGAAGTG GTGCCACAAC GCGTTGGCTT CCGTGATATC AAAGTGCGCG ACGGTCTGTT CTGGATCAAT AACCGTTATG TGATGCTGCA CGGCGTCAAC CGTCACGACA ACGATCATCG CAAAGGCCGC GCCGTTGGAA TGGATCGCGT CGAGAAAGAT CTCCAGTTGA TGAAGCAGCA CAACATCAAC TCCGTGCGTA CCGCTCACTA CCCGAACGAT CCGCGTTTTT ACGAACTGTG TGATATCTAC GGCCTGTTTG TGATGGCGGA AACCGACGTC GAATCGCACG GCTTTGCTAA TGTCGGCGAT ATCAGCCGTA TTACCGACGA TCCGCAGTGG GAAAAAGTCT ACGTCGAGCG CATTGTTCGC CATATTCACG CGCAGAAAAA CCATCCGTCG ATCATCATCT GGTCGCTGGG CAATGAATCC GGCTATGGCT GTAACATCCG CGCGATGTAC CACGCAGCGA AGGCGCTGGA TGACACGCGA CTGGTGCATT ACGAAGAAGA TCGCGATGCT GAAGTGGTCG ATATTATTTC CACCATGTAC ACCCGCGTGC CGCTGATGAA TGAGTTTGGT GAATACCCGC ATCCGAAGCC GCGCATCATC TGTGAATATG CTCATGCGAT GGGGAACGGA CCGGGCGGGC TGACGGAGTA CCAGAACGTC TTCTATAAGC ACGATTGTAT TCAGGGACAT TATGTTTGGG AGTGGTGCGA CCACGGGATC CAGGCGCAGG ATGACAACGG CAATGTCTGG TATAAATTCG GCGGCGACTA CGGCGACTAT CCCAACAACT ATAACTTCTG TCTTGATGGT TTGATCTATT CCGATCAGAC GCCGGGACCA GGCCTGAAAG AGTACAAACA GGTTATCGCG CCGGTAAAAA TCCACGCGCT GGATCTGACT CGCGGCGAGC TGAAAGTCGA AAATAAACTG TGGTTTACCA CGCTGGATGA CTACACCCTG CACGCAGAGG TGCGCGCCGA AGGTGAAACG CTCGCGACGC AACAGATTAA ACTGCGCGAC GTTGCGCCGA ACAGCGAAGC CCCCTTGCAG ATCACGCTGC CGCAGCTGGA CGCCCGCGAA GCGTTCCTCA ACATTACGGT GACCAAAGAT TCCCGCACCC GCTACAGCGA AGCCGGGCAT TCTATCGCCA CTTATCAGTT CCCGCTGAAG GAAAACACCG CGCAGCCTGT ACCTTTCGCA CCGAATAACG CCCGTCCGCT GACGCTGGAA GACGATCGTT TGAGCTGTAC TGTTCGCGGC TACAACTTTG CGATCACCTT CTCAAAAACG AGTGGTAAAC CGACATCCTG GCAGGTAAAT GGCGAATCGC TGCTGACCCG CGAGCCAAAG ATCAACTTCT TCAAGCCAAT GATCGACAAC CACAAGCAGG AGTACGAAGG GTTGTGGCAG CCGAATCATT TGCAGATCAT GCAGGAACAT CTGCGCGACT TTGCCGTAGA ACAGAGCGAT GGTGAAGTGT TGATCATCAG CCGCACGGTT ATAGCACCGC CGGTGTTTGA CTTCGGGATG CGCTGTACCT ACATCTGGCG CATCGCTGCA GATGGCCAGG TTAACGTGGC GCTTTCCGGC GAGCGTTACG GCGACTATCC GCACATCATT CCGTGCATCG GTTTCACCAT GGGGATTAAC GGCGAATACG ATCAGGTGGC GTATTACGGT CGTGGACCGG GCGAAAACTA CGCCGACAGC CAGCAGGCTA ACATCATCGA TATCTGGCGC AGCACCGTCG ATGCCATATT CGAGAACTAT CCCTTCCCGC AGAACAACGG CAACCGTCAG CATGTCCGCT GGACGGCACT GACTAACCGC CACGGCAACG GTCTGCTGGT GGTTCCGCAG CGCCCAATTA ACTTCAGCGC CTGGCACTAT ACCCAGGAAA ACATCCACGC TGCCCAGCAC TGTAACGAGC TGCAGCGCAG TGATGACATC ACCCTGAATC TCGACCACCA GCTGCTTGGC CTCGGCTCCA ATTCCTGGGG CAGCGAGGTG CTGGACTCCT GGCGCGTCTG GTTCCGTGAC TTCAGCTACG GCTTTACGTT GCTGCCGGTT TCTGGCGGAG AAGCTACCGC GCAAAGCCTG GCGTCGTATG AGTTCGGCGC AGGGTTCTTT TCCACAAATT TGCACAGCGA GAATAAGCAA TGA
|
Protein sequence | MNRWENIQLT HENRLAPRAY FFSYDSVAQA RTFARETSSL FLPLSGQWNF HFFDHPLQVP EAFTSELMAD WGHITVPAMW QMEGHGKLQY TDEGFPFPID VPFVPSDNPT GAYQRIFTLS DGWQGKQTLI KFDGVETYFE VYVNGQYVGF SKGSRLTAEF DISAMVKTGD NLLCVRVMQW ADSTYVEDQD MWWSAGIFRD VYLVGKHLTH INDFTVRTDF DEAYCDATLS CEVVLENLAA SPVVTTLEYT LFDGERVVHS SAIDHLAIEK LTSASFAFTV EQPQQWSAES PYLYHLVMTL KDANGNVLEV VPQRVGFRDI KVRDGLFWIN NRYVMLHGVN RHDNDHRKGR AVGMDRVEKD LQLMKQHNIN SVRTAHYPND PRFYELCDIY GLFVMAETDV ESHGFANVGD ISRITDDPQW EKVYVERIVR HIHAQKNHPS IIIWSLGNES GYGCNIRAMY HAAKALDDTR LVHYEEDRDA EVVDIISTMY TRVPLMNEFG EYPHPKPRII CEYAHAMGNG PGGLTEYQNV FYKHDCIQGH YVWEWCDHGI QAQDDNGNVW YKFGGDYGDY PNNYNFCLDG LIYSDQTPGP GLKEYKQVIA PVKIHALDLT RGELKVENKL WFTTLDDYTL HAEVRAEGET LATQQIKLRD VAPNSEAPLQ ITLPQLDARE AFLNITVTKD SRTRYSEAGH SIATYQFPLK ENTAQPVPFA PNNARPLTLE DDRLSCTVRG YNFAITFSKT SGKPTSWQVN GESLLTREPK INFFKPMIDN HKQEYEGLWQ PNHLQIMQEH LRDFAVEQSD GEVLIISRTV IAPPVFDFGM RCTYIWRIAA DGQVNVALSG ERYGDYPHII PCIGFTMGIN GEYDQVAYYG RGPGENYADS QQANIIDIWR STVDAIFENY PFPQNNGNRQ HVRWTALTNR HGNGLLVVPQ RPINFSAWHY TQENIHAAQH CNELQRSDDI TLNLDHQLLG LGSNSWGSEV LDSWRVWFRD FSYGFTLLPV SGGEATAQSL ASYEFGAGFF STNLHSENKQ
|
| |