Gene Information |
Locus tag | EcE24377A_3543 |
Symbol | ebgA |
ID | 5587097 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli E24377A |
Kingdom | Bacteria |
Replicon accession | NC_009801 |
Strand | + |
Start bp | 3552288 |
End bp | 3555380 |
Gene Length | 3093 bp |
Protein Length | 1030 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 640927169 |
Product | cryptic beta-D-galactosidase subunit alpha |
Protein accession | YP_001464538 |
Protein GI | 157158539 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3250] Beta-galactosidase/beta-glucuronidase |
TIGRFAM ID | |
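The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the last codon of the gene is an untranslated stop codon (the gene sequence listed in this record ends in TGA); neither convention is stated in the record itself, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 3552288
end_bp = 3555380

# Assumption: coordinates are 1-based and inclusive, so the span
# covers both the start and the end position.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated,
# so the protein is one residue shorter than the codon count.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp)     # 3093, matches "Gene Length"
print(protein_length_aa)  # 1030, matches "Protein Length"
```

Under those assumptions the computed values agree with the Gene Length (3093 bp) and Protein Length (1030 aa) fields.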
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATCGCT GGGAAAACAT TCAGCTCACC CACGAAAACC GACTTGCGCC GCGTGCGTAC TTTTTTTCAT ATGATTCTGT TGCGCAAGCG CGTACCTTTG CCCGCGAAAT CAGCAGCCTG TTTCTGCCCT TAAGCGGTCA GTGGAATTTC CACTTTTTTG ACCATCCGCT GCAAGTACCA GAAGCCTTCA CCTCTGAGTT AATGGCTGAC TGGGGGCATA TTACCGTCCC CGCCATGTGG CAAATGGAAG GTCACGGCAA ACTGCAATAT ACCGACGAAG GTTTTCCGTT CCCCATCGAT GTGCCGTTTG TCCCCAGCGA TAACCCAACC GGTGCCTATC AACGTATTTT CACCCTCAGC GACGGCTGGC AGGGTAAACA GACGCTGATT AAATTTGACG GCGTCGAAAC CTATTTTGAA GTCTATGTTA ACGGTCAGTA TGTGGGTTTC AGCAAGGGCA GTCGCCTGAC CGCAGAGTTT GACATCAGCG CGATGGTTAA AACCGGCGAC AACCTGTTGT GTGTGCGCGT GATGCAGTGG GCGGACTCTA CCTACGTGGA AGACCAGGAT ATGTGGTGGT CAGCGGGGAT CTTCCGCGAT GTTTATCTGG TCGGAAAACA CCTAACGCAT ATTAACGATT TCACTGTGCG TACCGACTTT GACGAAGCCT ATTGCGATGC CACGCTTTCC TGCGAAGTGG TGCTGGAAAA TCTCGCCGCC TCCCCTGTCG TCACGACGCT GGAATATACC CTGTTTGATG GCGAACGCGT GGTGCACAGC AGCGCCATTG ATCATTTGGC AATTGAAAAA CTGACCAGCG CCAGCTTTGC TTTTACTGTC GAACAGCCGC AGCAATGGTC AGCAGAATCC CCTTATCTTT ACCATCTGGT CATGACGCTG AAAGACGCCA ACGGCAACGT TCTGGAAGTG GTGCCACAAC GCGTTGGCTT CCGTGATATC AAAGTGCGCG ACGGTCTGTT CTGGATCAAT AACCGTTATG TGATGCTGCA CGGCGTCAAC CGTCACGACA ACGATCATCG CAAAGGCCGC GCCGTTGGAA TGGATCGCGT CGAGAAAGAT CTCCAGTTGA TGAAGCAGCA CAATATCAAC TCCGTGCGTA CCGCTCACTA CCCGAACGAT CCGCGTTTTT ACGAACTGTG TGATATCTAC GGCTTGTTTG TGATGGCGGA AACCGACGTC GAATCGCACG GCTTTGCTAA TGTCGGCGAT ATCAGCCGTA TTACCGACGA TCCGCAGTGG GAAAAAGTCT ACGTCGAGCG CATTGTTCGC CATATTCACG CGCAGAAAAA CCATCCGTCG ATCATCATCT GGTCGCTGGG CAATGAATCC GGCTATGGCT GTAACATCCG CGCGATGTAC CACGCAGCGA AGGCGCTGGA TGACACGCGA CTGGTGCATT ACGAAGAAGA TCGCGATGCT GAAGTGGTCG ATATTATTTC CACCATGTAC ACCCGCGTGC CGCTGATGAA TGAGTTTGGT GAATACCCGC ATCCGAAGCC GCGCATCATC TGTGAATATG CTCATGCGAT GGGGAACGGA CCAGGCGGGC TGACGGAGTA CCAGAACGTC TTCTATAAGC ACGATTGTAT TCAGGGACAT TATGTTTGGG AGTGGTGCGA CCACGGGATC CAGGCGCAGG ATGACAACGG CAACGTCTGG TATAAATTCG GCGGCGACTA CGGCGACTAT CCCAACAACT ATAACTTCTG TCTTGATGGT TTGATCTATT CCGATCAGAC GCCGGGACCG GGCCTGAAAG AGTACAAACA GGTTATCGCG 
CCGGTAAAAA TCCACGCGCT GGATCTGACT CACGGCGAGC TGAAAGTCGA AAATAAACTG TGGTTTACCA CGCTTGATGA CTACACCCTG CACGCAGAGG TGCGCGTCGA AGGTGAAACG CTCGCGACGC AGCAGATTAA ACTGCGCGAC GTTGCGCCGA ACAGCGAAGC CCCCTTGCAG ATCACGCTGC CGCAGCTGGA CGCCCGCGAA GCGTTCCTCA ACATTACGGT GACCAAAGAT TCCCGCACCC GCTACAGCGA AGCCGGGCAT TCTATCGCCA CTTATCAGTT CCCGCTGAAG GAAAACACCG CGCAGCCAGT GCCTTTCGCA CCAAATAATG CGCGTCCGCT GACGCTGGAA GACGATCGTT TGAGCTGCAC CGTTCGCGGC TACAACTTCG CGATCACCTT CTCAAAAATG AGTGGCAAAC CGACATCCTG GCAGGTGAAT GGCGAATCGC TGCTGACTCG CGAGCCAAAG ATCAACTTCT TCAAGCCGAT GATCGACAAC CACAAGCAGG AGTACGAAGG GCTGTGGCAA CCGAATCATT TGCAGATCAT GCAGGAACAT CTGCGCGACT TTGCCGTAGA ACAGAGCGAT GGTGAAGTGC TGATCATCAG CCGCACAGTT ATTGCCCCGC CGGTGTTTGA CTTCGGGATG CGCTGCACCT ACATCTGGCG CATCGCTGCC GATGGCCAGG TTAACGTGGC GCTTTCCGGC GAGCGTTACG GCGACTATCC GCACATCATT CCGTGCATCG GTTTCACCAT GGGAATTAAC GGCGAATACG ATCAGGTGGC GTATTACGGT CGTGGACCGG GCGAAAACTA CGCCGACAGC CAGCAGGCTA ACATCATCGA TATCTGGCGC AGCACCGTCG ATGCCATGTT CGAGAACTAT CCCTTCCCGC AGAACAACGG CAACCGTCAG CATGTCCGCT GGACGGCACT GACTAACCGC CACGGCAACG GTTTGCTGGT GGTTCCGCAG CGCCCAATTA ACTTCAGCGC CTGGCACTAT ACCCAGGAAA ACATCCACGC TGCCCAGCAC TGTAACGAGC TGCAGCGCAG TGATGACATC ACCCTGAACC TCGACCACCA GCTACTTGGC CTCGGCTCCA ACTCCTGGGG CAGCGAGGTG CTGGACTCCT GGCGCGTCTG GTTCCGTGAC TTCAGCTACG GCTTTACGTT GCTGCCGGTT TCTGGCGGAG AAGCTACCGC GCAAAGCCTG GCGTCGTATG AGTTCGGCGC AGGGTTCTTT TCCACGAATT TGCACAGCGA GAATAAGCAA TGA
|
Protein sequence | MNRWENIQLT HENRLAPRAY FFSYDSVAQA RTFAREISSL FLPLSGQWNF HFFDHPLQVP EAFTSELMAD WGHITVPAMW QMEGHGKLQY TDEGFPFPID VPFVPSDNPT GAYQRIFTLS DGWQGKQTLI KFDGVETYFE VYVNGQYVGF SKGSRLTAEF DISAMVKTGD NLLCVRVMQW ADSTYVEDQD MWWSAGIFRD VYLVGKHLTH INDFTVRTDF DEAYCDATLS CEVVLENLAA SPVVTTLEYT LFDGERVVHS SAIDHLAIEK LTSASFAFTV EQPQQWSAES PYLYHLVMTL KDANGNVLEV VPQRVGFRDI KVRDGLFWIN NRYVMLHGVN RHDNDHRKGR AVGMDRVEKD LQLMKQHNIN SVRTAHYPND PRFYELCDIY GLFVMAETDV ESHGFANVGD ISRITDDPQW EKVYVERIVR HIHAQKNHPS IIIWSLGNES GYGCNIRAMY HAAKALDDTR LVHYEEDRDA EVVDIISTMY TRVPLMNEFG EYPHPKPRII CEYAHAMGNG PGGLTEYQNV FYKHDCIQGH YVWEWCDHGI QAQDDNGNVW YKFGGDYGDY PNNYNFCLDG LIYSDQTPGP GLKEYKQVIA PVKIHALDLT HGELKVENKL WFTTLDDYTL HAEVRVEGET LATQQIKLRD VAPNSEAPLQ ITLPQLDARE AFLNITVTKD SRTRYSEAGH SIATYQFPLK ENTAQPVPFA PNNARPLTLE DDRLSCTVRG YNFAITFSKM SGKPTSWQVN GESLLTREPK INFFKPMIDN HKQEYEGLWQ PNHLQIMQEH LRDFAVEQSD GEVLIISRTV IAPPVFDFGM RCTYIWRIAA DGQVNVALSG ERYGDYPHII PCIGFTMGIN GEYDQVAYYG RGPGENYADS QQANIIDIWR STVDAMFENY PFPQNNGNRQ HVRWTALTNR HGNGLLVVPQ RPINFSAWHY TQENIHAAQH CNELQRSDDI TLNLDHQLLG LGSNSWGSEV LDSWRVWFRD FSYGFTLLPV SGGEATAQSL ASYEFGAGFF STNLHSENKQ
|
| |