Gene Information |
Locus tag | EcolC_0624 |
Symbol | ebgA |
ID | 6064632 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli ATCC 8739 |
Kingdom | Bacteria |
Replicon accession | NC_010468 |
Strand | - |
Start bp | 670184 |
End bp | 673276 |
Gene Length | 3093 bp |
Protein Length | 1030 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641600030 |
Product | cryptic beta-D-galactosidase subunit alpha |
Protein accession | YP_001723627 |
Protein GI | 170018673 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3250] Beta-galactosidase/beta-glucuronidase |
TIGRFAM ID | |
| |
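The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes that Start bp and End bp are 1-based, inclusive positions on the replicon and that the CDS ends in a single stop codon. The function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # The terminal stop codon (assumed present) does not encode an amino acid.
    return codons - 1 if has_stop_codon else codons


# Values taken from the record for EcolC_0624 (ebgA).
length_bp = gene_length(670184, 673276)
print(length_bp)                  # 3093
print(protein_length(length_bp))  # 1030
```

Both results agree with the Gene Length (3093 bp) and Protein Length (1030 aa) fields. The strand ("-") does not affect the length calculation, only the direction in which the sequence is read.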
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATCGCT GGGAAAACAT TCAGCTCACC CACGAAAACC GACTTGCGCC GCGTGCGTAC TTTTTTTCAT ATGATTCTGT TGCGCAAGCG CGTACCTTTG CCCGCGAAAC CAGCAGCCTG TTTCTGCCCT TAAGCGGTCA GTGGAATTTC CATTTTTTTG ACCACCCGCT GCAAGTGCCT GAAGCCTTCA CCTCTGAGTT AATGGCTGAC TGGGGGCATA TTACCGTCCC CGCCATGTGG CAAATGGAAG GTCACGGCAA ACTGCAATAT ACCGACGAAG GTTTTCCGTT CCCCATCGAT GTGCCGTTTG TCCCCAGCGA TAACCCAACC GGTGCCTATC AACGTATTTT CACCCTCAGC GACGGCTGGC AGGGTAAACA GACGCTGATT AAATTTGACG GCGTCGAAAC CTATTTTGAA GTCTATGTTA ACGGTCAGTA TGTGGGTTTC AGCAAGGGCA GTCGCCTGAC CGCAGAGTTT GACATCAGCG CGATGGTTAA AACCGGCGAC AACCTGTTGT GTGTGCGCGT GATGCAGTGG GCGGACTCTA CCTACGTGGA AGACCAGGAT ATGTGGTGGT CAGCGGGGAT CTTCCGCGAT GTTTATCTGG TCGGAAAACA ACTAACGCAT ATTAACGATT TCTCCGTGCG TACCGACTTT GACGAAGCCT ATTGCGATGC CACGCTTTCC TGCGAAGTGG TGCTGGAAAA TCTCGCCGCC TCCCCTGTCG TAACGACGCT GGAATATACC CTGTTTGATG GCGAACGCGT GGTGCACAGC AGCGCCATTG ATCATTTGGC AATTGAAAAA CTGACCAGCG CCAGCTTTGC TTTTACTGTC GAACAGCCGC AGCAATGGTC AGCAGAATCC CCTTATCTTT ACCATCTGGT CATGACGCTG AAAGACGCCG ACGGCAACGT TCTGGAAGTG GTACCACAAC GCGTTGGCTT CCGTGATATC AAAGTGCGCG ACGGTCTGTT CTGGATCAAT AACCGTTATG TGATGCTGCA CGGCGTCAAC CGTCACGACA ACGATCATCG CAAAGGCCGC GCCGTTGGAA TGGATCGCGT CGAGAAAGAT CTCCAGTTGA TGAAGCAGCA CAACATCAAC TCCGTGCGTA CCGCTCACTA CCCGAACGAT CCGCGTTTTT ACGAACTGTG TGATATCTAC GGCTTGTTTG TGATGGCGGA AACCGACGTC GAATCGCACG GCTTTGCTAA TGTCGGCGAT ATCAGCCGTA TTACCGACGA TCCGCAGTGG GAAAAAGTCT ACGTCGAGCG CATTGTTCGC CATATCCACG CGCAGAAAAA CCATCCGTCG ATCATCATCT GGTCGCTGGG CAATGAATCC GGCTATGGCT GTAACATCCG CGCGATGTAC CATGCGGCGA AAGCGCTGGA TGACACGCGA CTGGTGCATT ACGAAGAAGA TCGCGATGCT GAAGTGGTCG ATATTATTTC CACCATGTAC ACCCGCGTGC CGCTGATGAA TGAGTTTGGT GAATACCCGC ATCCGAAGCC GCGCATCATC TGTGAATATG CTCATGCGAT GGGGAACGGA CCGGGCGGGC TGACGGAGTA CCAGAACGTC TTCTATAAGC ACGATTGTAT TCAGGGACAT TATGTTTGGG AGTGGTGCGA CCACGGAATC CAGGCGCAGG ATGACAACGG CAATGTCTGG TATAAATTCG GCGGCGACTA CGGCGACTAT CCCAACAACT ATAACTTCTG TCTTGATGGT TTGATCTATT CCGATCAGAC GCCGGGACCA GGCCTGAAAG AGTACAAACA GGTTATCGCG 
CCGGTAAAAA TCCACGCGCT GGATCTGACT CGCGGCGAGC TGAAAGTCGA AAATAAACTG TGGTTTACCA CGCTTGATGA CTACACCCTG CACGCAGAGG TGCGCGCCGA AGGTGAAACG CTCGCAACGC AGCAGATTAA ACTGCGCGAC GTTGCGCCGA ACAGCGAAGC CCCCTTGCAG ATCACGCTGC CGCAGCTGGA CGCCCGCGAA GCGTTCCTCA ACATTACGGT GACCAAAGAT TCCCGCACCC GCTACAGCGA AGCCGGGCAT TCTATCGCCA CTTATCAGTT CCCGCTGAAG GAAAACACCG CGCAGCCAGT GCCTTTCGCA CCAAATAATG CGCGTCCGCT GACGCTGGAA GACGATCGTT TGAGCTGCAC CGTTCGCGGC TACAACTTCG CGATCACCTT CTCAAAAATG AGTGGCAAAC CGACATCCTG GCAGGTAAAT GGCGAGTCGC TGCTGACCCG CGAGCCAAAG ATCAACTTCT TCAAGCCGAT GATCGACAAC CACAAGCAGG AGTACGAAGG GCTGTGGCAA CCGAATCATT TGCAGATCAT GCAGGAACAT CTGCGCGACT TTGCCGTAGA ACAGAGCGAT GGTGAAGTGT TGATCATCAG CCGCACGGTT ATAGCACCGC CGGTGTTTGA CTTCGGGATG CGCTGCACCT ACATCTGGCG CATCGCTGCA GATGGCCAGG TTAACGTGGC GCTTTCCGGC GAGCGTTACG GCGACTATCC GCACATCATT CCGTGCATCG GTTTCACCAT GGGGATTAAC GGCGAATACG ATCAGGTGGC GTATTACGGT CGTGGACCGG GCGAAAACTA CGCCGACAGC CAGCAGGCTA ACATCATCGA TATCTGGCGC AGCACCGTCG ATGCCATGTT CGAGAACTAT CCCTTCCCGC AGAACAACGG CAACCGTCAG CATGTCCGCT GGACGGCACT GACTAACCGC CACGGCAACG GTCTGCTGGT GGTTCCGCAG CGCCCAATTA ACTTCAGCGC CTGGCACTAT ACCCAGGAAA ACATCCACGC TTCCCAGCAC TGTAACGAGC TGCAGCGCAG TGATGACATC ACCCTGAATC TCGACCACCA GCTGCTTGGC CTCGGCTCCA ACTCCTGGGG CAGCGAGGTG CTGGACTCCT GGCGCGTCTG GTTCCGTGAC TTCAGCTACG GCTTTACGTT GCTGCCGGTT TCTGGCGGAG AAGCTACTGC GCAAAGCCTG GCGTCGTATG AGTTCGGCGC AGGGTTCTTT TCCACGAATT TGCACAGCGA GAATAAGCAA TGA
|
Protein sequence | MNRWENIQLT HENRLAPRAY FFSYDSVAQA RTFARETSSL FLPLSGQWNF HFFDHPLQVP EAFTSELMAD WGHITVPAMW QMEGHGKLQY TDEGFPFPID VPFVPSDNPT GAYQRIFTLS DGWQGKQTLI KFDGVETYFE VYVNGQYVGF SKGSRLTAEF DISAMVKTGD NLLCVRVMQW ADSTYVEDQD MWWSAGIFRD VYLVGKQLTH INDFSVRTDF DEAYCDATLS CEVVLENLAA SPVVTTLEYT LFDGERVVHS SAIDHLAIEK LTSASFAFTV EQPQQWSAES PYLYHLVMTL KDADGNVLEV VPQRVGFRDI KVRDGLFWIN NRYVMLHGVN RHDNDHRKGR AVGMDRVEKD LQLMKQHNIN SVRTAHYPND PRFYELCDIY GLFVMAETDV ESHGFANVGD ISRITDDPQW EKVYVERIVR HIHAQKNHPS IIIWSLGNES GYGCNIRAMY HAAKALDDTR LVHYEEDRDA EVVDIISTMY TRVPLMNEFG EYPHPKPRII CEYAHAMGNG PGGLTEYQNV FYKHDCIQGH YVWEWCDHGI QAQDDNGNVW YKFGGDYGDY PNNYNFCLDG LIYSDQTPGP GLKEYKQVIA PVKIHALDLT RGELKVENKL WFTTLDDYTL HAEVRAEGET LATQQIKLRD VAPNSEAPLQ ITLPQLDARE AFLNITVTKD SRTRYSEAGH SIATYQFPLK ENTAQPVPFA PNNARPLTLE DDRLSCTVRG YNFAITFSKM SGKPTSWQVN GESLLTREPK INFFKPMIDN HKQEYEGLWQ PNHLQIMQEH LRDFAVEQSD GEVLIISRTV IAPPVFDFGM RCTYIWRIAA DGQVNVALSG ERYGDYPHII PCIGFTMGIN GEYDQVAYYG RGPGENYADS QQANIIDIWR STVDAMFENY PFPQNNGNRQ HVRWTALTNR HGNGLLVVPQ RPINFSAWHY TQENIHASQH CNELQRSDDI TLNLDHQLLG LGSNSWGSEV LDSWRVWFRD FSYGFTLLPV SGGEATAQSL ASYEFGAGFF STNLHSENKQ
|
| |
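The relationship between the gene sequence and the protein sequence above can be reproduced with a short translation routine. This is a minimal sketch, not the pipeline used to build the record. It assumes the gene sequence is given 5' to 3' on the coding strand, and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ only in permitted start codons). The names `CODON_TABLE`, `clean`, `translate`, and `gc_percent` are hypothetical helpers introduced for illustration.

```python
from itertools import product

# NCBI codon order: first base varies slowest, bases ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * (dna.count("G") + dna.count("C")) / len(dna)


# First three 10-base blocks of the gene sequence in the record.
prefix = "ATGAATCGCT GGGAAAACAT TCAGCTCACC"
print(translate(prefix))  # MNRWENIQLT
```

The output matches the first block of the protein sequence. Passing the full gene sequence to `translate` and `gc_percent` should, under the same assumptions, allow the Protein sequence and GC content fields to be checked in the same way.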