Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcHS_A3258 |
Symbol | ebgA |
ID | 5592544 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | + |
Start bp | 3266077 |
End bp | 3269169 |
Gene Length | 3093 bp |
Protein Length | 1030 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 640922375 |
Product | cryptic beta-D-galactosidase subunit alpha |
Protein accession | YP_001459870 |
Protein GI | 157162552 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3250] Beta-galactosidase/beta-glucuronidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 52 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATCGCT GGGAAAACAT TCAGCTCACC CACGAAAACC GACTTGCGCC GCGTGCGTAC TTTTTTTCAT ATGATTCTGT TGCACAGGCG CGTACCTTTG CCCGCGAAAC CAGCAGCCTG TTTCTGCCCT TAAGCGGTCA GTGGAATTTC CATTTTTTTG ACCACCCGCT GCAAGTGCCT GAAGCCTTCA CCTCTGAGTT AATGGCTGAC TGGGGGCATA TTACCGTCCC CGCCATGTGG CAAATGGAAG GTCACGGCAA ACTGCAATAT ACCGACGAAG GTTTTCCGTT CCCCATCGAT GTGCCGTTTG TCCCCAGCGA TAACCCAACC GGTGCCTATC AACGTATTTT CACCCTCAGC GACGGCTGGC AGGGTAAACA GACGCTGATT AAATTTGACG GCGTCGAAAC CTATTTTGAA GTCTATGTTA ACGGTCAGTA TGTGGGTTTC AGCAAGGGCA GTCGCCTGAC CGCAGAGTTT GACATCAGCG CGATGGTTAA AACCGGCGAC AACCTGTTGT GTGTGCGCGT GATGCAGTGG GCGGACTCTA CCTACGTGGA AGACCAGGAT ATGTGGTGGT CGGCGGGGAT CTTCCGCGAT GTTTATCTGG TCGGAAAACA CCTAACGCAT ATTAACGATT TCACTGTGCG TACCGACTTT GACGAAGCCT ATTGCGATGC CACGCTTTCC TACGAAGTGG TGCTGGAAAA TCTCGCAGCC TCCCCTGTCG TAACGACGCT GGAATATACC CTGTTTGATG GCGAACGCGT GGTGCACAGC AGCACCATTG ATCATTTGGC AATTGAAAAA CTGACCAGCG CCAGCTTTGC TTTTACTGTC GAACAGCCCC AGCAATGGTC AGCAGAATCC CCTTATCTTT ACCATCTGGT CATGACGCTG AAAGACGCCG ACGGCAACGT TCTGGAAGTG GTACCACAAC GCGTTGGCTT CCGTGATATC AAAGTGCGCG ACGGTCTGTT CTGGATCAAT AACCGTTATG TGATGCTGCA CGGCGTCAAC CGTCACGACA ACGATCATCG CAAAGGCCGC GCCGTTGGAA TGGATCGCGT CGAGAAAGAT CTCCAGTTGA TGAAGCAGCA CAATATCAAC TCCGTGCGTA CCGCTCACTA CCCGAACGAT CCGCGTTTTT ACGAACTGTG TGATATCTAC GGCTTGTTTG TGATGGCGGA AACCGACGTC GAATCGCACG GCTTTGCTAA TGTCGGCGAT ATCAGCCGTA TTACCGACGA TCCGCAGTGG GAAAAAGTCT ACGTCGAGCG CATTGTTCGC CATATTCACG CGCAGAAAAA CCATCCGTCG ATCATCATCT GGTCGCTGGG CAATGAATCC GGCTATGGCT GTAACATCCG CGCGATGTAC CATGCGGCTA AAGCGCTGGA TGACACGCGA CTGGTGCATT ACGAAGAAGA TCGCGATGCT GAAGTGGTCG ATATTATTTC CACCATGTAC ACCCGCGTGC CGCTGATGAA TGAGTTTGGT GAATACCCGC ATCCGAAGCC GCGCATCATC TGTGAATATG CTCATGCGAT GGGGAACGGA CCGGGCGGGC TGACGGAGTA CCAGAACGTC TTCTATAAGC ACGATTGTAT TCAGGGACAT TATGTCTGGG AATGGTGCGA CCACGGGATC CAGGCGCAGG ATGACAACGG CAATGTCTGG TATAAATTCG GCGGCGACTA CGGCGACTAT CCCAACAACT ATAACTTCTG TCTTGATGGT TTGATCTATT CCGATCAGAC GCCGGGACCA GGCCTGAAAG AGTACAAACA GGTTATCGCG CCGGTAAAAA TCCACGCGCT GGATCTGACT CGCGGCGAGC TGAAAGTCGA AAATAAACTG TGGTTTACCA CGCTTGATGA CTACACCCTG CACGCAGAGG TGCGCGCCGA AGGTGAAACG CTCGCAACGC AGCAGATTAA ACTGCGCGAC GTTGCGCCGA ACAGCGAAGC CCCCTTGCAG ATCACGCTGC CGCAGCTGGA CGCCCGCGAA GCGTTCCTCA ACATTACGGT GACCAAAGAT TCCCGCACCC GCTACAGCGA AGCCGGGCAT TCTATCGCCA CTTATCAGTT CCCGCTGAAG GAAAACACCG CGCAGCCAGT GCCTTTCGCA CCAAATAATG CGCGTCCGCT GACGCTGGAA GACGATCGTT TGAGCTGCAC CGTTCGCGGC TACAACTTCG CGATCACCTT CTCAAAAATG AGTGGCAAAC CGACATCCTG GCAGGTAAAT GGCGAGTCGC TGCTGACTCG CGAGCCAAAG ATCAACTTCT TCAAGCCGAT GATCGACAAC CACAAGCAGG AGTACGAAGG GCTGTGGCAG CCGAATCATT TGCAGATCAT GCAGGAACAT CTGCGCGACT TTGCCGTAGA ACAGAGCGAT GGTGAAGTGT TGATCATCAG CCGCACGGTT ATAGCACCGC CGGTGTTTGA CTTCGGGATG CGCTGCACCT ACATCTGGCG CATCGCTGCC GATGGCCAGG TTAACGTGGC GCTTTCCGGC GAGCGTTACG GCGACTATCC GCACATCATT CCGTGCATCG GTTTCACCAT GGGGATTAAC GGCGAATACG ATCAGGTGGC GTATTACGGT CGTGGACCGG GCGAAAACTA CGCCGACAGC CAGCAGGCTA ACATCATCGA TATCTGGCGC AGCACCGTCG ATGCCATGTT CGAGAACTAT CCCTTCCCGC AGAACAACGG CAACCGTCAG CATGTCCGCT GGACGGCACT GACTAACCGC CACGGCAACG GTCTGCTGGT GGTTCCGCAG CGCCCAATTA ACTTCAGCGC CTGGCACTAT CCCCAGGAAA ACATCCACGC TGCCCAGCAC TGTAACGAGC TGCAGCGCAG TGATGACATC ACCCTGAATC TCGACCACCA GCTGCTTGGC CTCGGCTCCA ACTCCTGGGG CAGCGAGGTG CTGGACTCCT GGCGCGTCTG GTTCCGTGAC TTCAGCTACG GCTTTACGTT GCTGCCGGTT TCTGGCGGAG AAGCTACCGC GCAAAGCCTG GCGTCGTATG AGTTCGGCGC AGGGTTCTTT TCCACAAATT TGCACAGCGA GAATAAGCAA TGA
|
Protein sequence | MNRWENIQLT HENRLAPRAY FFSYDSVAQA RTFARETSSL FLPLSGQWNF HFFDHPLQVP EAFTSELMAD WGHITVPAMW QMEGHGKLQY TDEGFPFPID VPFVPSDNPT GAYQRIFTLS DGWQGKQTLI KFDGVETYFE VYVNGQYVGF SKGSRLTAEF DISAMVKTGD NLLCVRVMQW ADSTYVEDQD MWWSAGIFRD VYLVGKHLTH INDFTVRTDF DEAYCDATLS YEVVLENLAA SPVVTTLEYT LFDGERVVHS STIDHLAIEK LTSASFAFTV EQPQQWSAES PYLYHLVMTL KDADGNVLEV VPQRVGFRDI KVRDGLFWIN NRYVMLHGVN RHDNDHRKGR AVGMDRVEKD LQLMKQHNIN SVRTAHYPND PRFYELCDIY GLFVMAETDV ESHGFANVGD ISRITDDPQW EKVYVERIVR HIHAQKNHPS IIIWSLGNES GYGCNIRAMY HAAKALDDTR LVHYEEDRDA EVVDIISTMY TRVPLMNEFG EYPHPKPRII CEYAHAMGNG PGGLTEYQNV FYKHDCIQGH YVWEWCDHGI QAQDDNGNVW YKFGGDYGDY PNNYNFCLDG LIYSDQTPGP GLKEYKQVIA PVKIHALDLT RGELKVENKL WFTTLDDYTL HAEVRAEGET LATQQIKLRD VAPNSEAPLQ ITLPQLDARE AFLNITVTKD SRTRYSEAGH SIATYQFPLK ENTAQPVPFA PNNARPLTLE DDRLSCTVRG YNFAITFSKM SGKPTSWQVN GESLLTREPK INFFKPMIDN HKQEYEGLWQ PNHLQIMQEH LRDFAVEQSD GEVLIISRTV IAPPVFDFGM RCTYIWRIAA DGQVNVALSG ERYGDYPHII PCIGFTMGIN GEYDQVAYYG RGPGENYADS QQANIIDIWR STVDAMFENY PFPQNNGNRQ HVRWTALTNR HGNGLLVVPQ RPINFSAWHY PQENIHAAQH CNELQRSDDI TLNLDHQLLG LGSNSWGSEV LDSWRVWFRD FSYGFTLLPV SGGEATAQSL ASYEFGAGFF STNLHSENKQ
|
| |