Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | B21_02895 |
Symbol | ebgA |
ID | 8116552 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli BL21 |
Kingdom | Bacteria |
Replicon accession | NC_012892 |
Strand | + |
Start bp | 3087919 |
End bp | 3091011 |
Gene Length | 3093 bp |
Protein Length | 1030 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 644849083 |
Product | hypothetical protein |
Protein accession | YP_003000656 |
Protein GI | 251786352 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3250] Beta-galactosidase/beta-glucuronidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATCGCT GGGAAAACAT TCAGCTCACC CACGAAAACC GACTTGCGCC GCGTGCGTAC TTTTTTTCAT ATGATTCTGT TGCGCAAGCG CGTACCTTTG CCCGCGAAAC CAGCAGCCTG TTTCTGTCCT TAAGCGGTCA GTGGAATTTC CATTTTTTTG ACCACCCGCT GCAAGTGCCT GAAGCCTTCA CCTCTGAGTT AATGGCTGAC TGGGGGCATA TTACCGTCCC CGCCATGTGG CAAATGGAAG GTCACGGCAA ACTGCAATAT ACCGACGAAG GTTTTCCGTT CCCCATCGAT GTGCCGTTTG TCCCCAGCGA TAACCCAACC GGTGCCTATC AACGTATTTT CACCCTCAGC GACGGCTGGC AGGGTAAACA GACGCTGATT AAATTTGACG GCGTCGAAAC CTATTTTGAA GTCTATGTTA ACGGTCAGTA TGTGGGTTTC AGCAAGGGCA GTCGCCTGAC CGCAGAGTTT GACATCAGCG CGATGGTTAA AACCGGCGAC AACCTGTTGT GTGTGCGCGT GATGCAGTGG GCGGACTCTA CCTACGTGGA AGACCAGGAT ATGTGGTGGT CAGCGGGGAT CTTCCGCGAT GTTTATCTGG TCGGAAAACA CCTAACGCAT ATTAACGATT TCACTGTGCG TACCGACTTT GACGAAGCCT ATTGCGATGC CACGCTTTCC TGCGAAGTGG TGCTGGAAAA TCTCGCCGCC TCCCCTGTCG TCACGACGCT GGAATATACC CTGTTTGATG GCGAACGCGT GGTGCACAGC AGCGCCATTG ATCATTTGGC AATTGAAAAA CTGACCAGCG CCAGCTTTGC TTTTACTGTC GAACAGCCGC AGCAATGGTC AGCAGAATCC CCTTATCTTT ACCATCTGGT CATGACGCTG AAAGACGCCA ACGGCAACGT TCTGGAAGTG GTGCCACAAC GCGTTGGCTT CCGTGATATC AAAGTGCGCG ACGGTCTGTT CTGGATCAAT AACCGTTATG TGATGCTGCA CGGCGTCAAC CGTCACGACA ACGATCATCG CAAAGGCCGC GCCGTTGGAA TGGATCGCGT CGAGAAAGAT CTCCAGTTGA TGAAGCAGCA CAATATCAAC TCCGTGCGTA CCGCTCACTA CCCGAACGAT CCGCGTTTTT ACGAACTGTG TGATATCTAC GGCCTGTTTG TGATGGCGGA AACCGACGTC GAATCGCACG GCTTTGCTAA TGTCGGCGAT ATTAGCCGTA TTACCGACGA TCCGCAGTGG GAAAAAGTCT ACGTCGAGCG CATTGTTCGC CATATCCACG CGCAGAAAAA CCATCCGTCG ATCATCATCT GGTCGCTGGG CAATGAATCC GGCTATGGCT GTAACATCCG CGCGATGTAC CATGCGGCGA AAGCGCTGGA TGACACGCGA CTGGTGCATT ACGAAGAAGA TCGCGATGCT GAAGTGGTCG ATATTATTTC CACCATGTAC ACCCGCGTGC CGCTGATGAA TGAGTTTGGT GAATACCCGC ATCCGAAGCC GCGCATCATC TGTGAATATG CTCATGCGAT GGGGAACGGA CCGGGCGGGC TGACGGAGTA CCAGAACGTC TTCTATAAGC ACGATTGCAT TCAGGGTCAT TATGTCTGGG AGTGGTGCGA CCACGGGATC CAGGCACAGG ACGACCACGG CAATGTCTGG TATAAATTCG GCGGCGACTA CGGCGACTAT CCCAACAACT ATAACTTCTG TCTTGATGGT TTGATCTATT CCGATCAGAC GCCGGGACCG GGCCTGAAAG AGTACAAACA GGTTATCGCG CCGGTAAAAA TCCACGCGCG GGATCTGACT CGCGGCGAGT TGAAAGTCGA AAATAAACTG TGGTTTACCA CGCTTGATGA CTACACCCTG CACGCAGAGG TGCGCGCCGA AGGTGAAACG CTCGCGACGC AGCAGATTAA ACTGCGCGAC GTTGCGCCGA ACAGCGAAGC CCCCTTGCAG ATCACGCTGC CGCAGCTGGA CGCCCGCGAA GCGTTCCTCA ACATTACGGT GACCAAAGAT TCCCGCACCC GCTACAGCGA AGCCGGGCAT TCTATCGCCA CTTATCAGTT CCCGCTGAAG GAAAACACCG CGCAGCCAGT GCCTTTCGCA CCAAATAATG CGCGTCCGCT GACGCTGGAA GACGATCGTT TGAGCTGCAC CGTTCGCGGC TACAACTTCG CGATCACCTT CTCAAAAATG AGTGGCAAAC CGACATCCTG GCAGGTGAAT GGCGAATCGC TGCTGACTCG CGAGCCAAAG ATCAACTTCT TCAAGCCGAT GATCGACAAC CACAAGCAGG AGTACGAAGG GCTGTGGCAA CCGAATCATT TGCAGATCAT GCAGGAACAT CTGCGCGACT TTGCCGTAGA ACAGAGCGAT GGTGAAGTGC TGATCATCAG CCGCACAGTT ATTGCCCCGC CGGTGTTTGA CTTCGGGATG CGCTGCACCT ACATCTGGCG CATCGCTGCC GATGGCCAGG TTAACGTGGC GCTTTCCGGC GAGCGTTACG GCGACTATCC GCACATCATT CCGTGCATCG GTTTCACCAT GGGAATTAAC GGCGAATACG ATCAGGTGGC GTATTACGGT CGTGGACCGG GCGAAAACTA CGCCGACAGC CAGCAGGCTA ACATCATCGA TATCTGGCGC AGCACCGTCG ATGCCATGTT CGAGAACTAT CCCTTCCCGC AGAACAACGG TAACCGTCAG CATGTCCGCT GGACGGCACT GACTAACCGC CACGGTAACG GTCTGCTGGT GGTTCCGCAG CGCCCAATTA ACTTCAGCGC CTGGCACTAT ACCCAGGAAA ACATCCACGC TGCCCAGCAC TGTAACGAGC TGCAGCGCAG TGATGACATC ACCCTGAACC TCGATCACCA GCTGCTTGGC CTCGGCTCCA ACTCCTGGGG CAGCGAGGTG CTGGACTCCT GGCGCGTCTG GTTCCGTGAC TTCAGCTACG GCTTTACGTT GCTGCCGGTT TCTGGCGGAG AAGCTACCGC GCAAAGCCTG GCGTCGTATG AGTTCGGCGC AGGGTTCTTT TCCACGAATT TGCACAGCGA GAATAAGCAA TGA
|
Protein sequence | MNRWENIQLT HENRLAPRAY FFSYDSVAQA RTFARETSSL FLSLSGQWNF HFFDHPLQVP EAFTSELMAD WGHITVPAMW QMEGHGKLQY TDEGFPFPID VPFVPSDNPT GAYQRIFTLS DGWQGKQTLI KFDGVETYFE VYVNGQYVGF SKGSRLTAEF DISAMVKTGD NLLCVRVMQW ADSTYVEDQD MWWSAGIFRD VYLVGKHLTH INDFTVRTDF DEAYCDATLS CEVVLENLAA SPVVTTLEYT LFDGERVVHS SAIDHLAIEK LTSASFAFTV EQPQQWSAES PYLYHLVMTL KDANGNVLEV VPQRVGFRDI KVRDGLFWIN NRYVMLHGVN RHDNDHRKGR AVGMDRVEKD LQLMKQHNIN SVRTAHYPND PRFYELCDIY GLFVMAETDV ESHGFANVGD ISRITDDPQW EKVYVERIVR HIHAQKNHPS IIIWSLGNES GYGCNIRAMY HAAKALDDTR LVHYEEDRDA EVVDIISTMY TRVPLMNEFG EYPHPKPRII CEYAHAMGNG PGGLTEYQNV FYKHDCIQGH YVWEWCDHGI QAQDDHGNVW YKFGGDYGDY PNNYNFCLDG LIYSDQTPGP GLKEYKQVIA PVKIHARDLT RGELKVENKL WFTTLDDYTL HAEVRAEGET LATQQIKLRD VAPNSEAPLQ ITLPQLDARE AFLNITVTKD SRTRYSEAGH SIATYQFPLK ENTAQPVPFA PNNARPLTLE DDRLSCTVRG YNFAITFSKM SGKPTSWQVN GESLLTREPK INFFKPMIDN HKQEYEGLWQ PNHLQIMQEH LRDFAVEQSD GEVLIISRTV IAPPVFDFGM RCTYIWRIAA DGQVNVALSG ERYGDYPHII PCIGFTMGIN GEYDQVAYYG RGPGENYADS QQANIIDIWR STVDAMFENY PFPQNNGNRQ HVRWTALTNR HGNGLLVVPQ RPINFSAWHY TQENIHAAQH CNELQRSDDI TLNLDHQLLG LGSNSWGSEV LDSWRVWFRD FSYGFTLLPV SGGEATAQSL ASYEFGAGFF STNLHSENKQ
|
| |