Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Spro_1973 |
Symbol | ebgA |
ID | 5603418 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Serratia proteamaculans 568 |
Kingdom | Bacteria |
Replicon accession | NC_009832 |
Strand | + |
Start bp | 2158498 |
End bp | 2161590 |
Gene Length | 3093 bp |
Protein Length | 1030 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 640937511 |
Product | cryptic beta-D-galactosidase subunit alpha |
Protein accession | YP_001478204 |
Protein GI | 157370215 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3250] Beta-galactosidase/beta-glucuronidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.75541 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.00919107 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGAACAATT GGGAAAACAT AGAAAAACAG TCGGAGAATC GCTTGCCGGC GCGGGCGAGT TTCTTCAGCT ATGCGGATGC CAGGCAGGCA TTGAGTCTTG ATCGCAACGC AAGTTTGGGC TTCCAGTTGC TGAGCGGGCG TTGGCAGTTT CGCTACTTCG AACACCCGGA CCTGGTGCCG GAGGCGTTTT ACCGCCAGCC GATGGCGGAA TGGGGCGAAA TCACCGTGCC GGGCATGTGG CAGATGGAAG GGCATGGTCA ACTGCAGTAT ACCGACGAAG GTTACCCATT CCCGATCGAC GTGCCTTATG TGCCGACCAA TAATCCTGCC GGGGCCTATC AGCGCCTTTT TACTCTCGAT AACGAGTGGC TGGATCAACA GGTCATCATC AAATTTGACG GCGTGGAAAC CTATTTTGAG GTTTACCTCA ATGGTCACTA CGTCGGTTTT AGTAAAGGCA GCCGTCTGTC AGCCGAGTTT GATCTCAGCG ACTATCTGCA AGCGGGGGAC AACCTGCTGT CCGTTCGCGT GCTGCAATGG GCCGATTCTA CCTATATCGA AGATCAGGAC ATGTGGTGGA TGGCCGGTAT CTTCCGCGAT GTTTACCTGA TCGGCCAGCA ACGCACGCAT ATTCATGATC TTACTTTGGT TACCACCTTT GACGAGCAGT ATTGCGATGC GCTGCTGGCG ATAGACGTCG AGTTGCAGCA TCTGGGGCAA GGCGTGGCTG GCGGTTATCG TCTGCAGGCG CAGCTCTTTG ACGGTGAAGC CTGCGTCGGT GAGCTGTGGG CGAACGAACT GAGCATTGGG GCGGGTGCCA CCTGCCGGTT TGAAATTCCG GTCAGCCAAC CCAGGCAGTG GAATGCGGAA GATCCCTATC TGTATCAACT GTTGCTCAGT CTGCGCGACG GCGATGGCAA TCTGTTATCG GTAGTGCCGC AGCGGGTGGG TTTCCGTGAA ATTAGCGTGC GCGATGGGCT ATTCCACATC AACGGGCGCT ACCTGAAACT GCACGGCGTT AACCGCCATG ACCACGATCA TCGCAAAGGT CGGGCGGTTG ATATGGCGCG GGTTGAGCGC GACATCGTGC TGATGAAACA GCACAACATC AACTCGGTGC GCACCGCCCA TTATCCGAAC GATCCGCGTT TTTACGAGCT GTGCGATATC TACGGCCTGT TTGTGATGGC GGAAACCGAT CTGGAAAGCC ACGGGTTTGC CAACGTTGGC GATATCAGCC GCATTACCGA CGATCCACGT TGGGAAAACG CCTACGTAGA ACGCATTGAA CGCCATGTGA AGGCGCAGAA AAACCATCCT TCAATCATTA TCTGGTCACT GGGCAACGAG TCTGGCTATG GCTGCAACAT TCGCGCCATG GCCAAACGCT GTAAGGCGCT CGATGCCACC CGACTGGTGC ATTATGAAGA AGACCGGGAT GCGGAAGTGG TCGATGTGAT CAGCACCATG TATTCCCGCG TGGCGATGAT GAATGCGTTC GGTGAGTATC CTCACCCCAA ACCCCGGATC CTGTGTGAAT ACGCTCATGC TATGGGCAAT GGGCCGGGCG GACTGTTCGA ATATCAGTCG GTGTTTAACC GCCATGCCAG CCTGCAGGGA CATTACATCT GGGAATGGTG CGATCACGGT TTGCTCAGCC ATGACCAGCA GGGGCGGGAA CGTTATCAGT ACGGCGGCGA TTACGGCGAT TACCCGAATA ACTATAACTT CTGCATGGAT GGCCTGATCT ATCCGGATCA ACGTCCTGGT CCCGGCCTGC GTGAATATAA ACAGGTGCTG TGTCCGGTCG AGGTGAGCGG CGTGGGCGAG AAAACGTCGG TATTACGGGT CAAAAACCGC TATTGGTTCA GCTCGCTGGC CGATATTACG CTGAAGGTCA GCGTTAAGGC CGGGGGGCGG CAACTGACCG GCTATGAGAT TAAGCTGCCG CACCTGCAGC CGGGGGAATC AGAAGAGGTC CATTTGCCTG CATTGGCGCT GGGCGCGGAA GAAACCTTCA TCGACGTCGA AGTGTATAAA GACAGCGCCA CCCGTTACAG CGAAAGCGGA GATCTGCTGG GGCAGTATCA ACATCTGTTG CAGCCCGCCA CTGCGTTAAT TAGCCCACCG GAGTCCATTC CTGTTTCTGC GCTGCAATGC GCTGACGAAA ACCATCAAAT GATTGTCAGT GGCGAGAACT TCAGCCTGAC CTTCTCGCGC CTGAGTGGCG AACTGCAGTC GTGGAAAGTG GCAGGAGAGG AGGTGGTGGG CCGCGCACCG CACCTGACTT TCTTCAAACC GGTGATCGAC AACCACAAGC AGGAGTATGA AGGCATTTGG CTGCCGAACC ATCTGCAGAT CATGCAGCAG CATTTCCGTA GCCTGCACTG GGAACTGCAG GGGGATGACG TGGTGATTGA AGTCCGCACC CTGATTGCCC CGCCGGTATT TGATTTTGGC ATGCGTTGCC GTTACCGCTG GCAGATCTCT GCGCAGGGCC ATGTCAGCCT GGATCTGTCG GGCGAGCCTT ACGGCGATTT CCAGCAGGTG ATCCCGAAGA TTGGCCTGGA TTTTGGCCTC AGCCGCCGCT TTGAACAGGT GGAATACTAC GGCCGTGGGC CGGGGGAAAA CTATCAGGAC AGCTGCCAGG CTAACCTGAT CGGCCACTAT CAGCAGCGGG TTGGCGAGCT GTTTGAGCAC TATCCGTTCC CGCAGGATAA CGGCAATCGC CAAGAGGTAC GCTGGCTGAG TCTGCAAGAT GCCAATGGCC ACGGCATCTT TATCCAGCCG CGGCGGCCGA TCAATTTCAG CCTGTGGCCC TACAGTGCCG AGATGCTGCA CCAGGCTCAG CATATTAATG AACTGCAAGA AAGCGACTAC CTGACGTTAA ACCTCGACGA TCAAATTCTG GGGCTGGGCT CCAACTCCTG GGGTTCGGAG GTGCTGGATT CCTATCGGGT TTATCTGTCG TCGTTTAACT ACGGCTTTAC GCTGGTGCCG TTTAACCGGC AGGAAACCGA GGCGGCTACG CTTGCCGGCT ATCGTTTTTC ACCGGCCATT AATAACGCTC AGTCAGAAGA GGCGAACTTA TGA
|
Protein sequence | MNNWENIEKQ SENRLPARAS FFSYADARQA LSLDRNASLG FQLLSGRWQF RYFEHPDLVP EAFYRQPMAE WGEITVPGMW QMEGHGQLQY TDEGYPFPID VPYVPTNNPA GAYQRLFTLD NEWLDQQVII KFDGVETYFE VYLNGHYVGF SKGSRLSAEF DLSDYLQAGD NLLSVRVLQW ADSTYIEDQD MWWMAGIFRD VYLIGQQRTH IHDLTLVTTF DEQYCDALLA IDVELQHLGQ GVAGGYRLQA QLFDGEACVG ELWANELSIG AGATCRFEIP VSQPRQWNAE DPYLYQLLLS LRDGDGNLLS VVPQRVGFRE ISVRDGLFHI NGRYLKLHGV NRHDHDHRKG RAVDMARVER DIVLMKQHNI NSVRTAHYPN DPRFYELCDI YGLFVMAETD LESHGFANVG DISRITDDPR WENAYVERIE RHVKAQKNHP SIIIWSLGNE SGYGCNIRAM AKRCKALDAT RLVHYEEDRD AEVVDVISTM YSRVAMMNAF GEYPHPKPRI LCEYAHAMGN GPGGLFEYQS VFNRHASLQG HYIWEWCDHG LLSHDQQGRE RYQYGGDYGD YPNNYNFCMD GLIYPDQRPG PGLREYKQVL CPVEVSGVGE KTSVLRVKNR YWFSSLADIT LKVSVKAGGR QLTGYEIKLP HLQPGESEEV HLPALALGAE ETFIDVEVYK DSATRYSESG DLLGQYQHLL QPATALISPP ESIPVSALQC ADENHQMIVS GENFSLTFSR LSGELQSWKV AGEEVVGRAP HLTFFKPVID NHKQEYEGIW LPNHLQIMQQ HFRSLHWELQ GDDVVIEVRT LIAPPVFDFG MRCRYRWQIS AQGHVSLDLS GEPYGDFQQV IPKIGLDFGL SRRFEQVEYY GRGPGENYQD SCQANLIGHY QQRVGELFEH YPFPQDNGNR QEVRWLSLQD ANGHGIFIQP RRPINFSLWP YSAEMLHQAQ HINELQESDY LTLNLDDQIL GLGSNSWGSE VLDSYRVYLS SFNYGFTLVP FNRQETEAAT LAGYRFSPAI NNAQSEEANL
|
| |