Gene Spro_1973 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSpro_1973 
SymbolebgA 
ID5603418 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSerratia proteamaculans 568 
KingdomBacteria 
Replicon accessionNC_009832 
Strand
Start bp2158498 
End bp2161590 
Gene Length3093 bp 
Protein Length1030 aa 
Translation table11 
GC content56% 
IMG OID640937511 
Productcryptic beta-D-galactosidase subunit alpha 
Protein accessionYP_001478204 
Protein GI157370215 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3250] Beta-galactosidase/beta-glucuronidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.75541 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00919107 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGAACAATT GGGAAAACAT AGAAAAACAG TCGGAGAATC GCTTGCCGGC GCGGGCGAGT 
TTCTTCAGCT ATGCGGATGC CAGGCAGGCA TTGAGTCTTG ATCGCAACGC AAGTTTGGGC
TTCCAGTTGC TGAGCGGGCG TTGGCAGTTT CGCTACTTCG AACACCCGGA CCTGGTGCCG
GAGGCGTTTT ACCGCCAGCC GATGGCGGAA TGGGGCGAAA TCACCGTGCC GGGCATGTGG
CAGATGGAAG GGCATGGTCA ACTGCAGTAT ACCGACGAAG GTTACCCATT CCCGATCGAC
GTGCCTTATG TGCCGACCAA TAATCCTGCC GGGGCCTATC AGCGCCTTTT TACTCTCGAT
AACGAGTGGC TGGATCAACA GGTCATCATC AAATTTGACG GCGTGGAAAC CTATTTTGAG
GTTTACCTCA ATGGTCACTA CGTCGGTTTT AGTAAAGGCA GCCGTCTGTC AGCCGAGTTT
GATCTCAGCG ACTATCTGCA AGCGGGGGAC AACCTGCTGT CCGTTCGCGT GCTGCAATGG
GCCGATTCTA CCTATATCGA AGATCAGGAC ATGTGGTGGA TGGCCGGTAT CTTCCGCGAT
GTTTACCTGA TCGGCCAGCA ACGCACGCAT ATTCATGATC TTACTTTGGT TACCACCTTT
GACGAGCAGT ATTGCGATGC GCTGCTGGCG ATAGACGTCG AGTTGCAGCA TCTGGGGCAA
GGCGTGGCTG GCGGTTATCG TCTGCAGGCG CAGCTCTTTG ACGGTGAAGC CTGCGTCGGT
GAGCTGTGGG CGAACGAACT GAGCATTGGG GCGGGTGCCA CCTGCCGGTT TGAAATTCCG
GTCAGCCAAC CCAGGCAGTG GAATGCGGAA GATCCCTATC TGTATCAACT GTTGCTCAGT
CTGCGCGACG GCGATGGCAA TCTGTTATCG GTAGTGCCGC AGCGGGTGGG TTTCCGTGAA
ATTAGCGTGC GCGATGGGCT ATTCCACATC AACGGGCGCT ACCTGAAACT GCACGGCGTT
AACCGCCATG ACCACGATCA TCGCAAAGGT CGGGCGGTTG ATATGGCGCG GGTTGAGCGC
GACATCGTGC TGATGAAACA GCACAACATC AACTCGGTGC GCACCGCCCA TTATCCGAAC
GATCCGCGTT TTTACGAGCT GTGCGATATC TACGGCCTGT TTGTGATGGC GGAAACCGAT
CTGGAAAGCC ACGGGTTTGC CAACGTTGGC GATATCAGCC GCATTACCGA CGATCCACGT
TGGGAAAACG CCTACGTAGA ACGCATTGAA CGCCATGTGA AGGCGCAGAA AAACCATCCT
TCAATCATTA TCTGGTCACT GGGCAACGAG TCTGGCTATG GCTGCAACAT TCGCGCCATG
GCCAAACGCT GTAAGGCGCT CGATGCCACC CGACTGGTGC ATTATGAAGA AGACCGGGAT
GCGGAAGTGG TCGATGTGAT CAGCACCATG TATTCCCGCG TGGCGATGAT GAATGCGTTC
GGTGAGTATC CTCACCCCAA ACCCCGGATC CTGTGTGAAT ACGCTCATGC TATGGGCAAT
GGGCCGGGCG GACTGTTCGA ATATCAGTCG GTGTTTAACC GCCATGCCAG CCTGCAGGGA
CATTACATCT GGGAATGGTG CGATCACGGT TTGCTCAGCC ATGACCAGCA GGGGCGGGAA
CGTTATCAGT ACGGCGGCGA TTACGGCGAT TACCCGAATA ACTATAACTT CTGCATGGAT
GGCCTGATCT ATCCGGATCA ACGTCCTGGT CCCGGCCTGC GTGAATATAA ACAGGTGCTG
TGTCCGGTCG AGGTGAGCGG CGTGGGCGAG AAAACGTCGG TATTACGGGT CAAAAACCGC
TATTGGTTCA GCTCGCTGGC CGATATTACG CTGAAGGTCA GCGTTAAGGC CGGGGGGCGG
CAACTGACCG GCTATGAGAT TAAGCTGCCG CACCTGCAGC CGGGGGAATC AGAAGAGGTC
CATTTGCCTG CATTGGCGCT GGGCGCGGAA GAAACCTTCA TCGACGTCGA AGTGTATAAA
GACAGCGCCA CCCGTTACAG CGAAAGCGGA GATCTGCTGG GGCAGTATCA ACATCTGTTG
CAGCCCGCCA CTGCGTTAAT TAGCCCACCG GAGTCCATTC CTGTTTCTGC GCTGCAATGC
GCTGACGAAA ACCATCAAAT GATTGTCAGT GGCGAGAACT TCAGCCTGAC CTTCTCGCGC
CTGAGTGGCG AACTGCAGTC GTGGAAAGTG GCAGGAGAGG AGGTGGTGGG CCGCGCACCG
CACCTGACTT TCTTCAAACC GGTGATCGAC AACCACAAGC AGGAGTATGA AGGCATTTGG
CTGCCGAACC ATCTGCAGAT CATGCAGCAG CATTTCCGTA GCCTGCACTG GGAACTGCAG
GGGGATGACG TGGTGATTGA AGTCCGCACC CTGATTGCCC CGCCGGTATT TGATTTTGGC
ATGCGTTGCC GTTACCGCTG GCAGATCTCT GCGCAGGGCC ATGTCAGCCT GGATCTGTCG
GGCGAGCCTT ACGGCGATTT CCAGCAGGTG ATCCCGAAGA TTGGCCTGGA TTTTGGCCTC
AGCCGCCGCT TTGAACAGGT GGAATACTAC GGCCGTGGGC CGGGGGAAAA CTATCAGGAC
AGCTGCCAGG CTAACCTGAT CGGCCACTAT CAGCAGCGGG TTGGCGAGCT GTTTGAGCAC
TATCCGTTCC CGCAGGATAA CGGCAATCGC CAAGAGGTAC GCTGGCTGAG TCTGCAAGAT
GCCAATGGCC ACGGCATCTT TATCCAGCCG CGGCGGCCGA TCAATTTCAG CCTGTGGCCC
TACAGTGCCG AGATGCTGCA CCAGGCTCAG CATATTAATG AACTGCAAGA AAGCGACTAC
CTGACGTTAA ACCTCGACGA TCAAATTCTG GGGCTGGGCT CCAACTCCTG GGGTTCGGAG
GTGCTGGATT CCTATCGGGT TTATCTGTCG TCGTTTAACT ACGGCTTTAC GCTGGTGCCG
TTTAACCGGC AGGAAACCGA GGCGGCTACG CTTGCCGGCT ATCGTTTTTC ACCGGCCATT
AATAACGCTC AGTCAGAAGA GGCGAACTTA TGA
 
Protein sequence
MNNWENIEKQ SENRLPARAS FFSYADARQA LSLDRNASLG FQLLSGRWQF RYFEHPDLVP 
EAFYRQPMAE WGEITVPGMW QMEGHGQLQY TDEGYPFPID VPYVPTNNPA GAYQRLFTLD
NEWLDQQVII KFDGVETYFE VYLNGHYVGF SKGSRLSAEF DLSDYLQAGD NLLSVRVLQW
ADSTYIEDQD MWWMAGIFRD VYLIGQQRTH IHDLTLVTTF DEQYCDALLA IDVELQHLGQ
GVAGGYRLQA QLFDGEACVG ELWANELSIG AGATCRFEIP VSQPRQWNAE DPYLYQLLLS
LRDGDGNLLS VVPQRVGFRE ISVRDGLFHI NGRYLKLHGV NRHDHDHRKG RAVDMARVER
DIVLMKQHNI NSVRTAHYPN DPRFYELCDI YGLFVMAETD LESHGFANVG DISRITDDPR
WENAYVERIE RHVKAQKNHP SIIIWSLGNE SGYGCNIRAM AKRCKALDAT RLVHYEEDRD
AEVVDVISTM YSRVAMMNAF GEYPHPKPRI LCEYAHAMGN GPGGLFEYQS VFNRHASLQG
HYIWEWCDHG LLSHDQQGRE RYQYGGDYGD YPNNYNFCMD GLIYPDQRPG PGLREYKQVL
CPVEVSGVGE KTSVLRVKNR YWFSSLADIT LKVSVKAGGR QLTGYEIKLP HLQPGESEEV
HLPALALGAE ETFIDVEVYK DSATRYSESG DLLGQYQHLL QPATALISPP ESIPVSALQC
ADENHQMIVS GENFSLTFSR LSGELQSWKV AGEEVVGRAP HLTFFKPVID NHKQEYEGIW
LPNHLQIMQQ HFRSLHWELQ GDDVVIEVRT LIAPPVFDFG MRCRYRWQIS AQGHVSLDLS
GEPYGDFQQV IPKIGLDFGL SRRFEQVEYY GRGPGENYQD SCQANLIGHY QQRVGELFEH
YPFPQDNGNR QEVRWLSLQD ANGHGIFIQP RRPINFSLWP YSAEMLHQAQ HINELQESDY
LTLNLDDQIL GLGSNSWGSE VLDSYRVYLS SFNYGFTLVP FNRQETEAAT LAGYRFSPAI
NNAQSEEANL