Gene Spro_3173 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSpro_3173 
SymbollacZ 
ID5605502 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSerratia proteamaculans 568 
KingdomBacteria 
Replicon accessionNC_009832 
Strand
Start bp3491438 
End bp3494527 
Gene Length3090 bp 
Protein Length1029 aa 
Translation table11 
GC content60% 
IMG OID640938716 
Productbeta-D-galactosidase 
Protein accessionYP_001479401 
Protein GI157371412 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3250] Beta-galactosidase/beta-glucuronidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00462554 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTAGCC TGCCCTATCC TTCCCTGAAA GACCTCCTCG CCCGCCGGGA CTGGCAAAAT 
CCGGCCTGTA CCCATTATCA GCGGCTGGCT GCACATCCAC CTTTCTCGAG CTGGCGCAAC
CTGAATGCCG CCCGTGATGA CAAGAGCAGC GAGAGCCGGC AAATATTGAA CGGCGACTGG
CAGTTCAGCT ATTTCGATAA GCCCCAGGCG GTACCCGACG GTTGGTTACA ACAGGACCTG
ACCGACGCAG ATACTCTTGC GGTGCCGTCC AACTGGCAAC TTGCCGCTTA TGACGCGCCG
ATCTACACCA ACGTCCGCTA CCCGATCCCG GTCAATCCGC CACAGGTACC AGAGGAAAAC
CCGACCGGCT GCTATTCGCG GCAGTTTACC GTCGACCCTG CCTGGCTGGC GGAAGGCCAG
ACGCGCATCA TTTTTGACGG CGTCAACTCG GCGTTTTATC TGTGGTGCAA CGGCCACTGG
GTCGGTTATT CACAGGACAG CCGCCTGCCC GCCGAGTTTG ATCTCAGCCC CTGGTTGCAG
GCCGGTGAGA ACCGACTGGC GGTGATGGTG CTGCGCTGGT GCGATGGCAG CTATCTGGAA
GATCAGGATA TGTGGCGCAT GAGCGGTATT TTCCGCGACG TCAGCCTGCT GCATAAACCC
GCCACGCATC TGAGCGATAT CCGCATCACC ACCCCGCTGT ATGACAGTTT CCGCCGTGGC
GAACTGGTGG CGGAAGTTCA CATCAACCAG CCCGCGCAGC ACCGGGTACA GTTGCAGCTA
TGGCGTGATG GCCAGCTGGT TGGGGAAAAA ACTCAGGCAT TCGGCAGTGA AATTATCGAC
GAACGCGGTG CCTATGAGGA TCGCACTACC CTGTGCCTCC CGGTGGAACA ACCGGCGTTG
TGGAGCGCGG AAACACCCAC GCTGTACCGC GCAACCGTCA CCCTGCTGTC GCCGGAAGGA
AAAATTATTG AGGTGGAGGC CTATGACGTC GGCTTCCGCC AGGTGGAAAT CAGCAATGGA
CTGCTGAAGC TTAACGGCCA GCCCTTGTTG ATCCGCGGTA CCAACCGCCA CGAACATCAT
CCGCAGCACG GCCAGGTGAT GGACGAGGCC ACCATGCGCC ATGACATCCT GCTGATGAAG
CAACACAACT TCAACGCGGT GCGCTGCTCA CACTACCCGA ACCATCCATT GTGGTACCGG
TTATGCGATC GCTACGGGCT GTATGTGGTG GATGAAGCCA ATATTGAAAC CCACGGCATG
CAGCCGATGA ACCGGTTGTC TGACGATCCG CTATGGTTGC CGGCAATGAG CGAACGCGTA
ACCCGCATGG TGCAGCGTGA CCGCAACCAC CCTTGTATTA TTATCTGGTC GCTGGGTAAC
GAATCCGGTC ACGGCTGCAA CCACGACGCG CTGTATCGCT GGGTGAAAAC TCAGGATCCT
ACCCGCCCGG TTCAGTACGA AGGCGGGGGG GCCAACAGCG CCGCCACCGA TATTATCTGC
CCGATGTATG CGCGGGTGGA TCAGGATCAG CCGTTCCCCG CCGTGCCCAA GTGGTCAATC
AAAAAGTGGA TCGGCCTGCC GGATGAGCAT CGTCCGCTGA TCCTGTGCGA ATACGCTCAT
GCGATGGGCA ACAGCTTCGG GGGTTTTGAC CGCTACTGGC AGGCCTTCCG TCAGTATCCG
CGCCTGCAGG GTGGCTTCGT CTGGGACTGG GTCGATCAGG CACTGACCCG CAGTGATGAA
AACGGCAACC CTTACTGGGC TTACGGTGGC GACTTTGGCG ACACGCCGAA CGATCGACAA
TTCTGCCTTA ACGGTCTGGT ATTCCCCGAC CGCACACCCC ACCCTGCGCT GTTTGAAGCG
CAACGCGCAC AGCAATTTTT CCAGTTTACC TTCGACGCCG AAACGCTGAC GCTGACCGTC
AACAGCGAGT ATCTGTTCCG CCAAACCGAT AATGAACGGC TGAACTGGCG GCTGGAACTC
GATGGCACGG AGCGCGCCAG CGGCAGCTTC GATCTCAACC TGCTACCGCA GAGTAGCGCC
AGCTTCCCAC TGCTCGAACG CTTGCCGATG CTCCATCAAC CCGGCGAACT GTGGCTGAAT
GTCGAAGTGG TGCAACCGCT GGCCACCGAC TGGTCCGAAG CCAACCATCG CTGCGCCTGG
GATCAATGGC TGGTGCCACG CACGCTGCAT TTTGCACCAC CAGCAGTGGC CGGTTCAGCG
CCACAGCTGA GCCAAAATGA CCAAACTATC GACATAACCC ATGGCCATCA ACGCTGGCAG
TTTACGCGCC ACGACGGCTG CCTGAGCCAA TGGTGGCAAC ATGACCACTC TCAACTGCTG
ACGCCACTGC GCGATAACTT TATCCGCGCG CCGCTGGATA ACGACATCGG CATCAGCGAA
GTCGAGCGCA TCGATCCCAA CGCCTGGGTA GAACGCTGGA AGCTGGCGGG CATGTATCGG
CTGGAGGAGC GCTGCACGCT GTTGCAGGCC GATCAATTGA GCGACGGCGT GCGGGTGGTG
AGTGAACACC TGTTCGAAGC CGATGGGCAA ACGCTGCTGC GCAGTCGCAA ACAGTGGCTG
TTCGACAGCG AGGGCGCCGT CAGCATCAGC GTCGACGTCG ATATTGCCGC CAGTCTGCCG
CCACCGGCAC GTATTGGCCT GAGCTGCCAA TTGAAAGAAA TTCATCCACA GGCGCAATGG
TTGGGGCTGG GCCCACATGA GAATTACCCG GACCGCCGCC TCGCCGCGCA ATTTGGACGT
TGGCAGCAGC CGCTGGAAGC GTTGCACACG CCGTATATCT TCCCCGGCGA GAACGGGCTG
CGCTGCGAGA CCCGCAGCCT GCTGTACGGT GGCTGGCACA TCGACGGACG GTTCCACTTC
TCGCTCAGCC GCTACGGCCT GCGCCAGTTG ATGGAGTGCA GCCACCAGCA CCTGCTGCAA
CCGGAAGCCG GCACCTGGCT CAGCCTGGAC GGTTTCCACA TGGGGGTGGG CGGTGACGAC
TCCTGGAGCC CGAGCGTTAA TCAGGACTAC CTGCTCAGCG GCAGCCATTA CCATTATCAA
CTGCGTCTAA AACGCGCAGA ACGGAGCTAA
 
Protein sequence
MSSLPYPSLK DLLARRDWQN PACTHYQRLA AHPPFSSWRN LNAARDDKSS ESRQILNGDW 
QFSYFDKPQA VPDGWLQQDL TDADTLAVPS NWQLAAYDAP IYTNVRYPIP VNPPQVPEEN
PTGCYSRQFT VDPAWLAEGQ TRIIFDGVNS AFYLWCNGHW VGYSQDSRLP AEFDLSPWLQ
AGENRLAVMV LRWCDGSYLE DQDMWRMSGI FRDVSLLHKP ATHLSDIRIT TPLYDSFRRG
ELVAEVHINQ PAQHRVQLQL WRDGQLVGEK TQAFGSEIID ERGAYEDRTT LCLPVEQPAL
WSAETPTLYR ATVTLLSPEG KIIEVEAYDV GFRQVEISNG LLKLNGQPLL IRGTNRHEHH
PQHGQVMDEA TMRHDILLMK QHNFNAVRCS HYPNHPLWYR LCDRYGLYVV DEANIETHGM
QPMNRLSDDP LWLPAMSERV TRMVQRDRNH PCIIIWSLGN ESGHGCNHDA LYRWVKTQDP
TRPVQYEGGG ANSAATDIIC PMYARVDQDQ PFPAVPKWSI KKWIGLPDEH RPLILCEYAH
AMGNSFGGFD RYWQAFRQYP RLQGGFVWDW VDQALTRSDE NGNPYWAYGG DFGDTPNDRQ
FCLNGLVFPD RTPHPALFEA QRAQQFFQFT FDAETLTLTV NSEYLFRQTD NERLNWRLEL
DGTERASGSF DLNLLPQSSA SFPLLERLPM LHQPGELWLN VEVVQPLATD WSEANHRCAW
DQWLVPRTLH FAPPAVAGSA PQLSQNDQTI DITHGHQRWQ FTRHDGCLSQ WWQHDHSQLL
TPLRDNFIRA PLDNDIGISE VERIDPNAWV ERWKLAGMYR LEERCTLLQA DQLSDGVRVV
SEHLFEADGQ TLLRSRKQWL FDSEGAVSIS VDVDIAASLP PPARIGLSCQ LKEIHPQAQW
LGLGPHENYP DRRLAAQFGR WQQPLEALHT PYIFPGENGL RCETRSLLYG GWHIDGRFHF
SLSRYGLRQL MECSHQHLLQ PEAGTWLSLD GFHMGVGGDD SWSPSVNQDY LLSGSHYHYQ
LRLKRAERS