Gene Sala_1017 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSala_1017 
Symbol 
ID4081705 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSphingopyxis alaskensis RB2256 
KingdomBacteria 
Replicon accessionNC_008048 
Strand
Start bp1049490 
End bp1051643 
Gene Length2154 bp 
Protein Length717 aa 
Translation table11 
GC content65% 
IMG OID638009377 
ProductBeta-galactosidase 
Protein accessionYP_616067 
Protein GI103486506 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3250] Beta-galactosidase/beta-glucuronidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGCGCT GCCTGACGAT GCTGTTCGCG GCGATGCTTG CCGCCTGTTC GGCGGCGCCG 
CGGCCGACGC CCGCCGCGGC GCCGCCTTTC ACGCGCGACA CGACCGAACT GACGCATGGG
TGGCAATTCC GCTTCGACGA TGGGCTGACG CCCGAAACCG CGGCCACGCT CGTCGACGGC
GCGTGGCAAG ATGTCACCGT GCCCCACACA TGGAACCGGC TCGGCGAATA TCGCATCGGC
CGCACGGCGG CGACCGACAA CCGGCAGGGC AAGGGCTGGT ATCGCCTGCG CGTCAATGGC
GCGGCATTGC CAGCGGGCAA GCGCCATATC ATCGAGTTCG AAGCGGTCGG CAATCTCGCC
GACCTATGGG TCAATGGCCG CCATGTCGGG CGCCACGCGG GCGCTTTCTC CCGCTTCCGC
TTCGATCTCA CCGATTTTCT GAACCCAGCG GATCCCAACA TCATCCTGCT GCGCGCCGAC
AACAGTATGC CCGGACCGGG TAGCAGCACC GAGCATATCA TTCCGCTCGA CGGCGACTTC
TTCATCCACG GCGGCCCCTA CCGCCCGGCG CGCCTCCTCC ACGTCGCGCC GAGCCATATC
GCACTCGACG ACCATGGCGG GCCCGGCGTC TATGCGACCC CGACGATCAA AGACGGTGCG
GGTCAGGTGG CGATCCGCGT CCGGTTGACC GACATCGCGG CGGGGCAGAG CCTCGTGGCG
ACGCTGCGCG ATGCCGGGGG GCGAACCGTC GCCGAAGGCA CACGCCCGCT CACCGCGGGT
CAGCGCGAGG CCGCGCTGAC GCTCGACGTC GCCGCGCCGC GCCGCTGGAA CGGTCGCGCC
GACCCTTATC GCTATCGCCT CGAAACGCGC CTTGCCGATA CTGGCGGCAC GCTCGACAGC
GTGACCGTCC CCGTCGGCTT TCGCGCGTTC CGGTTCGACG CGGCAACGGG TTTTTATCTC
AACGGAAAAC ATCTGCCGCT GCATGGCGTA TCGCGACATC AGGATTATCT GGGCAAGGGC
TGGGCACTCT CGGCGGAGGA TCATGCGCGC GACATGGCGC TAATCGCCGA AATGGGCGCG
AACACGGTGC GCTTCGCCCA TTATCAGCAT GCCGCCGACT GGTTCGACCT TGCCGACCGT
TTTGGCATGA TCGTCTGGGC CGAACTGCCC TTCGTCAACA AGCCGAGCCA CGGCGATGCG
CCCGCATCAC CCGAACTGGT CGCCAACGCG CGCCAGCAGA TGATCGAGCT GATTCGCCAG
AATTACAACC ATCCCTCGGT CGTCACCTGG GGCATCGGCA ACGAGGTCGA TATGGACATG
GCGTTCGGCC GTATGGGACC AAAGGCCGAT GCGCGCCCTC TGCTGCGGGA ACTCCATGCG
CTGTCGAAGG CCGAAGACCC ATCACGCCCG ACGGTGATCG CTGACTGCTG CGAACTGACC
CCGGCGAAGA AGCCCGATTA CCAGCCGCCA CTCACCGGCG AGGCCGACCT GATGGGCTAC
AACCGCTATT ATGGCTGGTA TTATGGCGAG GTATCCGACC TCGGCCCGCA TCTCGACGCT
TTGCATGCCA AATATCCGTC CGTGCCGATT TCGGTCAGCG AATATGGCGC GGGCGGCGCG
CTCAGCCAGC ATGTCGAGGA TCCAGCGCAC CACCCGATCA ACCCCGGCGG CCGCCCGCAC
CCGGAGGAGT TCCAGAGCTG GCTGCACGAA CAAAGCTGGC CGCAACTGCG CGATCGCCGC
TATCTATGGG CCAGCTGGAT ATGGAACATG TTCGATTTTT CGTCGAAAAT CCGAAAGGAA
GGCGACGCGA CCGACATCAA CGACAAGGGC CTCGTCACCT TCGACCGCAA GGTCAAGAAG
GACGCCTTTT TCTATTACAA GGCCCAATGG TCGACCGAAC CCGTCGTCCA TATCACCAGC
CGTCGCTGGA CGACGCGCGC CAGTCCCGTC ACGGCGATCA AGGTTTACAG TAACGCGCCC
GCCGTGACGC TGACGTTCAA CGGGGTGCCG CTGGGCGAGG TCGCGTGCGC CGATCGCATC
TGCACGCTGG ACAATATCGT GCTGCGCCCC GGCGACAATG TGGTGACGGC GCGCGCGACC
TTTGCATCCG GGGTGGTGAA GGACGAAGTG CGCTGGACCC TTACGCCGCT CTAA
 
Protein sequence
MPRCLTMLFA AMLAACSAAP RPTPAAAPPF TRDTTELTHG WQFRFDDGLT PETAATLVDG 
AWQDVTVPHT WNRLGEYRIG RTAATDNRQG KGWYRLRVNG AALPAGKRHI IEFEAVGNLA
DLWVNGRHVG RHAGAFSRFR FDLTDFLNPA DPNIILLRAD NSMPGPGSST EHIIPLDGDF
FIHGGPYRPA RLLHVAPSHI ALDDHGGPGV YATPTIKDGA GQVAIRVRLT DIAAGQSLVA
TLRDAGGRTV AEGTRPLTAG QREAALTLDV AAPRRWNGRA DPYRYRLETR LADTGGTLDS
VTVPVGFRAF RFDAATGFYL NGKHLPLHGV SRHQDYLGKG WALSAEDHAR DMALIAEMGA
NTVRFAHYQH AADWFDLADR FGMIVWAELP FVNKPSHGDA PASPELVANA RQQMIELIRQ
NYNHPSVVTW GIGNEVDMDM AFGRMGPKAD ARPLLRELHA LSKAEDPSRP TVIADCCELT
PAKKPDYQPP LTGEADLMGY NRYYGWYYGE VSDLGPHLDA LHAKYPSVPI SVSEYGAGGA
LSQHVEDPAH HPINPGGRPH PEEFQSWLHE QSWPQLRDRR YLWASWIWNM FDFSSKIRKE
GDATDINDKG LVTFDRKVKK DAFFYYKAQW STEPVVHITS RRWTTRASPV TAIKVYSNAP
AVTLTFNGVP LGEVACADRI CTLDNIVLRP GDNVVTARAT FASGVVKDEV RWTLTPL