Gene Sala_1014 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSala_1014 
Symbol 
ID4081702 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSphingopyxis alaskensis RB2256 
KingdomBacteria 
Replicon accessionNC_008048 
Strand
Start bp1042291 
End bp1044609 
Gene Length2319 bp 
Protein Length772 aa 
Translation table11 
GC content64% 
IMG OID638009374 
Productglycoside hydrolase family protein 
Protein accessionYP_616064 
Protein GI103486503 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1472] Beta-glucosidase-related glycosidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.285084 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGCCGA TTTCGCGCAA TCTGACAAGC GCCACGCTCG CCACGCTGCT GGTGGCCGGC 
TCGCTGGCCC CCGCCCCGCT GACCGCCGCC CCCGCGGCGA CCGCCAGCGA CAAGGCGCCG
GTCGATGCCG CGAGCTGGCA GCGCGCCGAC CCCGCGATGG ACCGCTTCAT CGCCGATCTG
ATGGTGAAGA TGACGCTCGA CGAAAAGACC GGCCAGCTCA CGCTGCTCAC GAGCAACTGG
GAGTCGACCG GCCCGACGAT GCGCGACAGT TACAAGGAGG ATATTCGCGC CGGGCGTGTC
GGGGCGATCT TCAACGCCTA CACCGCCAAA TATACGCGCG AACTGCAAGC GCTTGCGGTC
GAGGGAACGC GCCTCAAAAT CCCCCTGCTC TTCGGCTATG ACGTGATCCA CGGCCACCGG
ACGATCTTTC CCATCTCGCT CGGCGAAGCG GCGAGCTGGG ACCTGCAGGC GATTGAAAAA
GCCGCTCGAA TCTCGGCCAT CGAGGCATCG GCCGAGGGCA TCCACTGGAC CTTCTCACCC
ATGGTCGACA TCGCGCGCGA TCCGCGCTGG GGTCGCATTT CCGAAGGCGC GGGCGAGGAT
GTCTATCTCG GCAGCCTGAT CGCAAAGGCG CGCGTGCGCG GCTATCAGGG CGGCGACCTG
TCGCGGCCCG ACACGATCCT GGCGACCGCC AAGCATTTTG CCGCCTATGG CGCGGCGCAG
GCGGGACGCG ATTACCACAC GGTCGACATT TCGGAGCGCA CGATGCGCGA TGTCTATCTG
CCGCCATTCA AGGCCGCGGC CGACGCGGGG GCAGCGACCT TCATGACCGC ATTCAACGAA
TATGACGGTG TCCCGGCGTC GGGGAGCCAC TATCTGCTCA CCGACGTGCT GCGCAAGAAA
TGGGGCTTCA AAGGCTTTGT CGTAACCGAT TACACGTCGA TCAACGAAAT GGTCCCGCAC
GGCTATGCGA AGGATCTGAA GCAGGCAGGC GAGCAGGCGA TGCGCGCCGG AGTCGACATG
GACATGCAAG GTGCGGTTTT CATGGAAAAC CTCGCCAAAT CGGTCGCCGA GGGCAAGGTC
GACACCGCGC GCATCGACGC GGCGGTGAAG GCGATACTCG AGATGAAATA TCGCCTCGGC
CTGTTCGACG ATCCTTATCG TTACGCCGAC GCGGCGCGCG AAAAAGCGAC GATCTACAAG
CCCGCGTTTC TCGAAGCGGC GCGCGATGTC GCGCGCAAGT CGATCGTCCT CCTCAAGAAC
AAGGACAATG TCCTGCCACT GGCCGCCAGC GCAAAGTCGA TCGCGGTGAT CGGCCCGCTC
GGCAACAGCA AGGAAGATAT GATCGGCAGC TGGTCGGCCG CGGGCGACCG GCGGACGCGG
CCGGTTACCT TGCTCGAAGG CTTGCAGGCC GGCGCCCCCA AGGGAACGAC GATCGCCTAT
GCCAAGGGCG CGAGCTATCA TTTCGACGAT GTCGGCAAGA CCGACGGTTT TGCCGAAGCG
CTCGCGCTTG CGGAAAAATC GGATGTCATC ATCGCCGCGA TGGGTGAACA TTGGAACATG
ACCGGCGAGG CGGCAAGCCG CACCTCGCTT GACCTGCCGG GCAACCAGCA GGCGCTTCTC
GAAGCGCTCG AAAAGACCGG CAAGCCGGTC ATCCTCGTGC TGATGAGCGG GCGACCGAAC
AGCATCGAAT GGGCCGATGC CAATGTCGAT GCGATTCTGG AGGCCTGGTA TCCCGGCACG
ATGGGGGGAC ATGCGATCGC CGACATATTG TACGGTCGCT ACAACCCGTC GGGCAAATTA
CCGGTGACCT TTCCGCGCAC GGTCGGGCAG GTGCCGATCC ATTATGACAT GAAGAACACC
GGTCGCCCGA TCGAACTGGG CGCGCCGGGC GCGAAATATG TCTCGCGCTA CCTCAACACG
CCGAACACGC CGCTTTATCC CTTTGGCTAT GGCCTCAGCT ACACAAGCTT CACTTACTCG
CCGGTCACGC TCGACAGGTC GAAAATCCGC CCCGGCGAAC CGCTGACCGC CAGCGTCACC
GTGACCAACA GCGGCCCGCG CGACGGGGAG GAGGTGGTGC AGCTTTACGT CCGCGACCTC
GTCGGTTCGG TGACGCGCCC GGTCAAGGAA TTGAAGGGAT TCCAGAAGAT CGGCCTGAAA
AAGGGCGAAA CGCGCACGGT GCGCTTCACG CTGACCGACG CCGACCTCGC CTTCACGCGC
CAGGACATGA GCTGGGGCAG CGAGCCCGGC GCGTTCAAGC TGTGGATCGG CCCCTCGTCG
GCCGAAGGAT CCGAAGCCAG CTTCGAACTG ACCGAATAG
 
Protein sequence
MPPISRNLTS ATLATLLVAG SLAPAPLTAA PAATASDKAP VDAASWQRAD PAMDRFIADL 
MVKMTLDEKT GQLTLLTSNW ESTGPTMRDS YKEDIRAGRV GAIFNAYTAK YTRELQALAV
EGTRLKIPLL FGYDVIHGHR TIFPISLGEA ASWDLQAIEK AARISAIEAS AEGIHWTFSP
MVDIARDPRW GRISEGAGED VYLGSLIAKA RVRGYQGGDL SRPDTILATA KHFAAYGAAQ
AGRDYHTVDI SERTMRDVYL PPFKAAADAG AATFMTAFNE YDGVPASGSH YLLTDVLRKK
WGFKGFVVTD YTSINEMVPH GYAKDLKQAG EQAMRAGVDM DMQGAVFMEN LAKSVAEGKV
DTARIDAAVK AILEMKYRLG LFDDPYRYAD AAREKATIYK PAFLEAARDV ARKSIVLLKN
KDNVLPLAAS AKSIAVIGPL GNSKEDMIGS WSAAGDRRTR PVTLLEGLQA GAPKGTTIAY
AKGASYHFDD VGKTDGFAEA LALAEKSDVI IAAMGEHWNM TGEAASRTSL DLPGNQQALL
EALEKTGKPV ILVLMSGRPN SIEWADANVD AILEAWYPGT MGGHAIADIL YGRYNPSGKL
PVTFPRTVGQ VPIHYDMKNT GRPIELGAPG AKYVSRYLNT PNTPLYPFGY GLSYTSFTYS
PVTLDRSKIR PGEPLTASVT VTNSGPRDGE EVVQLYVRDL VGSVTRPVKE LKGFQKIGLK
KGETRTVRFT LTDADLAFTR QDMSWGSEPG AFKLWIGPSS AEGSEASFEL TE