Gene Sala_1663 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSala_1663 
Symbol 
ID4080972 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSphingopyxis alaskensis RB2256 
KingdomBacteria 
Replicon accessionNC_008048 
Strand
Start bp1748404 
End bp1750449 
Gene Length2046 bp 
Protein Length681 aa 
Translation table11 
GC content63% 
IMG OID638010037 
Productalpha-glucosidase 
Protein accessionYP_616709 
Protein GI103487148 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1501] Alpha-glucosidases, family 31 of glycosyl hydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTTCAAA AATCACGCGG CGAGAGGAAG CAAGGCGTGA AATTGCGGTT TTTCGAGAGC 
AGGGATGGGT TCGAGCTTCG ATTCGGCTCG CATCTCGTGC TACGTCACGT TTCGGATTGT
CCTGCGGTGG TCATCGCTCG CGGTGCTCCC AGCATCGAGA TGTATCGCGG CAATTACCGG
ATCGACGATT CACCGGAAGG AATAATCCGG TGCGCCGATT GGCGCGCCGA CGGCGCTGAC
ATGCTCCTGC TCGACGCTGG CGAGCCGACG GTGCGGCTGG GCATCGAGGG CAGGGCGCTG
CGCGTCGAGG CGCTGCGGAG CGGCTATGAC CGGATAACCA CGCGCTTCTT CGCCGAACCG
GGCGAGGCGG TCTGGGGCGG CGGCGAACAG ATGAGCTATC TGATGCTGGC GGGCCGCCGA
TTTCCGATGT GGACGAGCGA GCCGGGGGTG GGGCGCGACA AGTCGACCGA ACTGACGCGC
ATCATGGATG CCGAAGGAAT GGCCGGCGGC GATTATTGGA ACAGCAATTA CCCGCAGCCG
ACTTTCCTGA CCTCCCGATG GATCGCCGTT CATCTGGACG ATTTTGGCTA TAGCGTCCTC
GATTTTTCCG ATCCGGCAAC GCATCGCGTC GACTATTGGG GGACGCAGGC GCGGTTCGAG
TTTTTCGCCG CCGACGGTCC GGCAGAGCTT GTCGGCCGAT TATCGACGCG GTTCGGCCGC
CAGCCTGCGC TGCCCGACTG GGCCATCGGG GGCGCGATCG TCGGGCTCAA GTCGGGTGCA
TCGAGCTTCG AGCGGCTGGA GCGATTTCTG GACGCTGGCG CCGTCGTGGG CGGATTGTGG
TGCGAGGATT GGGCGGGTGT TCGCGAGACG AGCTTTGGTC GCCGGCTTTT CTGGGACTGG
GTCCGCGGCG CGCGCAGCGA ACAGCGTTAT CCCGATCTTC GGACGCGCAT CGCGGCGCTG
GAAGAGCGCG GCATTCGTTT CCTCGCCTAT GTGAATCCCT ATCTGGCGGT CGATGGCACG
CTGTTCGAAC AGGCGAAAGC GGACGGACAT TTCTGTCTGC GCCAGGACAG CGACGAGGTC
TATCTCGTCG ATTTTGGCGA GTTTTATTGC GGGGTTCTCG ACTTCACGCG CGCGGCGACG
CGGGACTGGT TTGCCGAGCA TATTCTGGGC CGCGAGATGC TCGACAACGG CATTGCCGGA
TGGATGGCCG ATTTCGGCGA ATATCTGCCG ACCGATGTTC GGCTCGCCGA CGGGTCGGAC
CCGATGGAGG CGCATAATCG CTGGCCAGTG CTTTGGGCCG AGGTCAATGC CCGGGCGGTT
GCATCGCGCG GCAAGACGGG CGAGGCTCTT TTCTTCATGC GGGCGGGATT TTCCGGCGTT
CAGGCGCATT GTCCGCTGCT CTGGGCCGGT GACCAGAGCG TCGATTTTAC CCGTCACGAC
GGTATCGGAA CGGTGTTGAC GGGAGCCTTG TCGGCCGGGC TGGTCGGTAA CGCCTACAGT
CATTCCGACT GCGGCGGCTA TACGTCGCTC CACGGCAATG TGCGGACCGA GGAACTGATG
CAGCGCTGGT GCGAGCTTGC CGCCTTCGCG CCGGTCATGC GCAGCCATGA GGGCAACCGG
CCCGACGACA ATCTGCAATA TGATTCGACG GCTGAACTGC TCGCCTGCTT CGCGCGCTGG
AGCCGGGTCC ATGCGCATCT GGCACCCTAT GTCAGGCATT TGTGCGACGA GGCGCAGGAG
ACGGGACTGC CCGCCCAGCG ACCGCTGTTT CTTCATTACC CGGACGATCC CACCCTCTTT
ACTGTACAGG ACCAGTATCT TTACGGTGCC GACCTCCTGG TGGCGCCAGT CGTCGAACAG
GGGATCGAAC GGCGCAGCGT CGTCCTTCCG GGGAAGGGGC CATGGCGCCA CTGCTGGACG
GGGGAGGAGT TTGCGCCCGG TGTTCACCAG ATCCCCGCGC CCATTGGCAT GCCACCGGTA
TTCTTCCGGC CCGACAGTGG CTTTGCGCCG CTGTTCGGCC GACTGGCGGA GATATGGAAA
CAATGA
 
Protein sequence
MVQKSRGERK QGVKLRFFES RDGFELRFGS HLVLRHVSDC PAVVIARGAP SIEMYRGNYR 
IDDSPEGIIR CADWRADGAD MLLLDAGEPT VRLGIEGRAL RVEALRSGYD RITTRFFAEP
GEAVWGGGEQ MSYLMLAGRR FPMWTSEPGV GRDKSTELTR IMDAEGMAGG DYWNSNYPQP
TFLTSRWIAV HLDDFGYSVL DFSDPATHRV DYWGTQARFE FFAADGPAEL VGRLSTRFGR
QPALPDWAIG GAIVGLKSGA SSFERLERFL DAGAVVGGLW CEDWAGVRET SFGRRLFWDW
VRGARSEQRY PDLRTRIAAL EERGIRFLAY VNPYLAVDGT LFEQAKADGH FCLRQDSDEV
YLVDFGEFYC GVLDFTRAAT RDWFAEHILG REMLDNGIAG WMADFGEYLP TDVRLADGSD
PMEAHNRWPV LWAEVNARAV ASRGKTGEAL FFMRAGFSGV QAHCPLLWAG DQSVDFTRHD
GIGTVLTGAL SAGLVGNAYS HSDCGGYTSL HGNVRTEELM QRWCELAAFA PVMRSHEGNR
PDDNLQYDST AELLACFARW SRVHAHLAPY VRHLCDEAQE TGLPAQRPLF LHYPDDPTLF
TVQDQYLYGA DLLVAPVVEQ GIERRSVVLP GKGPWRHCWT GEEFAPGVHQ IPAPIGMPPV
FFRPDSGFAP LFGRLAEIWK Q