Gene Sala_1023 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSala_1023 
Symbol 
ID4082306 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSphingopyxis alaskensis RB2256 
KingdomBacteria 
Replicon accessionNC_008048 
Strand
Start bp1056342 
End bp1057610 
Gene Length1269 bp 
Protein Length422 aa 
Translation table11 
GC content68% 
IMG OID638009383 
Productextracellular solute-binding protein 
Protein accessionYP_616073 
Protein GI103486512 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.705733 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGATTGA CCCGGCGCCA ATTGACAGGC GCACTCGCGG CGCTGCCCCT GCTCCCCATG 
CTCGGCGGCT GCGAGGAGCG GCACGCCGAC ACGCTGACCA TCTGGGCGAT GGGCAATGAA
GGAGCGAGCC TCCCCGCCCT TCTCAACAGG CTTGCGTTGC CCGCGGACCT GCCACCGGTC
GACGTGCAGC CACTGCCGTG GAGCGCAGCG CACGAAAAAC TGCTCACCGG CTTCGCGGGC
GGCTCGCTGC CCACGATCGG CCAGGTCGGC AACAGCTGGA TCGCCGAGAT GGCGGCGATC
GGCGCGATTG CTCCCCTGCC CGCTTCCGCC ACCACGCTGC TCGACGATCA GTTCGCCGCG
GTCGTTGAAA CCAACCGGAT CGGCGGCACC GCCTGGGCCG TGCCCTGGTA TGTCGACACG
CGGCTGCAAT TTTACCGCAA GGACATGTTC GCGCGTGCGG GTTATGCCGC GCCGCCGCTC
GCATGGGCCG AATGGAAGCG CGCGCTGCAC CGCGTCAAGG CGCTCGCCGG ACCCGGCAAT
TACGCCGTGC TGCTGCCGCT CAATGAGTTC GAGCAACTGC TGACCATCGC GCTGTCGGCG
AGTGCGCGCC TGCTGCGCGA CAAGGGGGCG CGCGGCGCCT TTTCCGACCC CGAGTTCAAG
GCTGCGCTCG CCTTCTATAA ATCGCTGTTC GACGAGCGGC TCGCGCCGAT CGCATCGGCG
ACGCAGATTT CGAACATCTG GACCGAATTC GCCAAAGGCT ATTTCAGCAT TTTTACGTCG
GGCCCATGGA CGATAGGTGA CATGAAAAGC CGCCTCGATC CCGCCATGCA GGACAAATGG
GCGACCGCGC CCAATCCCGG TCCCGGCGGC ATCGGTTCGG CGGCGCCGGG CGGGTCGAGC
CTCGTCGTTT TCGCCAGCCA GGCGGACAGC GCCGCCGCAT GGGATATCGT CGCGCGCCTG
CTCGCGCCCA CCGCACAGCT CGCGTTTCAC CGGCTGACCG GCAATCTGCC CGCGCGGCGT
TCGGTCTGGC GCGCCGCTGG CCTCGCGAGC GACCCCATCG TCGCCCCCTT CGCCACCCAG
CTCGACCATG CGACCGCGTT GCCCAAAGTG CCCGAATGGG AACGCATCGT CACCGAAATG
CAGGTGGTCG CCGAGCGCAT GGTGCGCGGC CACTATAGCG TCGATGCCGC CGCGCACGAG
ATCGACCGCC GCGCCGACCG CCTGCTCGAA AAAAGGCGCT GGATGCTCGA CAGGGGGCGC
GCCCTGTGA
 
Protein sequence
MRLTRRQLTG ALAALPLLPM LGGCEERHAD TLTIWAMGNE GASLPALLNR LALPADLPPV 
DVQPLPWSAA HEKLLTGFAG GSLPTIGQVG NSWIAEMAAI GAIAPLPASA TTLLDDQFAA
VVETNRIGGT AWAVPWYVDT RLQFYRKDMF ARAGYAAPPL AWAEWKRALH RVKALAGPGN
YAVLLPLNEF EQLLTIALSA SARLLRDKGA RGAFSDPEFK AALAFYKSLF DERLAPIASA
TQISNIWTEF AKGYFSIFTS GPWTIGDMKS RLDPAMQDKW ATAPNPGPGG IGSAAPGGSS
LVVFASQADS AAAWDIVARL LAPTAQLAFH RLTGNLPARR SVWRAAGLAS DPIVAPFATQ
LDHATALPKV PEWERIVTEM QVVAERMVRG HYSVDAAAHE IDRRADRLLE KRRWMLDRGR
AL