Gene Sala_0153 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSala_0153 
Symbol 
ID4081823 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSphingopyxis alaskensis RB2256 
KingdomBacteria 
Replicon accessionNC_008048 
Strand
Start bp147432 
End bp148898 
Gene Length1467 bp 
Protein Length488 aa 
Translation table11 
GC content65% 
IMG OID638008512 
Productglucose/sorbosone dehydrogenases-like protein 
Protein accessionYP_615210 
Protein GI103485649 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2133] Glucose/sorbosone dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGAAGC ATATTTTTGC CCTGCTGTTT TTGCTTGCAT TGACATCGTG CGGAGGTGGT 
GGGGGAGGCG ATACGCCGCC GCCGTTGGCC AACACGCCGC CCGTATTCAC CTCACTCCAG
ACCGCCAGCG TCGCCGAGAA CACGGCGAAC GCCTATCAGG CGGCGGCAAG CGATGCCGAC
GGCGACGCGC TGACCTTCGC CATCGACGGC GGCGCCGATG CGGCGCGCTT TGCGATCACC
GGCGCGGGCG CATTGCGGTT CAACACGCCG CCCGATTTCG ACCTGCCCGG CGATGCCGAT
GGCGACAATG TCTATGCCGT CGTGCTGCGC GTCAGCGATG GCCGGGCGAG CGTGACACAG
GCGGTCAACA TCACCGTCAC CAACAGCCGC GAAGGGATCG CCGTCGCGCG CGTCGGAACG
GGGTTCAGCC AGCCGCTTTA TGTCGCGCCG ATCCCCGGCG ACAATCGCAT CTATGTGGTC
GAAAAGGGCG GCGACGTCTA TCGTTTCGAC CCCGCGGACG GCAGCCGGAC GCGCGTGCTC
GACATTACCG ACATTTCGAC GAGCGGCGAG CGCGGCCTGC TTGGCCTTGC GCCCTATCCC
GACCATGCAA CCTCGCAGCG CCTGTTCGCG GTCGCCACGG CGATCAACGG CAATGTCCAG
GTGCGGCGCT ACACGCTGGG CCAGCCGAAT AGTTCGACCA GTTACGACCT CGTGCTCGAC
ATTCCGCATC CCGGTTTCGA CAATCATAAT GGCGGCTGGA TCGGCTTCGG CCCCGACGGC
CATGTCTATG TCGCGGTCGG AGACGGCGGC GGCGCGGGCG ACCCCAACAA CAATGCGCAA
AACCGCAATG TACAGCTTGG CAAGATCCTG CGCTTCGCGG TGGGCACCGG GGGCAGCACC
TATGCCCCGG CACCGGGCAA CCCCTTTCTT GCGGGCGGCG GCGATCCCTA TGTCTTTGCG
CTTGGCCTGC GCAACCCGTT CCGCGCCTCC TTTTCGGGGT CGACGCTGCT GATCGGCGAT
GTCGGGCAGA ATGCGGTCGA GGAAATCGAC ATGGTCGCGA CGGCACAGCC GGGGCTCAAC
TTTGGCTGGC GCTTCCTGGA AGGCACGCAG CCCTTTTCGG GAACGGCGCC TGCGGGCCTG
ACCCCGCCCG TCGCCGAATA TGGCCATGGC AGCGGTCCGC GTCAGGGGCG TTCGGTCACC
GGCGGCTATG TCTACCGTGG ACCGGTGGCC TCGTTGCAGG GCCAGTATGT CTTTGGCGAT
TTCGTGTCGG GCAATATATG GACCGTGCCC TTTGCCGACC TCGTTGCCGG TCAGACCTTG
CCCGCGGCCC GCTTTGCCGT GCGCAACGAG GATTTTGCTC CCGACGCAGG AACGATAGCC
AACATCGCCT CATTCGGCGA GGACAGCGCG GGCAATCTGT TCATCGTCAG CATCGGCGGC
GACATTTTCA TGGTGCGGCC GGGCTGA
 
Protein sequence
MPKHIFALLF LLALTSCGGG GGGDTPPPLA NTPPVFTSLQ TASVAENTAN AYQAAASDAD 
GDALTFAIDG GADAARFAIT GAGALRFNTP PDFDLPGDAD GDNVYAVVLR VSDGRASVTQ
AVNITVTNSR EGIAVARVGT GFSQPLYVAP IPGDNRIYVV EKGGDVYRFD PADGSRTRVL
DITDISTSGE RGLLGLAPYP DHATSQRLFA VATAINGNVQ VRRYTLGQPN SSTSYDLVLD
IPHPGFDNHN GGWIGFGPDG HVYVAVGDGG GAGDPNNNAQ NRNVQLGKIL RFAVGTGGST
YAPAPGNPFL AGGGDPYVFA LGLRNPFRAS FSGSTLLIGD VGQNAVEEID MVATAQPGLN
FGWRFLEGTQ PFSGTAPAGL TPPVAEYGHG SGPRQGRSVT GGYVYRGPVA SLQGQYVFGD
FVSGNIWTVP FADLVAGQTL PAARFAVRNE DFAPDAGTIA NIASFGEDSA GNLFIVSIGG
DIFMVRPG