Gene Sala_2981 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSala_2981 
Symbol 
ID4083064 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSphingopyxis alaskensis RB2256 
KingdomBacteria 
Replicon accessionNC_008048 
Strand
Start bp3124219 
End bp3125493 
Gene Length1275 bp 
Protein Length424 aa 
Translation table11 
GC content67% 
IMG OID638011366 
Product3,4-dihydroxy-2-butanone 4-phosphate synthase 
Protein accessionYP_618019 
Protein GI103488458 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0108] 3,4-dihydroxy-2-butanone 4-phosphate synthase 
TIGRFAM ID[TIGR00506] 3,4-dihydroxy-2-butanone 4-phosphate synthase 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.57457 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0436179 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCATCC AGCTCATCGA TCGCATCCGC GCCATCGTCG CCGACGGCAC CATGTCGCGT 
TCGGGCCTCG CGCGCGCCGC GGGGCTTCAC GCCAACAGCC TGCGCGACCT CGACTCGCCC
GGCTGGAACC CGACCGCCGA AACGCTGCGC AAGCTCGAAA ACTGGCTCGC CAACGGCAGC
GACCTGTCGC CGATGGCGAG CCCCGAAGAG ATCATCGCCG AGGCGCGCAA CGGCCGCATG
TTCATCCTCG TCGACGACGA GGATCGCGAA AATGAGGGCG ACCTTGTCAT TCCCGCGCAG
ATGGCCACCC CCGACGCGAT CAATTTCATG GCGACGCACG GCCGCGGGCT CATCTGCCTG
ACGCTGACCA GGGCGCGCGT CGATGCGCTG GGCCTCGAGC TGATGAGCCG CAACAATGGC
ACGCGGCACG AAACGGCCTT CACCACCTCG ATCGAAGCGC GCGAAGGCGT GACGACGGGC
ATTTCGGCGG GGGACCGGGC GCGCACGGTG GCGGTGGCGA TCGATGCGAG CAAGGGGCGC
GACGATATCG TCACGCCGGG GCATGTCTTT CCCCTCATCG CGCGCGACGG CGGCGTGCTC
GTCCGCGCGG GCCATACCGA GGCGTCGGTC GACATCGCGC GCCTTGCCGG ACTCAATCCG
TCGGGGGTGA TCTGCGAGAT CATGAACGAC GATGGCACCA TGGCGCGCCT CGACGACCTC
ATCCCCTTCG CGCGGCGCCA CGGGCTCAAG ATCGGTACGA TCGCCGACCT CATCGCCTAT
CGCAACCGCA GCGACCGGCT GGTCGAATGC GTTGCCGACG AACCGTTCGA ATCGGATTAT
GGCGGCGAGT GGCGGCTTAA ATCCTATCGC AACAAGATCG ACGGCAGCGT CAATCTGGTG
CTGCAAAAGG GGCCGGTCGA TCCCGATGGC GTGACATTGG TGCGGATGCA CCCCGTGTCG
ATCTTCGACG ATATCATGGG GCGTCCCGGC CCCCGCAAGC GCCGCCTGCA ACGGTCGATG
GACGCGATCG GCGAAGCGGG TGCGGGGGTC ATCGTCCTGC TCATGCGCCC GCTCCCCGGA
TCGGCCGACG CCGAGGCGCT GCCGCCGCCG ACCGGCGGCA TGGACCTGCG CACCTACGGC
ATCGGCGCGC AGATCCTCGC CGACCTCGGC GTTCACGCGA TGGAACTGCT CACCCCCACC
CACAGCAATA TCGTCGGGCT CGAAGGCTAT GGCCTGTCGG TCGTCGGCGA ACGCCCCATT
CCCGGAGAAG CCTGA
 
Protein sequence
MTIQLIDRIR AIVADGTMSR SGLARAAGLH ANSLRDLDSP GWNPTAETLR KLENWLANGS 
DLSPMASPEE IIAEARNGRM FILVDDEDRE NEGDLVIPAQ MATPDAINFM ATHGRGLICL
TLTRARVDAL GLELMSRNNG TRHETAFTTS IEAREGVTTG ISAGDRARTV AVAIDASKGR
DDIVTPGHVF PLIARDGGVL VRAGHTEASV DIARLAGLNP SGVICEIMND DGTMARLDDL
IPFARRHGLK IGTIADLIAY RNRSDRLVEC VADEPFESDY GGEWRLKSYR NKIDGSVNLV
LQKGPVDPDG VTLVRMHPVS IFDDIMGRPG PRKRRLQRSM DAIGEAGAGV IVLLMRPLPG
SADAEALPPP TGGMDLRTYG IGAQILADLG VHAMELLTPT HSNIVGLEGY GLSVVGERPI
PGEA