Gene Sala_1330 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSala_1330 
Symbol 
ID4081001 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSphingopyxis alaskensis RB2256 
KingdomBacteria 
Replicon accessionNC_008048 
Strand
Start bp1392648 
End bp1393679 
Gene Length1032 bp 
Protein Length343 aa 
Translation table11 
GC content63% 
IMG OID638009693 
Producttransketolase, central region 
Protein accessionYP_616377 
Protein GI103486816 
COG category[C] Energy production and conversion 
COG ID[COG0022] Pyruvate/2-oxoglutarate dehydrogenase complex, dehydrogenase (E1) component, eukaryotic type, beta subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.0296032 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.157339 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGCCG AAACCCGGAC GAAAACCATG AACATGATCG AGGCGATCAA CAGCGCCATG 
GACGTCATGC TCGAACGCGA TCCCGCGACC GTCGTGATGG GCGAGGACGT CGGCTATTTC
GGCGGCGTTT TCCGCGCCAC CGCGGGCCTC CAGAAAAAGC ATGGCAAGAC GCGCGTGTTC
GACACGCCGA TCAACGAATG CGGCATCATC GGCGTCGCGG TCGGCATGGG CGCCTATGGT
CTGCGCCCCG TCCCCGAAAT CCAGTTCGCC GATTATATCT ATCCGGGGCT CGACCAGCTC
GTCAGCGAGG CGGCGCGCCT GCGCTATCGC TCGGCGAACG ACTATATATG CCCGATGACG
GTGCGCACAC CGTTCGGCGG CGGGATTTTC GGCGGCCAGA CGCACAGCCA GTCGCCCGAA
AGCATCATGA CGCATATCTG CGGCGTCAAG ACGGTGATCC CGTCGAACCC CTATGATGCC
AAGGGGCTGC TGATCGCGGC GATCGAGGAT AACGACCCCG TCGTCTTCCT CGAACCCAAG
CGCATCTATA ACGGCCCGTT CAGCGGCTAT TACGATCGCC CGGTCGAACC CTGGTCGAAG
CATGACGCCA GTGCGGTGCC CGAGGGCTAT TACCGCATCG ACCTGGGGAA AGCGGCGACG
GTGCGCGAGG GCGAAGCGGT GACCGTACTC GCCTATGGCA CAATGGTTCA TGTCGCAAAG
ACGATCATCG AGGAAATGGG GATCGACGCC GAAATCCTCG ACCTGCGCAC GCTGTTGCCG
CTCGACATAG CGGCGATCGA GGCGTCGGTG AAAAAGACCG GCCGCTGCCT GATCATCCAC
GAAGCGACGC GCACGTCGGG TTTTGGCGCC GAACTCGCCG CGCTGGTGCA GGAACGCTGC
TTCTATCATC TCGAGGCGCC CGTCGAGCGC GTCACCGGTT TCGACACGCC CTATCCGCAC
AGCCTGGAAT GGGCCTATTT CCCCGGCCCG GTGCGCATTG CGACCGCGCT GACCAAGATT
TTGAAGGACT GA
 
Protein sequence
MSAETRTKTM NMIEAINSAM DVMLERDPAT VVMGEDVGYF GGVFRATAGL QKKHGKTRVF 
DTPINECGII GVAVGMGAYG LRPVPEIQFA DYIYPGLDQL VSEAARLRYR SANDYICPMT
VRTPFGGGIF GGQTHSQSPE SIMTHICGVK TVIPSNPYDA KGLLIAAIED NDPVVFLEPK
RIYNGPFSGY YDRPVEPWSK HDASAVPEGY YRIDLGKAAT VREGEAVTVL AYGTMVHVAK
TIIEEMGIDA EILDLRTLLP LDIAAIEASV KKTGRCLIIH EATRTSGFGA ELAALVQERC
FYHLEAPVER VTGFDTPYPH SLEWAYFPGP VRIATALTKI LKD