Gene Sare_2239 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_2239 
Symbol 
ID5705865 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp2575261 
End bp2576265 
Gene Length1005 bp 
Protein Length334 aa 
Translation table11 
GC content71% 
IMG OID641271719 
Producttransketolase central region 
Protein accessionYP_001537090 
Protein GI159037837 
COG category[C] Energy production and conversion 
COG ID[COG0022] Pyruvate/2-oxoglutarate dehydrogenase complex, dehydrogenase (E1) component, eukaryotic type, beta subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.0788664 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.171519 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCAGCA CGACCATGGC GAAGGCACTC AACGCCGCGC TCGCCGACGC GATGCTCGAG 
GACGATCGGG TGCTCGTGTT CGGCGAGGAC GTCGGCCAAC TCGGCGGGGT CTTCCGGATC
ACCGACGGGC TGGCGGCCCG CTTCGGCGAC AAGCGCTGTT TCGACACTCC GCTCGCCGAG
GCCGGCATCG TCGGTTTCGC GGTCGGCCTG GCCATGTCAG GTCTGCGGCC GGTGGTGGAG
ATGCAGTTCG ACGCGTTCGG GTACCCGGCG TTCGAACAGA TCGCCTCGCA TGTGGCGAAG
CTGCGCAACC GCACCCGCGG CGCGTTGACC GCGCCCATCG TCATCCGGAT CCCGTACGCC
GGGGGCATCG GCGGGGTGGA GCACCACTGT GACTCCTCCG AGGCGTACTA CGCGCACACC
CCCGGCCTGA AGGTCGTCGC CCCGGCCACT GTGGCCGACG CCTACTCGCT GCTGCGCGAG
GCGATCGACG ACCCGGACCC GGTCGTGTTC CTGGAGCCGA AGAAGCTCTA TTTCGCCAGC
GCCGAGGCGC AACTGCCGGC CCGGACCGAA CCGTTCGGCC GTGCCGCCGT ACGCCGTCCC
GGCGCCGGCG CCACCCTGGT CGCGTACGGA CCGGCGGTGC CGGTGGCACT GGAGGCCGCC
GAGGCGGCCC GGGAGGAGGG CTGGGACCTC GAGGTCGTCG ACGTGCGCAC GATCGTGCCG
TTCGACGACG ACACGATCGC GGCTTCGGTG CGGAAGACGG GTCGGTGCGT GGTGGTCCAG
GAGGCTCAGG GTTTCGCCGG GGTCGGCGCG GAGATCGCCG CCCGGGTGCA GGAGCGCTGC
TTCCACTCTC TGCACGCCCC GGTGCTGCGG GTGTCCGGGC TGGATATCCC GTATCCCGCG
CCGATGCTGG AGCATACCCA CCTGCCGTCG GTGGATCGGG TGCTCGACGC CGTGGCCCGC
CTCCAGTGGG ACGACCAGCC CGACGAGCGA TGGGTGGCGG CCTGA
 
Protein sequence
MASTTMAKAL NAALADAMLE DDRVLVFGED VGQLGGVFRI TDGLAARFGD KRCFDTPLAE 
AGIVGFAVGL AMSGLRPVVE MQFDAFGYPA FEQIASHVAK LRNRTRGALT APIVIRIPYA
GGIGGVEHHC DSSEAYYAHT PGLKVVAPAT VADAYSLLRE AIDDPDPVVF LEPKKLYFAS
AEAQLPARTE PFGRAAVRRP GAGATLVAYG PAVPVALEAA EAAREEGWDL EVVDVRTIVP
FDDDTIAASV RKTGRCVVVQ EAQGFAGVGA EIAARVQERC FHSLHAPVLR VSGLDIPYPA
PMLEHTHLPS VDRVLDAVAR LQWDDQPDER WVAA