Gene Sare_0108 details

Gene Information

Locus tag: Sare_0108
Symbol:
ID: 5707057
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 121115
End bp: 122104
Gene Length: 990 bp
Protein Length: 329 aa
Translation table: 11
GC content: 69%
IMG OID: 641269634
Product: transketolase central region
Protein accession: YP_001535034
Protein GI: 159035781
COG category: [C] Energy production and conversion
COG ID: [COG0022] Pyruvate/2-oxoglutarate dehydrogenase complex, dehydrogenase (E1) component, eukaryotic type, beta subunit
TIGRFAM ID:
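
The coordinate and length fields above are mutually consistent, and the arithmetic can be sketched as follows. This is a minimal illustration, not part of the record: the helper names are hypothetical, and it assumes 1-based inclusive coordinates and a coding sequence that includes its stop codon.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Protein length, assuming the CDS length includes one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the stop codon


# Values taken from the Start bp / End bp fields of this record.
length = gene_length_bp(121115, 122104)
print(length)                     # 990
print(protein_length_aa(length))  # 329
```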


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.741994
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000246371
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGGCCACGG AGACCCTCAC CCTCGGCAAG GCCCTCAACG CCGGGATGCG CAAGGCCCTG 
GAGAACGACC CGAAGGTCGT CATCATGGGC GAGGACGTCG GCAAGCTCGG TGGTGTCTTC
CGGATCACCG ACGGGCTGCA GAAGGACTTC GGCGACCAGC GGGTGATCGA TACCCCGCTC
GCCGAGTCGG GCATCATCGG TACCGCAATC GGCCTGGCCA TCCGTGGCTA CCGGCCGGTC
TGCGAGATCC AGTTCGACGG TTTCGTCTAC CCGGCGTACG ACCAGATCGT GTCGCAGGTG
GCGAAGATGC ACTACCGCTC CCGCGGCAAG CTCAGGATCC CGATGGTGAT CCGCATCCCG
TTCGGTGGCG GCATCGGGGC GGTGGAGCAC CACTCCGAGT CGCCCGAGGC CTACTTCGCG
CACACACCCG GGCTCAAGGT CGCCACCTGC GCCAGCCCGC AGGACGCGTA CGTGATGATC
CAGCAGGCCA TCGCGTCGGA CGACCCGATC GTGTTCCTCG AACCCAAGCG CCGCTACTGG
GAGAAGGGGC CGGTCGAGGT CGACGGGCCG CTGCCGGAGG CGTACCCGCT GCACGCCGCC
CGCGTCGCGC GGCCGGGCAC CGACGCGACC CTGATCGGGT ACGGGCCGAT GGTGCGTACC
TGCCTGGACG CGGCGACCGC CGCCGCCGAG GACGGCCGTG AGTTGGAGGT CATCGACCTA
CGCACGCTCG CCCCGCTGGA CCTGGGCCTG GTGTACGAGT CGGTGCGCCG TACCGGTCGG
GCCGTGGTGG TGCACGAGGC ACCGTCGAAC ATCGGCCTCG GGGCCGAGGT CGCGGCCCGG
ATCACCGAGG AGTGCTTCTA CTCCCTGGAG TCCCCGGTGC TACGGGTTAC CGGCTTCGAC
ATCCCCTACC CGGCCTCCCG GGTGGAGGAG GAGTACCTAC CCGACCTTGA CCGGGTGCTC
GACGCCGTCG ACCGCACCTT CGGCTGGTGA
 
Protein sequence
MATETLTLGK ALNAGMRKAL ENDPKVVIMG EDVGKLGGVF RITDGLQKDF GDQRVIDTPL 
AESGIIGTAI GLAIRGYRPV CEIQFDGFVY PAYDQIVSQV AKMHYRSRGK LRIPMVIRIP
FGGGIGAVEH HSESPEAYFA HTPGLKVATC ASPQDAYVMI QQAIASDDPI VFLEPKRRYW
EKGPVEVDGP LPEAYPLHAA RVARPGTDAT LIGYGPMVRT CLDAATAAAE DGRELEVIDL
RTLAPLDLGL VYESVRRTGR AVVVHEAPSN IGLGAEVAAR ITEECFYSLE SPVLRVTGFD
IPYPASRVEE EYLPDLDRVL DAVDRTFGW
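
The gene sequence above is printed in space-separated blocks of ten bases, so it needs cleaning before fields such as GC content can be recomputed from it. The sketch below shows one way to do that. The function names are hypothetical, the start-codon set is a simplification of translation table 11, and only the first two blocks of the sequence are used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


def looks_like_complete_cds(seq: str) -> bool:
    """Rough check: length divisible by 3, common start codon, stop codon.

    Assumption: only ATG, GTG and TTG are accepted as start codons.
    """
    return (
        len(seq) % 3 == 0
        and seq[:3] in {"ATG", "GTG", "TTG"}
        and seq[-3:] in {"TAA", "TAG", "TGA"}
    )


# First two blocks of the gene sequence, used only as sample input.
sample = clean_sequence("ATGGCCACGG AGACCCTCAC")
print(len(sample))         # 20
print(gc_percent(sample))  # 65.0
```

Passing the full gene sequence through `clean_sequence` and then `gc_percent` would allow a comparison against the GC content field listed in the record.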