Gene Sare_0107 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_0107 
Symbol 
ID5707056 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp119637 
End bp121109 
Gene Length1473 bp 
Protein Length490 aa 
Translation table11 
GC content71% 
IMG OID641269633 
Productdehydrogenase catalytic domain-containing protein 
Protein accessionYP_001535033 
Protein GI159035780 
COG category[C] Energy production and conversion 
COG ID[COG0508] Pyruvate/2-oxoglutarate dehydrogenase complex, dihydrolipoamide acyltransferase (E2) component, and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.390532 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000261598 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGTCGCGAA TCAAGGAATT CAACCTGCCC GACCTCGGCG AGGGCCTGAC TGAGGGGGAG 
ATCCTCAGCT GGCTCGTGAA GGTGGGCGAC ACCGTCGAGC TGAACCAACC CATCGTCGAG
GTGGAGACCG CCAAGGCGGC GGTCGAGATC CCGGCGAAGT GGGCCGGCCG GGTCCAGTCG
ATCTTCCATG CGGAAGGCGC GACCGTCGAG GTCGGGTCGC CGATCATCGC GATCGACACG
GATCCGACCG CCGGCCCGGT CGAGGCAACG GAGTCGGTTG AGGCGGCCGG CACCGTTTCG
GGTGCTCCGT CGGCCGCCGC GCCGGCGGCG GTGACCTCCA CCGAGGGTGC GGGCGAGTCG
GGTCAGGGTG GGCGCACCCC CGTGCTGGTG GGCTACGGCC CGCGCACCAC TGTCGCGAAG
CGTCGCCCGC GTAAGGGTGC CGCGGCTTCG GCAGCGGTGC CGGCCGCACC GACACCCGCA
CCGCCGGTAC CCGCACCAGC CGCACCGCCG CGACCGGCAC CCGCACCGGC GGTCACCGGG
CCGACCACCG TCGGCAACGG GCGCGGCGGT CCGGCCGGCG GCGCTCTGGT GTTGGCCAAG
CCCCCGGTAC GCAAGCTGGC GAAGGACCTC GGGGTTGACC TGTCCACCCT GACCGGGTCG
GGTCCGCTCG GCTCGATCAG CCGAGACGAT GTGCAGCGGG CGGCGAGCGC CACCACCACG
GCCGAACCGC TGGCGGTGGC CGCGGCGGGC AGTACGGCAG CGAGTGTCGG CGCGCACCGC
GAGCAGCGGA TCCCGGTCAA GGGGGTCCGG AAGCTGACCG CGGAGAACAT GTCCCGCTCG
GCGTTCACGG CACCGCACGT GACGGAGTTC CTGACCGTCG ACATGACCCG GGCGATGAAG
GCCCTGGACC GTCTCCGTCA GCGACGCGAG TGGCGGGACG TCCGGGTCTC TCCGCTGCTG
CTGGTCGCCA AGGCGGTGCT GCTGGCGGTC CGGCGCCATC CGATGGTGAA CGCGACCTGG
GCCGGCGAGG AGATCGTCGT CAAGGACTAC GTGAACCTCG GCATCGCGGC GGCGACCGAG
CGCGGCCTGA TCGTGCCGAA CGTGAAGGAC GCGGGGCGGC TCAGCCTGCG GGAGTTGGCG
GATGCTCTGA CCGATCTCGT GCAGACTGCC AAGACGGGGA AGACCTCCCC GGCGGACATG
TCCGGCGGCA CCCTGACCAT CACCAACGTC GGGGTCTTCG GCGTGGACAC CGGTACGCCG
ATTCTGCCGC CGGGTGAGTC GGCGATCCTG GCCTTCGGTG CGGTCCGAAA GATGCCGTGG
GTGCACAAGG GCAAGGTTCG TCCCCGCCAG GTCACCACGC TCGGGTTGTC GTTCGACCAT
CGGATCATTG ACGGCGAGCT CGGGTCGAGG TTCCTGCGGG ATGTCGGCGA CTTCCTCGCC
GATCCCGAGG CGGCGTTGCT CGCCTGGACC TGA
 
Protein sequence
MSRIKEFNLP DLGEGLTEGE ILSWLVKVGD TVELNQPIVE VETAKAAVEI PAKWAGRVQS 
IFHAEGATVE VGSPIIAIDT DPTAGPVEAT ESVEAAGTVS GAPSAAAPAA VTSTEGAGES
GQGGRTPVLV GYGPRTTVAK RRPRKGAAAS AAVPAAPTPA PPVPAPAAPP RPAPAPAVTG
PTTVGNGRGG PAGGALVLAK PPVRKLAKDL GVDLSTLTGS GPLGSISRDD VQRAASATTT
AEPLAVAAAG STAASVGAHR EQRIPVKGVR KLTAENMSRS AFTAPHVTEF LTVDMTRAMK
ALDRLRQRRE WRDVRVSPLL LVAKAVLLAV RRHPMVNATW AGEEIVVKDY VNLGIAAATE
RGLIVPNVKD AGRLSLRELA DALTDLVQTA KTGKTSPADM SGGTLTITNV GVFGVDTGTP
ILPPGESAIL AFGAVRKMPW VHKGKVRPRQ VTTLGLSFDH RIIDGELGSR FLRDVGDFLA
DPEAALLAWT