Gene Sare_2238 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_2238 
Symbol 
ID5704301 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp2574113 
End bp2575261 
Gene Length1149 bp 
Protein Length382 aa 
Translation table11 
GC content72% 
IMG OID641271718 
Productpyruvate dehydrogenase (acetyl-transferring) 
Protein accessionYP_001537089 
Protein GI159037836 
COG category[C] Energy production and conversion 
COG ID[COG1071] Pyruvate/2-oxoglutarate dehydrogenase complex, dehydrogenase (E1) component, eukaryotic type, alpha subunit 
TIGRFAM ID[TIGR03181] pyruvate dehydrogenase E1 component, alpha subunit 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.191057 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.196658 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACGACCA CACCCCGGGC GGTCCGCAGA AAGTCCCGGC CGGCCGCACA GCCGGACCCG 
GCCCACCCAC TACTGCCAGC CGGTGAACAG ATCCGCCTGC TCGACCCCGC CGGCACTCCG
CTCCCGGCTC ACCCCGACTA CCCAGAGCCA CCCGTCGAGG CGCTCGTCGA GCTGTACCGG
CGGATGGTGA TCGGCCGCCG GTTCGACCAA CAGGCCACAG CGCTGACCAA ACAGGGCCGG
CTGGCCGTCT ACCCGTCCGC CCGGGGTCAG GAGGCGTGCC AGGTCGGCGC GGTCCTCGCA
CTGCGCGACG ACGACTGGGT GTTCCCGACC TACCGTGAGT CCATGGCGCT GACCGCCCGG
GGGATCGACC CCGTCGAGGT GCTGACCCTG CTGCGCGGAG ACTGGCACTG CGGCTACGAC
CCGGTCCTCC GGCGAAGCGC CCCGCAGTGC ACCCCGCTGG CGACCCAGTG CGTGCACGCC
GCCGGCCTCG CCTACGGGGA GGCGTACCAG GGCCGGGAGA CGGTGGCCCT GACCTTCATC
GGCGACGGCG CCACCAGCGA GGGCGACTTC CACGAGGGGG TCAACTTCGC CGCCGTGTTC
AAGGCGCCGG TCGTCTACTT TGTGCAGAAC AACCGGTACG CGATCAGCGT CCCGCTGTCC
CGACAGACCG CCGCGCCCAG TCTGGCGTAC AAGGGCGTCG GCTACGGCGT GCCCAGCGAG
CAGGTCGACG GCAACGACCC GGTCGCCGTG CTCGCCGTGC TCACCCGGGC CGTGGCACAC
GCCCGCGCCG GCCACGGCCC CTTCCTGGTG GAGGCTCACA CCTACCGGAT GGAGCCACAC
ACCAACGCCG ACGACGCCAC TCGCTACCGC GACGCCGATG AGGTGGCCGT CTGGCAGGAC
CGTGACCCGG TCGCCCGGTT GGAGACCTAC CTGCGAGCCC GGCGCGCGTT GGACGACACC
ATCGTGGCGC GGGTCGCCGG GCAGGCCGAG GAGTACGCGG CCGATCTGCG CGAGCGGATG
CACGACAAGC CGACCGTCGA CCCGATGACG CTCTTCGACC ACGTCTACGC CGAACCGACG
CCGCAACTGG CCGAACAGCG CGAGCAGGTC CGCGCCGAAC TGACCGCCGA CCAGGAGGGA
GCCGCGTGA
 
Protein sequence
MTTTPRAVRR KSRPAAQPDP AHPLLPAGEQ IRLLDPAGTP LPAHPDYPEP PVEALVELYR 
RMVIGRRFDQ QATALTKQGR LAVYPSARGQ EACQVGAVLA LRDDDWVFPT YRESMALTAR
GIDPVEVLTL LRGDWHCGYD PVLRRSAPQC TPLATQCVHA AGLAYGEAYQ GRETVALTFI
GDGATSEGDF HEGVNFAAVF KAPVVYFVQN NRYAISVPLS RQTAAPSLAY KGVGYGVPSE
QVDGNDPVAV LAVLTRAVAH ARAGHGPFLV EAHTYRMEPH TNADDATRYR DADEVAVWQD
RDPVARLETY LRARRALDDT IVARVAGQAE EYAADLRERM HDKPTVDPMT LFDHVYAEPT
PQLAEQREQV RAELTADQEG AA