Gene Strop_2487 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagStrop_2487 
Symbol 
ID5058950 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora tropica CNB-440 
KingdomBacteria 
Replicon accessionNC_009380 
Strand
Start bp2797350 
End bp2798321 
Gene Length972 bp 
Protein Length323 aa 
Translation table11 
GC content73% 
IMG OID640474745 
Productdehydrogenase, E1 component 
Protein accessionYP_001159311 
Protein GI145595014 
COG category[C] Energy production and conversion 
COG ID[COG1071] Pyruvate/2-oxoglutarate dehydrogenase complex, dehydrogenase (E1) component, eukaryotic type, alpha subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value0.777225 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACCGAAG TCGGCTCGGT CCGGCTCTAC CGCACGGTAC GGCTGATCCG CCGGTTTGAG 
GAGCGGGCGA TTGAACTCGT CCGCTCCGGC CACATCGTCG GCGGCATCCA CCCGTACGTC
GGCCAGGAGG GGATCGCGGC CGGGGTGTGT GCGGCGCTGC GCCCCGACGA CGTGGTCGCC
GGCACCCACC GGGGGCACGG CCACGTGCTC GCGAAGGGGG CCGATCCAGC CCGGATGATG
GCGGAACTGT GCGGTCGGGT TACGGGCCTG AACCGGGGCC GGGGCGGCTC GATGCACGCT
GCCGACTTCG CGGTCGGGGT GCTCGGCGCC AACGCCATCG TCGGCGCCGG TGGCGCGATT
GTCACCGGCG CCGTCTGGGC CCGGCGCCGG CGCGGTGACG ACCTGGTGGG GGTGAGCTTC
CTCGGCGACG GTGCGGTCAA CGAGGGGATG CTGTTGGAGG CGTTCAACCT GGCCGCACTC
TGGCGGGTGC CGGTGCTGTT TGTGTGCGAG AACAACGGCT ACGCCACGAC GATGCCGGTG
GCCGACGCGG TGGCCGGCAG CATTCCGGCA CGGGCGGAGG CATTCGGCAT TCGGACGTCC
GTGGTGGACG GCCAGGACCC GGCCGCCGTA CAGGCCACCA CCGCTGCCGC CCTCACCCGG
ATGCGCGCCG GCGGTGGCCC CGAGTTCCTG GAGGCCCAGA CCTACCGGTT CGATGCCCAC
CACACTTTCG AGCACGCGGT CCGCCTCGAC TATCGCTCGG TGGAGGAGGT CGAACGCGGC
CGGTCCCGGG ATCCGGTGCG GATCGCCGGC TCGCGGCTGT CGGCCACCGA ACGGGCGAAG
GTCGACGCCG ACGTGGAGGC GGTGCTCGAC GCGGCGGTGG CCGAGGCCCT CGCCGCCCCC
GAGCCCGACC CGGCCACCGC ACTGGAGCAC CTGTACGCCA GCGGGCTGAC CGCCCGCACT
GGAGGTGGGT AG
 
Protein sequence
MTEVGSVRLY RTVRLIRRFE ERAIELVRSG HIVGGIHPYV GQEGIAAGVC AALRPDDVVA 
GTHRGHGHVL AKGADPARMM AELCGRVTGL NRGRGGSMHA ADFAVGVLGA NAIVGAGGAI
VTGAVWARRR RGDDLVGVSF LGDGAVNEGM LLEAFNLAAL WRVPVLFVCE NNGYATTMPV
ADAVAGSIPA RAEAFGIRTS VVDGQDPAAV QATTAAALTR MRAGGGPEFL EAQTYRFDAH
HTFEHAVRLD YRSVEEVERG RSRDPVRIAG SRLSATERAK VDADVEAVLD AAVAEALAAP
EPDPATALEH LYASGLTART GGG