Gene Sare_2670 details

Gene Information

Locus tag: Sare_2670
Symbol:
ID: 5706981
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 3045551
End bp: 3046522
Gene Length: 972 bp
Protein Length: 323 aa
Translation table: 11
GC content: 74%
IMG OID: 641272128
Product: dehydrogenase E1 component
Protein accession: YP_001537498
Protein GI: 159038245
COG category: [C] Energy production and conversion
COG ID: [COG1071] Pyruvate/2-oxoglutarate dehydrogenase complex, dehydrogenase (E1) component, eukaryotic type, alpha subunit
TIGRFAM ID:
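
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the unspliced CDS ends in a single stop codon; the function names are hypothetical and not part of the record.

```python
# Values copied from the record above.
START_BP = 3045551
END_BP = 3046522


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS: codon count minus one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    n = gene_length(START_BP, END_BP)
    print(n, protein_length(n))  # 972 323
```

Under these assumptions the result matches the listed gene length of 972 bp and protein length of 323 aa.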


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0000432402
Fosmid Hitchhiker: No
Fosmid clonability: decreased coverage
 

Sequence

Gene sequence
GTGACCGGAG TCGACCCGGT CCGGCTCTAC CGCACGGTAC GGCTGATCCG CCGGTTCGAG 
GAGCGGGCGA TCGAGCTCGT CCGCTCCGGC CACATCGTCG GCGGTATCCA CCCGTACATC
GGCCAGGAGG GGATCGCGGC CGGGGTGTGC GCGGCGCTGC GCCCCGACGA CGTGGTCGCC
GGCACCCACC GGGGGCACGG GCACGTGCTC GCGAAGGGTG CCGACCCGGC CCGGATGATG
GCGGAACTGT GCGGTCGGGT TACGGGCCTG AACCGGGGCC GGGGCGGGTC GATGCACGCC
GCCGACTTCG CGGTCGGGGT GCTCGGCGCC AACGCCATCG TCGGCGCCGG TGGCGCGATC
GTCACCGGCG CCGTCTGGGC CCGGCGCCGG CGCGGCGAGG ATCTGGTGGG GGTGAGTTTC
CTCGGCGACG GCGCGGTCAA CGAGGGGATG CTGCTGGAGG CGTTCAACCT GGCGGCACTC
TGGCGGGTGC CGGTGCTGTT CGTGTGTGAG AACAACGGCT ACGCCACCAC CATGCCGGTG
GCCGACTCGG TGGCCGGCAG CATCCCGGCA CGGGCGGAGG CGTTCGGCAT CCGGGCGTCC
GTGGTGGACG GCCAGGACCC GGCCGCCGTG CACGCCACCA CCGCTGCCGC CCTCGCCCGG
ATGCGCGCCG GCGGTGGCCC CGAGTTCCTG GAGGCCAGGA CCTACCGGTT CGATGCCCAC
CACACCTTCG AGCACACGGT CCGCCTCGAC TATCGCTCGG CGGAGGAGGT CGAACGCGGC
CGGTCCCGGG ATCCGGTGCG GATCGCCGGC TCGCGGCTGT CGGCCACCGA TCGGGCGAAC
GTCGACGCCG ACGTGGAGGC GGTGCTCGAC GTGGCGGTGG CCGAGGCTCT CGCCGCCCCC
GAGCCCGACC CGGCCACCGC ACTGGAGCAC CTGTACGCCA GCGGGCTGAC GGCCCGCACT
GGAGGTGGGT AG
 
Protein sequence
MTGVDPVRLY RTVRLIRRFE ERAIELVRSG HIVGGIHPYI GQEGIAAGVC AALRPDDVVA 
GTHRGHGHVL AKGADPARMM AELCGRVTGL NRGRGGSMHA ADFAVGVLGA NAIVGAGGAI
VTGAVWARRR RGEDLVGVSF LGDGAVNEGM LLEAFNLAAL WRVPVLFVCE NNGYATTMPV
ADSVAGSIPA RAEAFGIRAS VVDGQDPAAV HATTAAALAR MRAGGGPEFL EARTYRFDAH
HTFEHTVRLD YRSAEEVERG RSRDPVRIAG SRLSATDRAN VDADVEAVLD VAVAEALAAP
EPDPATALEH LYASGLTART GGG
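
The sequences above are printed in space-separated blocks of ten characters, so they need the whitespace stripped before any calculation. The following is a minimal sketch of that clean-up together with a GC-fraction calculation; the helper names are hypothetical, and the example uses only the first two blocks of the gene sequence, not the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(text: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


if __name__ == "__main__":
    # First two ten-base blocks of the gene sequence, copied from the record.
    first_blocks = "GTGACCGGAG TCGACCCGGT"
    print(len(clean_sequence(first_blocks)))  # 20
    print(gc_fraction(first_blocks))          # 0.7
```

Passing the full gene sequence to `gc_fraction` in the same way would allow a comparison with the GC content value listed in the record.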