Gene Smed_1076 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_1076 
Symbol 
ID5321922 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009636 
Strand
Start bp1144482 
End bp1145528 
Gene Length1047 bp 
Protein Length348 aa 
Translation table11 
GC content60% 
IMG OID640790018 
Productdehydrogenase E1 component 
Protein accessionYP_001326763 
Protein GI150396296 
COG category[C] Energy production and conversion 
COG ID[COG1071] Pyruvate/2-oxoglutarate dehydrogenase complex, dehydrogenase (E1) component, eukaryotic type, alpha subunit 
TIGRFAM ID[TIGR03182] pyruvate dehydrogenase E1 component, alpha subunit 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.415529 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTCCGC GAAAAACCGC GTCCGTTTCC AGCCGCAAAA CCGCTGCCAA GCCGGTCAAG 
AAAGATTTTG CCGGCGGCAC GATCGCCGAG TTCTCCAAGG AAGACGATCT CAAGGCTTAC
CGCGAAATGC TGTTGATCCG GCGTTTCGAG GAAAAGGCCG GCCAGCTTTA TGGCATGGGG
TTCATCGGCG GTTTCTGTCA CCTCTATATC GGCCAGGAAG CCGTCGTCGT CGGCATGCAG
CTTGCCTTGA AAGAGGGCGA TCAGGTCATC ACCGGCTATC GTGACCACGG CCACATGCTC
GCCTGCGGCA TGAGTGCGCG CGGCGTGATG GCGGAGCTCA CCGGTCGCCG TGGCGGCCTT
TCCAAGGGGA AGGGCGGCTC GATGCATATG TTCTCCAAGG AAAAACACTT CTATGGCGGT
CACGGCATCG TCGGTGCGCA GGTTTCGCTC GGCACCGGCC TTGCCTTCGC CAACAGATAT
CGCGGCAACG ACAATGTCAG CCTCGCCTAT TTCGGCGACG GTGCGGCCAA TCAGGGCCAG
GTCTATGAGA GCTTCAACAT GGCCGCTCTC TGGAAATTGC CGGTGATCTA CATCGTCGAA
AACAACCGCT ATGCCATGGG TACCTCCGTG TCGCGTGCCT CGGCGCAGAC CGACTTCTCC
CAGCGCGGCG CATCCTTCGG CATTCCGGGC TATCAGGTGG ACGGCATGGA TGTCCGCGCC
GTCAAGGCCG CAGCCGACGA GGCGGTGGAG CATTGCCGTT CCGGCAAGGG GCCGATCATC
CTTGAGATGC TGACCTACCG CTACCGCGGC CATTCGATGT CCGATCCGGC GAAGTATCGC
TCGAAGGACG AAGTACAGAA GATGCGTTCG GAGCATGATC CGATCGAGCA GGTGAAGGCC
CGCCTCATGG ATAAAGGCTG GGCCACCGAG GATGAGCTGA AGCAGATCGA CAAGGAGGTT
CGCGACATCG TCGCGGACAG TGCCGATTTC GCCCAGTCTG ATCCGGAGCC GGATGTTTCC
GAGCTCTACA CCGATATCCT GCTTTGA
 
Protein sequence
MAPRKTASVS SRKTAAKPVK KDFAGGTIAE FSKEDDLKAY REMLLIRRFE EKAGQLYGMG 
FIGGFCHLYI GQEAVVVGMQ LALKEGDQVI TGYRDHGHML ACGMSARGVM AELTGRRGGL
SKGKGGSMHM FSKEKHFYGG HGIVGAQVSL GTGLAFANRY RGNDNVSLAY FGDGAANQGQ
VYESFNMAAL WKLPVIYIVE NNRYAMGTSV SRASAQTDFS QRGASFGIPG YQVDGMDVRA
VKAAADEAVE HCRSGKGPII LEMLTYRYRG HSMSDPAKYR SKDEVQKMRS EHDPIEQVKA
RLMDKGWATE DELKQIDKEV RDIVADSADF AQSDPEPDVS ELYTDILL