Gene Smed_5071 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_5071 
Symbol 
ID5319373 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009621 
Strand
Start bp18021 
End bp19382 
Gene Length1362 bp 
Protein Length453 aa 
Translation table11 
GC content67% 
IMG OID640776851 
ProductBFD/(2Fe-2S)-binding domain-containing protein 
Protein accessionYP_001313783 
Protein GI150377188 
COG category[C] Energy production and conversion 
COG ID[COG1249] Pyruvate/2-oxoglutarate dehydrogenase complex, dihydrolipoamide dehydrogenase (E3) component, and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGTGAGG CTGACATCAT CGTCGTCGGC GGTGGACCGG CCGGCGTGTC GGCAGCCGTC 
GAAGCCGCGA AATCAGGGCT CTCGGTGGTG CTTTGCGAGC AGAGGCCGGC GCTTGGCGGC
GCCATCCATC GTCAGCCCGC CGAGGGCGCG GCACCGGTAG CGGCTTTGCC GTCGCTTGCG
CGCCGCTGGC AGTCTCTGTC AGCGAAACTT TCCGCCTCGC GCGTGGAAGT CCGGACACGC
AGGGCATTTG TGGGAATTGA CAGCACCGGC GCCGTTCTGA TCGATGACCG GGCAGCGGGA
AAAGTCGAGG TCCGCCGGCC ACGCGCCCTG ATCCTGGCCT GCGGGGCTGT GGAGCGGGTG
AGGCCCCGGC CGGGCTGGCA CTTGCCGGGC GTCGCTGCAG CAGGCGGTCT CCAGGTGATG
CTGAAAGAGG GCCGGGTGCC GGAGGGACGC GTCCTGCTTG CGGGAAGCGG CCCTTTGCTG
CTCGCACTGG CCGCACAAAT GACGGCAGCC GGCAATCCGC CCGTGGCGGT CGTCGAGGAG
GGAGATCCCG TTTCGTGGCC CCTGGCAGCG ACCCGTCTGC TGACCCATCC ATTCATCCTT
CCGGATATGG CTGTCCTGAT GATGCCGGTC CTTCTCCGCC GCTTTCTGTG GCGGCGCGGC
ACGCGCCTGA CGGAGATTAC CCAATCCGCC GACATGCTTA GCGCAAGGCT GCGTGCACCA
GACGGACGGG AGGAGCGGTT CGAGGTCGAC CGTATCGGGC TTCATGACGG GCTGCGTGCG
AACGATTTCG GTCTGACGGC CGATCCCCCG TCAGGGCTCG TCATCTTGCG GGCAGGCGAT
CTGCGCGAAG TCCTCGGAGC GCATGCCGCC GAAGCGGATG GAGCCGAAGC GGGCCGCGAA
GCCGCAGCCC GGCTGGAAGG GCGACCGTCA TGCGGCAGCG GGAGCGCCAT GCGACGCCAC
CGCGCCCTGC AGCACAGGCT TTCCCGCATT TTCGCGCCGG CCCAAGGCGC GACGGTGCTG
AGGGACTGTC CCGACGAAAC CGTGATCTGT CGTTGCGAGG GCCGGACGAT CGGCCACTTA
AAGGAGCAGC TGGCCGGGCC TGACGCTGTT TCAGCACGAG AATTGCGGCT CAACGGGCGA
TTCGGTATGG GCGCGTGCCA GGGACGCTTC TGTTCCGAAT GGGCGCTTTC CCTGATGTCC
GAACTGCGTC GGTCAGCGGG AGTGCCAGCC TCTCCGCCAG ACTTCGCCGA AACGGGCGTT
TGCCGTTGGC CGTTGCGGCC CGTGGCTCTA TCCTCGCTCG CAAACGCTGG CATCTACGAT
GACACCCGCA CAGAACGGCA CATCGAAGGA ATCTCGGCAT GA
 
Protein sequence
MREADIIVVG GGPAGVSAAV EAAKSGLSVV LCEQRPALGG AIHRQPAEGA APVAALPSLA 
RRWQSLSAKL SASRVEVRTR RAFVGIDSTG AVLIDDRAAG KVEVRRPRAL ILACGAVERV
RPRPGWHLPG VAAAGGLQVM LKEGRVPEGR VLLAGSGPLL LALAAQMTAA GNPPVAVVEE
GDPVSWPLAA TRLLTHPFIL PDMAVLMMPV LLRRFLWRRG TRLTEITQSA DMLSARLRAP
DGREERFEVD RIGLHDGLRA NDFGLTADPP SGLVILRAGD LREVLGAHAA EADGAEAGRE
AAARLEGRPS CGSGSAMRRH RALQHRLSRI FAPAQGATVL RDCPDETVIC RCEGRTIGHL
KEQLAGPDAV SARELRLNGR FGMGACQGRF CSEWALSLMS ELRRSAGVPA SPPDFAETGV
CRWPLRPVAL SSLANAGIYD DTRTERHIEG ISA