Gene Smed_5005 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_5005 
Symbol 
ID5318654 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009620 
Strand
Start bp1522703 
End bp1523929 
Gene Length1227 bp 
Protein Length408 aa 
Translation table11 
GC content69% 
IMG OID640776787 
ProductFAD-dependent pyridine nucleotide-disulphide oxidoreductase 
Protein accessionYP_001313719 
Protein GI150377123 
COG category[R] General function prediction only 
COG ID[COG0446] Uncharacterized NAD(FAD)-dependent dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0161752 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.0150156 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGGCAGA TCGTGATCGT CGGCGCCGGC GAATGTGGCG CGCGGGCAGC CTTCGCGCTC 
AGGGAGAAAG GCTTTGACGG AGAGATCACC CTGATCGGCG CCGAGCCGCT CCTTCCATAT
GAGCGGCCGC CGCTTTCGAA GGATGGCCTG GCCGCTGCCG CAGCGCCGAA ATTCGTTGCG
GCGCCCGAGC GCTATCGCGA GGCGAAGATT GCGGTGCTGA CCGGCACCGC GGTCGAGGCC
ATCGGACGTG CACGGAAGAC CGTCACGCTT TCGGACGGGC GATCCTTCGC CTATGACCGG
CTGCTGCTTG CCACCGGCGC CCGGCCGCGC GCGCTCCCGG GCATGCCCGG CAACTCCGCG
CGCATCCGCA TGCTGAGGAC GCATGCCGAT GCGCTGACGA TCCGCGCCGC ACTCGCGCCC
GGCCTCAAGC TCGCCATCAT CGGCGGCGGC TTCATCGGAC TGGAGCTTGC CGCAACGGCC
CGCAGGCTTG GCGCCGAGGT CGTGCTTGTG GAAGGCCAGC CGCGCATTCT CTCCCGCGGC
GTACCGGAGG AGATCGCCGC CGTCGTCGCT GAGCGGCACC GGCAGGAAGG CGTGGAGATC
ATCTGCGGCG CCGGCATCGC CGCCATCGAG GAAGGCGCAG ACGGCGCAAG CCTCCTGCTG
GCCGTGGGCG GCAGGATTGA CGCCGATCTT GTTGTCGTCG GCGTCGGCGC AGTGCCGAAC
AGCGAGCTCG CCGAGGCTGC GGGCCTCATG ATCGATAACG GCATCGCCGT CGACGAGAGG
CTTTGCACCT CGGACCCCGA CATTCTTGCC GCCGGCGACT GCTGCTCCTT TCCGCTTTCG
CATTATGGCG GACGGCGGGT GCGGCTCGAA GCCTGGCGCA ATGCCCAGGA CCAGGGGGCG
CTCGCCGCGG CCAACCTGAT GGGCGCGGGC GAACATATCG CCAGCGTGCC GTGGTTCTGG
TCGGATCAGT ATGAGCTTAC GCTGCAGATA TCGGGCCTCG CCGAAGGCAC CGCCGCGACC
GTCCGCCGCG AACTGGAGGA GGGGGCCTTC ATCCTCTTCC ATCTCGATGG CGAAGGCCGG
CTGATCGCCG CAAGCGGCAT CGGCCCCGGC AATGCCGTCG CCCGCGATAT ACGCCTGGCC
GAAATGCTGA TTGCCGCCGG CGCGAAGCCG GATCCGCAGG CCCTTGCGTC ACCCGAAATC
AGGCTCAAGA AGCTGCTGGC CGCCTGA
 
Protein sequence
MRQIVIVGAG ECGARAAFAL REKGFDGEIT LIGAEPLLPY ERPPLSKDGL AAAAAPKFVA 
APERYREAKI AVLTGTAVEA IGRARKTVTL SDGRSFAYDR LLLATGARPR ALPGMPGNSA
RIRMLRTHAD ALTIRAALAP GLKLAIIGGG FIGLELAATA RRLGAEVVLV EGQPRILSRG
VPEEIAAVVA ERHRQEGVEI ICGAGIAAIE EGADGASLLL AVGGRIDADL VVVGVGAVPN
SELAEAAGLM IDNGIAVDER LCTSDPDILA AGDCCSFPLS HYGGRRVRLE AWRNAQDQGA
LAAANLMGAG EHIASVPWFW SDQYELTLQI SGLAEGTAAT VRRELEEGAF ILFHLDGEGR
LIAASGIGPG NAVARDIRLA EMLIAAGAKP DPQALASPEI RLKKLLAA