Gene Smed_4037 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_4037 
Symbol 
ID5318337 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009620 
Strand
Start bp498798 
End bp499769 
Gene Length972 bp 
Protein Length323 aa 
Translation table11 
GC content65% 
IMG OID640775845 
Productputative dehydrogenase protein 
Protein accessionYP_001312778 
Protein GI150376182 
COG category[C] Energy production and conversion 
COG ID[COG2141] Coenzyme F420-dependent N5,N10-methylene tetrahydromethanopterin reductase and related flavin-dependent oxidoreductases 
TIGRFAM ID[TIGR03557] F420-dependent oxidoreductase, G6PDH family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.122035 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGAGGA TCGGCTATCA CGCATCGCAC GAGCAATTTA CGCCGCTCGA CCTGTTGGGC 
TGGGCGCGGG CGGCGGAAGA GGCCGGCTTC GATTGCACCA TGTCGTCCGA CCATCTCGCG
CCCTGGAGCG AGCGGCAGGG GCAAAGCGGC TTTGCCTGGG CGTGGCTCGG CGCCGCCCTA
CAGGCGACAG AGAAGAGCTT CGGCCTCGTC ACGGTTCCTT GCGGCTGGCG CTATCACCCC
GCGATAACGG CGCAGGCAGC TGCAACCCTT GCGCAGATGT TCCCGCGGCG GCTTGCCTGG
CTGGCGTTGG GCAGTGGGGA GGCGCTGAAC GAACAGGCTG TCGGTGGGGT CTGGCCCGAA
AAGGCGGAGC GAAGGGCCAG ACTCCTCGAG GCGGTCGAGG TCATCCGCGA GCTTTGGGCC
GGCCGGACGG TTAACCGGCA AGCACCCATT GCCGTGTCGG AGGCCCGCCT TTATACGCTC
GCCGAGCACC CGCCGGCGCT GATCGCCGCG GCTCTAACGC CTGAAACGGC CGAAACGGCC
GGAGAATGGG CGGACGGTCT CATCACCGTC AATCAGTCGT CGACAAAGCT TGCCGCCATT
GCCGAGGCCT TCAGGCGCGG CGGCGGCGAC GGCAAGCCTC TCTGCCTTCA GGTCCATGTC
TCCTATGCAC AGACGGACGA GGAGGCGCGG CAAAATGCTT TCGATCAGTG GAGGAGCAAC
GTGCTCAGCC CCGGTCAGTC GGAGACGCTG AGGACGCCGG GTGAAATCGA GTCAGCCACG
AAGAGCGTTC GTCCCGAGGA TCTCGACAAG CATGTCAGGA TCTCCTCGGA TCCGGGGCGG
CACGCCGCCT GGATCGAGGA GGATATCGCC GCCGGCTTCG ACGAGATATA TCTTCACAAT
GTCGGTCGCA ATCAGCTTGA GTTCATCGAT GTCTTTGGCA GATCGGTTCT GCCGCGTGTG
CGCGCCTTCT GA
 
Protein sequence
MARIGYHASH EQFTPLDLLG WARAAEEAGF DCTMSSDHLA PWSERQGQSG FAWAWLGAAL 
QATEKSFGLV TVPCGWRYHP AITAQAAATL AQMFPRRLAW LALGSGEALN EQAVGGVWPE
KAERRARLLE AVEVIRELWA GRTVNRQAPI AVSEARLYTL AEHPPALIAA ALTPETAETA
GEWADGLITV NQSSTKLAAI AEAFRRGGGD GKPLCLQVHV SYAQTDEEAR QNAFDQWRSN
VLSPGQSETL RTPGEIESAT KSVRPEDLDK HVRISSDPGR HAAWIEEDIA AGFDEIYLHN
VGRNQLEFID VFGRSVLPRV RAF