Gene Smed_1783 details

Gene Information

Locus tag: Smed_1783
Symbol:
ID: 5322641
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009636
Strand:
Start bp: 1865800
End bp: 1866903
Gene Length: 1104 bp
Protein Length: 367 aa
Translation table: 11
GC content: 62%
IMG OID: 640790721
Product: methylenetetrahydrofolate reductase
Protein accession: YP_001327453
Protein GI: 150396986
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0685] 5,10-methylenetetrahydrofolate reductase
TIGRFAM ID:
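
The start, end, gene length, and protein length fields can be cross-checked against each other with simple arithmetic. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The variable names are made up for this example and are not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1865800
end_bp = 1866903
protein_length_aa = 367

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length_bp = end_bp - start_bp + 1

# A coding sequence should be a whole number of codons.
remainder = gene_length_bp % 3

# Assumption: the last codon is a stop codon and is not part of the protein.
expected_protein_length = gene_length_bp // 3 - 1

print(gene_length_bp)           # 1104
print(remainder)                # 0
print(expected_protein_length)  # 367
```

Under these assumptions the computed values agree with the Gene Length (1104 bp) and Protein Length (367 aa) fields.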


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.182207
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.00669682
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAGCCCGG ATATCAATCC TTACGATCCG GGCGCCCCGC TCGACCCGCT CCCCGGCCAT 
TCTTCGCGCG GGCGGCTGGA GCGCGTTCTG CGTCGCGGTG AATTCGCCGT GACCGCCGAG
CTCAACCCGC CGGACAGCGC CAATCCGCAT GACGTCTACG AGCGCGCGGC GATCTTCGAC
GGGTGGGTTG ACGGCATCAA CGCAGTCGAC GCCTCGGGCG CCAATTGCCA TATGTCTTCC
GTCGGTATCT GCGCGCTGCT GACGCGCATG GGCTATGCGC CGATCATGCA GATCGCCTGC
CGGGACAGAA ATCGCATCGC CATTCAGGGC GACGTTCTCG GCGCCTCGGC CATGGGCGTC
CAGAACATCA TGTGCTTGAC GGGCGACGGC GTACAGGCCG GCGACCAGCC CGGGGCCAAG
CCCGTCTTCG ATCTGGACTG CATGTCCCTG CTCGAGACGG TGCGCATCAT GCGCGACAAT
TCCAAGTTTC TTTCGGGCCG CAAGCTATCG ACGCCGCCCC ACGTATTTCT TGGAGCGGCA
ATCAACCCTT TCGCCCCTCC CTACGATTTC CGCCCCTACC GCCTCGCCAA GAAGATCGAA
GCCGGCGCTC AATTCGTCCA GAGCCAGTAT TGCTTCGACG TTCCGATGTT CCGCGAATAT
ATGAAAAAGG TGCGCGACCT CGGTTTGCAC GAGAAGTGCT TCATCCTGGT GGGCGTCGGG
CCGTTGGCTT CCGCCAAGAC TGCCCGCTGG ATCCGATCCA ATGTTCCTGG CATCCACATT
CCGGATGGCA TTATCAGGCG GCTCGAGGGC GCCCAGGATC AGAAGAAGGA AGGCAAGCAG
CTCTGCATCG ACGTCATGAA CGAGGTGAAG GAGATCGAGG GTGTCTCAGG CGTCCATGTC
ATGGCGTACC GCCAGGAGGA GTATGTCGCG GAAATCGTAC ATGAATCAGG CGTCCTGAGG
GGCCGCAAGC CGTGGAAGCG CGAAGCCGCG CCGACCGATG CGATGGTTGC CGAACGGCTG
GAGCATATTC GCGAAGGCAG GGAGGAAAAC CAGCAGCAGA TGGCCGAGGC CGCCGCGCAC
CACCCGCATG AGACAAAGCA GTGA
 
Protein sequence
MSPDINPYDP GAPLDPLPGH SSRGRLERVL RRGEFAVTAE LNPPDSANPH DVYERAAIFD 
GWVDGINAVD ASGANCHMSS VGICALLTRM GYAPIMQIAC RDRNRIAIQG DVLGASAMGV
QNIMCLTGDG VQAGDQPGAK PVFDLDCMSL LETVRIMRDN SKFLSGRKLS TPPHVFLGAA
INPFAPPYDF RPYRLAKKIE AGAQFVQSQY CFDVPMFREY MKKVRDLGLH EKCFILVGVG
PLASAKTARW IRSNVPGIHI PDGIIRRLEG AQDQKKEGKQ LCIDVMNEVK EIEGVSGVHV
MAYRQEEYVA EIVHESGVLR GRKPWKREAA PTDAMVAERL EHIREGREEN QQQMAEAAAH
HPHETKQ
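
The protein sequence can be checked against the gene sequence by translating codon by codon. The following is a minimal sketch, not the method used to produce this record. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two tables differ in which codons may act as start codons). The `translate` helper and the variable names are hypothetical, and only the first line and the last two codons of the gene sequence are used.

```python
from itertools import product

# Codon-to-amino-acid assignments in the conventional TCAG ordering.
# These are shared by the standard code and translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string, ignoring whitespace; '*' marks a stop codon."""
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3  # drop any trailing partial codon
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First line of the gene sequence above (60 nt, 20 codons).
first_line = "ATGAGCCCGG ATATCAATCC TTACGATCCG GGCGCCCCGC TCGACCCGCT CCCCGGCCAT"
print(translate(first_line))  # MSPDINPYDPGAPLDPLPGH

# Last two codons of the gene sequence above.
print(translate("CAG TGA"))   # Q*
```

The first 20 residues match the start of the listed protein sequence (MSPDINPYDP GAPLDPLPGH), and the final codon TGA is a stop, consistent with the protein ending in Q.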