Gene Smed_3883 details

Gene Information

Locus tag: Smed_3883
Symbol:
ID: 5318563
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009620
Strand:
Start bp: 339156
End bp: 340259
Gene Length: 1104 bp
Protein Length: 367 aa
Translation table: 11
GC content: 61%
IMG OID: 640775695
Product: myo-inositol-1-phosphate synthase
Protein accession: YP_001312628
Protein GI: 150376032
COG category: [I] Lipid transport and metabolism
COG ID: [COG1260] Myo-inositol-1-phosphate synthase
TIGRFAM ID:
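
The start, end, gene length, and protein length fields above are related by simple arithmetic. The following is a minimal sketch of that check, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. Neither assumption is stated in the record, although both agree with the listed values. The function names are hypothetical.

```python
# Values copied from the record above. The coordinate convention
# (1-based, inclusive) is an assumption, not stated in the record.
START_BP = 339156
END_BP = 340259


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature with 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


if __name__ == "__main__":
    n = gene_length(START_BP, END_BP)
    print(n, protein_length(n))  # prints: 1104 367
```

Both results match the Gene Length (1104 bp) and Protein Length (367 aa) fields.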


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.397744
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 25
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGATCGA AATCCATTCG CGTCGGCTTG GTCGGTATCG GGAATTGCGC ATCCTCGCTC 
GTTCAAGGGC TGACATTCTA TCGCGACGCG AAGGAGGACG AGCCGGTTCC GGGACTGATG
CATGCCAATC TCGGCGGGTA CCACGTCGGC GATATCGAGA TCTCCGCCGC CTTCGACGTT
GCCGCGTCGA AGGTCGGGCG CGATGTCGCG GAAGCGATCT ACGCCGCTCC CAACAACACG
TTTCGTTTCG CGAATGCACC GTCGACGGGC GTGGTCGTTC AGCGTGGCAG AACGCTCGAC
GGTATCGGCC GCTATCTGCG CGAGGAGATC GAAGAATCCG ACGTGCCTGC CGCTGATGTC
GCGGACGTAC TGCGGCAGAC CGAAACGGAC GTTCTCGTCT CCTATCTTCC CGTCGGTTCC
GAAGTCGCCA CGCGCTGGTA TGCGGAGCAG GCGCTCGCGG CCGGCTGCGG TTTCGTCAAC
TGCATTCCCG TTTTCATTGC CTCGGACAAA TCGTGGCAGC GGAAATTTGC CGAGCGCGGG
CTGCCGCTCA TCGGTGACGA TATAAAGAGC CAGGTCGGTG CGACCATCGT GCACCGGCTG
CTTGCCAATC TCTTCCGCGA TCGGGGGGTG CGTATCGACA GGACGTACCA GCTCAACTTT
GGCGGCAATA CCGACTTTCT CAATATGCTC GAGCGGGAAC GGCTCGAATC GAAGAAAATA
TCCAAGACCC AATCTGTGGT CAGCCAGATG GACATTCCGC TCGCGGCCGG AGACATTCAT
GTGGGTCCGA GTGATCACGT TCCGTGGCTC GCCGACCGCA AGTTCGCCTA TATTCGCGTC
GAGGGCACGA CATTCGGCAA CGTTCCCCTC AATGTCGAGC TGAAGCTCGA AGTGTGGGAT
TCGCCGAACT CGGCGGGTGT CGTGATCGAT GCTGTTCGCT GCGCCAAGCT CGCAATCGAC
CGCGGCATTG CCGGGCCGCT CATTGCTCCT TCGAGCTATT TCATGAAGTC GCCACCGCAG
CAATTTACCG ATGCGGAGGC GCGCAGGCGG CTGGAGGAAT TCATCGCAGG CGAGACCGGC
GCACTCCTGG GGGCGGCCGA GTGA
 
Protein sequence
MGSKSIRVGL VGIGNCASSL VQGLTFYRDA KEDEPVPGLM HANLGGYHVG DIEISAAFDV 
AASKVGRDVA EAIYAAPNNT FRFANAPSTG VVVQRGRTLD GIGRYLREEI EESDVPAADV
ADVLRQTETD VLVSYLPVGS EVATRWYAEQ ALAAGCGFVN CIPVFIASDK SWQRKFAERG
LPLIGDDIKS QVGATIVHRL LANLFRDRGV RIDRTYQLNF GGNTDFLNML ERERLESKKI
SKTQSVVSQM DIPLAAGDIH VGPSDHVPWL ADRKFAYIRV EGTTFGNVPL NVELKLEVWD
SPNSAGVVID AVRCAKLAID RGIAGPLIAP SSYFMKSPPQ QFTDAEARRR LEEFIAGETG
ALLGAAE
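
The sequences above are printed in space-separated blocks of ten characters. The sketch below shows one way to strip that layout, compute GC content, and translate the gene sequence for comparison with the protein sequence. It is an illustration under stated assumptions rather than the method used to produce this record: the helper names are hypothetical, and the codon-to-amino-acid mapping is the standard one, which translation table 11 shares for elongation codons (alternative start codons are not handled here).

```python
# Hypothetical helpers for working with the blocked sequences above.
BASES = "TCAG"
# Standard codon-to-amino-acid mapping in TCAG order; translation
# table 11 uses the same mapping for elongation codons.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks and upper-case the sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    residues = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[pos:pos + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    # First 21 bases of the gene sequence listed above.
    prefix = clean_sequence("ATGGGATCGA AATCCATTCG C")
    print(translate(prefix))  # prints: MGSKSIR
```

Passing the full gene sequence to `clean_sequence` and `translate` should give a string that can be compared with the cleaned protein sequence, and `gc_fraction` can be compared with the GC content field.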