Gene Smed_1110 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_1110 
Symbol 
ID5321956 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009636 
Strand
Start bp1180879 
End bp1182048 
Gene Length1170 bp 
Protein Length389 aa 
Translation table11 
GC content67% 
IMG OID640790051 
Productmonooxygenase FAD-binding 
Protein accessionYP_001326796 
Protein GI150396329 
COG category[C] Energy production and conversion
[H] Coenzyme transport and metabolism 
COG ID[COG0654] 2-polyprenyl-6-methoxyphenol hydroxylase and related FAD-dependent oxidoreductases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.7504 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value0.659993 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAAAAGG CTGAGCGGGT TGCGATCGTC GGCGCCGGCA TCGCCGGGCT GACGACCGCG 
CTTTGCCTCG CCCGGCGGGG TTACCGGACC GACATCTTCG AGCAGGCCGA CGCGCTGGAC
GAAGTCGGAG CCGGGCTTCA GCTTTCGCCG AATGCATCGC GCATCCTCAT CGAACTTGGC
TTGCTGCCCG CTCTCGAAAG CGTCTGGAGC GAACCGGGGG AGATCTCGCT TGCCGACGGC
CGCTCGCTGC GGCAGCTTGC AAGCGTGCCT GCCGGCGCGC ATGCCCGCGC GCGCTGGGGC
GCACCCTACG CCGTCCTGCA TCGCGCCAGC TTGCAGGCGA TCCTCCAGGA CGCCGTCCGG
GCGGAACCGC TTTGCCGCCT TCATCTGGGA AGCCGCATCG GCGACGATCC GCAAGCAGCC
ATTTTCGAGG CGATCAAGCA AACACCGGCC GCAATCATCG GCGCCGACGG GATCTGGTCG
CGGATCCGCA CCTCGATTCC AGGCGCAGGC AGCTCCCGCT TTTCGGGCAA TATTGCCTGG
CGCTTCACGC TGCCACGTAC CCGGGCGCCT GCCTGCCTGC CGCATGACCG AGTTACGGCC
TTCCTCGCGC CGAGGGCCCA TCTGGTTGCC TATCCAATCC GAAAGATCGA CGGCTTCAAC
CTCGTTGCGA TCGTTGCTGG AAATGCATCT GGGGAAACGT GGGCGGGGCG GGAAACCGCC
GATCGCCGGC GCGCGTTCGA AAGGGCTTTC GGGGATTGGC ATCCCGACCT TCGCTCGCTG
CTCGACCACG CCGCGCCGGC GACCTGCTGG CCGCTCTGCA CGGTGGCCGA TGGAGCCTGG
CACAATCGGC GCGACACCAT CCTCATCGGC GATGCCGCCC ATGCCATGAC GCCGTTTGCG
GCCCAGGGCG CGGCCATGGC CATCGAGGAC GCCTGGGAGC TCGCCATCCG CATCGCGGAC
AGCCCGGATA CTCCCTCCGC CTTCTCGCGC TATGAAGAGG CACGCCGGGC GCGGATCGGC
CGCGTCCGCA AGCGCGCCGC CTTCAACAGC TTTGCCTATC ACGCCGCGGG TCCAGTACGC
ATCGCGCGCG ATTTCATTCT CGCCTCCAGG AAACCGGAAG CACTTGCCGC GGATTTCGAC
TGGCTTTTCG GCTATGGCGC GCAAAAGTGA
 
Protein sequence
MQKAERVAIV GAGIAGLTTA LCLARRGYRT DIFEQADALD EVGAGLQLSP NASRILIELG 
LLPALESVWS EPGEISLADG RSLRQLASVP AGAHARARWG APYAVLHRAS LQAILQDAVR
AEPLCRLHLG SRIGDDPQAA IFEAIKQTPA AIIGADGIWS RIRTSIPGAG SSRFSGNIAW
RFTLPRTRAP ACLPHDRVTA FLAPRAHLVA YPIRKIDGFN LVAIVAGNAS GETWAGRETA
DRRRAFERAF GDWHPDLRSL LDHAAPATCW PLCTVADGAW HNRRDTILIG DAAHAMTPFA
AQGAAMAIED AWELAIRIAD SPDTPSAFSR YEEARRARIG RVRKRAAFNS FAYHAAGPVR
IARDFILASR KPEALAADFD WLFGYGAQK