Gene Smed_3818 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_3818 
Symbol 
ID5318010 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009620 
Strand
Start bp271574 
End bp272788 
Gene Length1215 bp 
Protein Length404 aa 
Translation table11 
GC content67% 
IMG OID640775630 
Producthypothetical protein 
Protein accessionYP_001312563 
Protein GI150375967 
COG category[R] General function prediction only 
COG ID[COG2081] Predicted flavoproteins 
TIGRFAM ID[TIGR00275] flavoprotein, HI0933 family 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones34 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGAGTA GTGAAATTGC GATTATCGGC GGCGGCCCCG CCGGCCTGAT GGCGGCGGAA 
ATCCTGTCAC GCTCCGGCCA CGCGGTGACG ATCTACGAGG CGATGCCGAG CGCGGCGCGC
AAATTCCTGC TCGCGGGCAA GTCCGGCCTC AACATCACGC ACTCCGAACA CAGCAAGGCT
TTTATGCAGC GGTTCGCCGA TGCGTCCGCA AGGCTGCAAC CGGCGCTCGA CGCTTTTGCT
CCGAAAGACG TTCGCGCCTG GGCGGATGAA CTTGGCGCGG AAACGTTTGT CGGCTCCTCC
GGGCGCGTCT TTCCGAGAGC CATGAAGGCT TCGCCGCTGC TGCGCGCATG GCTTCGGCGG
CTGGAGGCGC AAGGCGTCCG GCTCCTCACC CGCCATCGCT GGTCGGGCTT TGCCGAGGAC
GGTTATGTTT TCGACACGCC GGAGGGCAGG ACGCTCGTGC GTTGCGACGC GGCTCTCATG
GCGCTCGGCG GCGCGAGTTG GCCCCGGCTC GGATCCGACG CTGCCTGGGT GCCCCCGCTG
CGGGCAAGAG GCGTACCGAT CAGGGATCTC CGCCCCGCCA ATTGCGGGTT CGACGTCGCA
TGGAGCGGGG CCTTCCGTGA GCGTTTTGCC GGTCAGGCGC TGAAAGCAGT TACCGCCACA
TCCGGCGCCG GGACCATCCC GGGTGAATTC GTGATGAGCC GCCACGGCAT CGAAGGCAGC
CTCGTCTATG CCCACGCGGC TTGCCTGCGC GACCGGCTGG AGCAGGACGG AAAAGCCTCC
CTCATGCTCG ACCTTGCGCC AGGCAGAACG GCCGAAAGGC TCGCGCGGGA TCTCGCCCGG
CAGGATCGCA AGGCGAGCCT CTCCAACCGC CTGCGTAAGG GCGCCGGGCT CGACGGTGTG
AAGGCGGCAT TGCTGCGCGA GCTCTCGCAG GAGGCAACCA GGATAGCTCC GGAGCAACTT
GCTGCACTTA TCAAGGCCTT GCCCGTTCCA GTGCTTGCGG CGCGGCCGAT CGCGGAGGCG
ATCTCGTCGG CCGGCGGTGT CCGCCTGGAC GGCGTCGATG AACGCTATAT GGTGAAGGCC
GTACCCGGCC TCTTCGTCGC CGGCGAGATG CTCGACTGGG AAGCGCCAAC GGGCGGCTAT
CTCCTTACAG CTTGCTTTGC CACGGGTCGC GCGGCCGCGC GGGGCGTGAA GGCATGGCTG
GACGCCCGTC CGTGA
 
Protein sequence
MQSSEIAIIG GGPAGLMAAE ILSRSGHAVT IYEAMPSAAR KFLLAGKSGL NITHSEHSKA 
FMQRFADASA RLQPALDAFA PKDVRAWADE LGAETFVGSS GRVFPRAMKA SPLLRAWLRR
LEAQGVRLLT RHRWSGFAED GYVFDTPEGR TLVRCDAALM ALGGASWPRL GSDAAWVPPL
RARGVPIRDL RPANCGFDVA WSGAFRERFA GQALKAVTAT SGAGTIPGEF VMSRHGIEGS
LVYAHAACLR DRLEQDGKAS LMLDLAPGRT AERLARDLAR QDRKASLSNR LRKGAGLDGV
KAALLRELSQ EATRIAPEQL AALIKALPVP VLAARPIAEA ISSAGGVRLD GVDERYMVKA
VPGLFVAGEM LDWEAPTGGY LLTACFATGR AAARGVKAWL DARP