Gene Smed_0336 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_0336 
Symbol 
ID5321169 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009636 
Strand
Start bp365106 
End bp366236 
Gene Length1131 bp 
Protein Length376 aa 
Translation table11 
GC content62% 
IMG OID640789271 
Producthypothetical protein 
Protein accessionYP_001326029 
Protein GI150395562 
COG category[R] General function prediction only 
COG ID[COG0628] Predicted permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.115948 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATATAG CCGAACGGAC AACGGCCAAG CGTCGGCGAA AGCGGGAAGC GGCGAAAGAA 
ATCGCTCTGG CCGAGGCATC GGCGATCGTT GCCCGGAAAC GCGACAGCAT AGAGCTTGCC
GGCAGCTGGA GCGTGATCGG TCTGTTCGTG ATCGCCTGCG CAGCGGTGGT TTATGCGATG
GAACCGATCC TCCTGCCGAT AACGCTTGCG GTCGTTGTCG GTATCGTCCT CGGCCGGGCT
GCCGATGAGC TTGCCCGGTT CGGTCTTCCG CCTATCTTCG GCGGGCTGTT GCTGGCGCTT
TTCTTCCTGC TGGGCCTGTC ATATCTGGTC AATGCCATTC TCTGGCCGAT TACCGAGGTC
GCGCGCGAAG CGCCGCGGCT GGTCGAAGGC GTGATCGAGC GGATACTGCC CTATCTGCAG
CGCTTCGAAT GGCTGAACCT GGTGCTTGCC CGCGGGACGG AAGAGGAGGC CTTTGCCGAC
GTCATCGTCA AGAATGCGGG GCCGCTGATC GGTGGCGCGG CGGCCAGCCT CACCCCGGCT
CTCGTGCAGA CCCTGATCTT TTTGGCGGCG CTCGTTCTAT TTCTGCTCGG GCGCGTTCAA
CTGCGCAGCA CGATCATTCT TGCCTTCCCC AGCCGCGAAG GGCGTTTGAC GGCAATCCGG
GTCATGAACG CCCTCGAGGA TGCGCTTGGG CATTATTTCT CGACCGCAAG CCTGATCTAC
CTGGCGCTTG GCGCAGTTAC CATGGTGGTC GCGCTCGTCG GCGGATTGGC GATGCCGCCG
CTTTGGGGCC TTTTCGCCTT CGTCTCGAGT TTCATTCCCT ATCTCGGTGT CACGTTCATG
ACCCTGGCTT TGCTCGTTGG CGGGCTGATG ACCCATGATG CGCTCATCGT TGCGCTCGCC
CCGGCCACCG CCTTCTTCTT CGTTCACCTC GCCATGGAGA ACCTGCTGGT GCCCGCCATC
CTCGGCCAGC GCTTCGATAT CAATCCATTC CTGATCTTCG TAGCGATCAT CTTCTGGACG
TGGATGTGGG GCGCCGTCGG CGCGATCCTC GCCTTTCCGC TATCGCTGAT CGCGATGATC
ATCTTCGAAC AGGTGCTGCT GCCGCCGCAG GAACGGCAGC TGCCGGGCTG A
 
Protein sequence
MDIAERTTAK RRRKREAAKE IALAEASAIV ARKRDSIELA GSWSVIGLFV IACAAVVYAM 
EPILLPITLA VVVGIVLGRA ADELARFGLP PIFGGLLLAL FFLLGLSYLV NAILWPITEV
AREAPRLVEG VIERILPYLQ RFEWLNLVLA RGTEEEAFAD VIVKNAGPLI GGAAASLTPA
LVQTLIFLAA LVLFLLGRVQ LRSTIILAFP SREGRLTAIR VMNALEDALG HYFSTASLIY
LALGAVTMVV ALVGGLAMPP LWGLFAFVSS FIPYLGVTFM TLALLVGGLM THDALIVALA
PATAFFFVHL AMENLLVPAI LGQRFDINPF LIFVAIIFWT WMWGAVGAIL AFPLSLIAMI
IFEQVLLPPQ ERQLPG