Gene Smed_0643 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_0643 
Symbol 
ID5321479 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009636 
Strand
Start bp695208 
End bp696146 
Gene Length939 bp 
Protein Length312 aa 
Translation table11 
GC content61% 
IMG OID640789579 
Producthypothetical protein 
Protein accessionYP_001326334 
Protein GI150395867 
COG category[E] Amino acid transport and metabolism
[G] Carbohydrate transport and metabolism
[R] General function prediction only 
COG ID[COG0697] Permeases of the drug/metabolite transporter (DMT) superfamily 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.254759 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value0.394325 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATGCGG ACGCTCCGAG CCCGATGAAG GGGATGGTGC TCAAGGTGCT CTCCGTTGTC 
GTCTTCGTGT GCATGTCAAC CTCTATCAAG GCTGCCGGCA ACGATATCGC CACGGGTCAG
ATCACCTTCT ATCGCTCGGC ATTCGCGATG GTGCCGATCC TCGGCTTCCT GGCCTGCCGA
GGCGCGCTGC GCGACGCTTT CCGGACCAGC AACGTCACGG GACATGTGGC GCGTGGCTTC
GTCGGCATCC TCGCCATGAG TTGCGGCTTT TACGGTCTCG TCCATCTGCC CCTGCCGGAG
GCGATCGCGA TCGGCTACGC TATGCCGCTT CTCGCGGTGG CTTTCGCGGC GATCTTTCTT
GGAGAGATCG TGCGGCTCTA TCGCTGGTCG GCCGTGCTTA TCGGGCTCAT CGGGGTGTTT
ATCATCATCT GGCCACGGCT TACGCTTTTC AACCAGGGCG GCTTCGGGTC GGCGGAGGCT
ATGGGTGCCG TCGCGGTGCT TTTTTCGGCG GCGCTCGGAG CAACGGCGAT GGTGCTCGTG
CGCAAGCTCG TACAGAAGGA ACGCACCCAT ACGATCGTCC TCTATTTCTC GCTTTCCGCT
GCAATGTTCT CGCTTGCGAC GCTGCCCTTC GGCTGGTCTG AACTCTCATG GGAGGCATTC
TTCCTCCTGA TGATCGCCGG GTTTTGCGGC GGCATCGGGC AGATCCTGCT GACGGAGAGT
TATCGCCACG CCGATATGTC GACGATCGCC CCCTTCGAAT ACACATCCAT CGTGCTGGGC
ATCGTCATCG GTTATTTTCT TTTCGGAGAT GTGCCGACGG CAACCATGCT CGCGGGAACG
GCGATCGTCG TCGGCGCGGG CATCTTCATC ATCTACCGGG AGCACCAGCT GGGACTGGAG
CGCAGGGGTG CCAGGAAGCA CGTTACCCCG CAGGGTTGA
 
Protein sequence
MDADAPSPMK GMVLKVLSVV VFVCMSTSIK AAGNDIATGQ ITFYRSAFAM VPILGFLACR 
GALRDAFRTS NVTGHVARGF VGILAMSCGF YGLVHLPLPE AIAIGYAMPL LAVAFAAIFL
GEIVRLYRWS AVLIGLIGVF IIIWPRLTLF NQGGFGSAEA MGAVAVLFSA ALGATAMVLV
RKLVQKERTH TIVLYFSLSA AMFSLATLPF GWSELSWEAF FLLMIAGFCG GIGQILLTES
YRHADMSTIA PFEYTSIVLG IVIGYFLFGD VPTATMLAGT AIVVGAGIFI IYREHQLGLE
RRGARKHVTP QG