Gene Smed_4275 details

Gene Information

Locus tag: Smed_4275
Symbol:
ID: 5319037
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009620
Strand:
Start bp: 764210
End bp: 765205
Gene Length: 996 bp
Protein Length: 331 aa
Translation table: 11
GC content: 63%
IMG OID: 640776080
Product: hypothetical protein
Protein accession: YP_001313013
Protein GI: 150376417
COG category: [E] Amino acid transport and metabolism
    [G] Carbohydrate transport and metabolism
    [R] General function prediction only
COG ID: [COG0697] Permeases of the drug/metabolite transporter (DMT) superfamily
TIGRFAM ID:
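
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 764210
END_BP = 765205
GENE_LENGTH_BP = 996
PROTEIN_LENGTH_AA = 331


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one stop codon, for an unspliced CDS."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions both checks hold: 765205 - 764210 + 1 = 996 bp, and 996 / 3 - 1 = 331 aa.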


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.122751
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 37
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGATTGCT CTGCCCGAAT ATGCGAATGG CTGGCCGAGA ACGTTCCCGC GGAGACGCGG 
TCCCGCTTTG AAGGAATCGA ACCGATGGCC GTCGACAGTG TCCCCGGCAA ACTTTCGCAT
GAAAATGGAA CCGCCCGCGC CGGCGTGATC GTCATGCTCC TCGGCATGCT CATGTTTTCG
GTAAACGACG TCATGGGAAA GTGGCTGGTA GCCACCTACT CGGTCGGTCA GGTGGTGCTG
ATCCGCAGCA TCGCAGCGGT CCTCCTGCTC GCGCCGTTTC TATGGGTGAG CGGCCCGAAA
AAGCTCTTCA CCCTGGAGCG GCCCGGCCTT CAGCTTGCCC GCGTGGTCGC CTCGACCGCG
GAAGTGATCG CCTTCTATTT CGCCGTCGTC TACCTGCCGC TCGCAGATGT CATGACCTAT
TGGCTGGCTG CGCCGATCTA TGTCGCGGCC ATTTCGCCGC TGGTCCTCAA GGAACCGGTC
GGCTGGCGGC GCTGGACAGC GATCGCCATA GGCTTCGTCG GCGTCGTCGT CGCACTCGAA
CCGTCGTCGC AGGCTTTCAC ACTGCCGGCC GTCATTTCGA TCCTTGGCAG CATGGCCTTC
GCCTTCATGA TGATTTCCGG GCGGTCGCTG CGCGGCACTC CTGATACGAC CCTCGCCTTT
TGGCAGATTG CCGGCGCCGC GGTGGCCGGC CTCGTATGGG CGCCCTTCGA CTGGACACCC
CTCAAGCCGC TCGACACGGC GCTGCTCTGT CTCCTTGGCG TCGTCGCAAT GGTCGCCCAC
GTGCTTGTCA ACCGGGCGCT GAAGCTCGCC GACGCCGCGA CGGTAGCCCC GCTGCAATAC
ACGCTCCTTT TCTGGGCAAT CTTCTTCGGA TGGCTGATCT TCGGCGATAC GCCGCGGCTT
TCGATGGTAC TCGGCGCCGG CCTTATCGTC GCCTCGGGCC TCTTCATCTT TTTCCGCGAA
CAGCAGCTGA AGAGGCAGGG GCGGCTGAAA GGCTGA
 
Protein sequence
MDCSARICEW LAENVPAETR SRFEGIEPMA VDSVPGKLSH ENGTARAGVI VMLLGMLMFS 
VNDVMGKWLV ATYSVGQVVL IRSIAAVLLL APFLWVSGPK KLFTLERPGL QLARVVASTA
EVIAFYFAVV YLPLADVMTY WLAAPIYVAA ISPLVLKEPV GWRRWTAIAI GFVGVVVALE
PSSQAFTLPA VISILGSMAF AFMMISGRSL RGTPDTTLAF WQIAGAAVAG LVWAPFDWTP
LKPLDTALLC LLGVVAMVAH VLVNRALKLA DAATVAPLQY TLLFWAIFFG WLIFGDTPRL
SMVLGAGLIV ASGLFIFFRE QQLKRQGRLK G
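
The gene and protein sequences above are printed in space-separated blocks of ten. A minimal sketch of how one might strip that layout, compute GC content, and translate the CDS is given below. It is an illustration, not the method used to build this record: the helper names are hypothetical, and the codon table is written out by hand in the usual T, C, A, G order. Translation table 11 assigns the same amino acids to codons as the standard code; it differs in which codons may act as start codons, which does not matter here because the gene begins with ATG.

```python
BASES = "TCAG"
# Amino acids for all 64 codons, ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks and convert to upper case."""
    return "".join(seq.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    Raises KeyError on bases other than A, C, G, T.
    """
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


if __name__ == "__main__":
    # First line of the gene sequence in this record.
    first_line = (
        "ATGGATTGCT CTGCCCGAAT ATGCGAATGG "
        "CTGGCCGAGA ACGTTCCCGC GGAGACGCGG"
    )
    print(translate(first_line))  # MDCSARICEWLAENVPAETR
```

The first 60 bases translate to MDCSARICEWLAENVPAETR, which matches the first twenty residues of the protein sequence listed above. Passing the full gene sequence to `gc_percent` gives a value that can be compared against the GC content field.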