Gene Smed_4468 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_4468 
Symbol 
ID5318170 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009620 
Strand
Start bp949400 
End bp950611 
Gene Length1212 bp 
Protein Length403 aa 
Translation table11 
GC content63% 
IMG OID640776269 
Productputative transcriptional regulator protein 
Protein accessionYP_001313201 
Protein GI150376605 
COG category[G] Carbohydrate transport and metabolism
[K] Transcription 
COG ID[COG1940] Transcriptional regulator/sugar kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0284418 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATGCTGA CCAAGTCCAG CACCGAACTC GTGCGTCAGC AGAACAGCGC GCTGGTGCTC 
GCGGCGCTTC GGCGGAAGGG CTCGCTTTCC CACACGGAGA TTGCGGCACA GACCGGGCTT
GCCTCGGCCA CGATCTCCGT TATCACGGCC GAGCTGGAGC GCGCCGGGAT CATCGCCAAA
ATCGAGAGGC ATGTGCAGGG CGGTCGCGGC CGGCCGCGGG TTCTCTTCGA GCCGAGACGC
GATTGCGGTC ACGTGATCGT CGTGCGGATC TCTTCCGACG TGGTGCAATA CTCGCTTGCT
GATTATGGCG GCATTCTGCT CGATCGTTTC GAGGAGGTGC GCGGTGACCT CGGCGGGACA
GCTGCCTTCG GGGAGGTCCT TGCAGCGGCA CTTGAACGGT TGCTCCTCCG CTCGAATATA
TCGAAGGACG AGGTGCTTGC CATCTCGATC AGCAGCAAGG GGCTCGTGGC CGCCGATGGC
GCGCGACTTA TCTGGTCCCC CGTCTTCGGC GGCGAGCAAC TCGATTTCGT GAAGCTGCTG
CGTCCTGATT GGCGCGCCAG GATCATGCTC AGCAATGAGT CGCTCCTCGT GGCGCACGCG
CTTGCCGTAA AGGAAGAGGA AAAACAGGTC GGTTTCCGCG CGCTCGCCGC CGTGTCGCTC
GGACACAGCA TCGGCCTCGG CCTTGCCCGA AGTGGCAGCT CCGGTGAGCT CGATGTCTCG
GCTCCGAACT TCGGCCATAT GCTGCATCAG GATGCCGCCG GGCTCTGCCG CTGCGGCAGT
TTTGGCTGCG TCGAGGCCGC CGCCGGTTTC TACGGGATTC TGCGCACGGC CTTCGAGGTG
CCCTCCAATA CCATCCCAGC CAAGTTCGTG CCGCTTTCGG AGATGGACAA GATCGCCGCG
CGTGCACGGC AAGGACATCG GATGGCCGCC TATGCCTTCC GACAGGCCGG CGTTGCGCTC
GGCAACGGAA TTTCCCGGAT GCTCAGTCTC TACGAGCCGA TGCCGATCTT CGTGACTGGA
CCGGGCACGC GCTATTTCGA TCTCCTGCAG AAGGGACTGG AGGAGGGCAT GGCACATTCG
CTGCAGGTGC GCCTTGAGGG CATGCCGCAG ATATCGGTGG TTATTGACGA ACAACGGCTC
GTTCTGGATG GTCATCTCGA CCGGGCGCTA GGGGCAATCG ACGGCGACAT TGCCGCCTCG
GGCCACCCAT GA
 
Protein sequence
MMLTKSSTEL VRQQNSALVL AALRRKGSLS HTEIAAQTGL ASATISVITA ELERAGIIAK 
IERHVQGGRG RPRVLFEPRR DCGHVIVVRI SSDVVQYSLA DYGGILLDRF EEVRGDLGGT
AAFGEVLAAA LERLLLRSNI SKDEVLAISI SSKGLVAADG ARLIWSPVFG GEQLDFVKLL
RPDWRARIML SNESLLVAHA LAVKEEEKQV GFRALAAVSL GHSIGLGLAR SGSSGELDVS
APNFGHMLHQ DAAGLCRCGS FGCVEAAAGF YGILRTAFEV PSNTIPAKFV PLSEMDKIAA
RARQGHRMAA YAFRQAGVAL GNGISRMLSL YEPMPIFVTG PGTRYFDLLQ KGLEEGMAHS
LQVRLEGMPQ ISVVIDEQRL VLDGHLDRAL GAIDGDIAAS GHP