Gene Smed_0101 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_0101 
Symbol 
ID5320929 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009636 
Strand
Start bp111629 
End bp112858 
Gene Length1230 bp 
Protein Length409 aa 
Translation table11 
GC content64% 
IMG OID640789033 
ProductROK family protein 
Protein accessionYP_001325796 
Protein GI150395329 
COG category[G] Carbohydrate transport and metabolism
[K] Transcription 
COG ID[COG1940] Transcriptional regulator/sugar kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00205324 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTTTGA CAGAAGGCCC TCATGCGGGG GTGAACCAGC CTGATGTGAT CGACCCGAGC 
GGCGGGGCGA ACCAGACGCG CGTGCGCGCT TATAATGAAC GGCTCGTCAT GTCGCTGGTG
CGCCGTCACG GCAGTCTTTC CAAGGCCGAA ATCGCGCGCC GCTCCGGCCT TTCGGCGCAG
ACCGTATCGG TCATCATGCG ATCGCTCGAA GCCGACGGGC TGCTTGTCCG TGGTGCCCCG
GTACGCGGCC GCGTCGGTCA GCCATCCATC CCCATGCGGC TCAATCCGGA TGCGGTCTAT
TCGTTCGGGG TCAAGATCGG GCGGCGTAGC GCCGACCTGG TGCTGATGGA TTTCCTCGGC
ACCATCCGGC TGCACCTGCA TCAGATCCAC ACCTATCCGC TACCCGAGGA TATCGTCAAC
TTCATCGTCA ACGGCATCGA CAAGCTCGAG AGAGAGCTTG GCCCCGGCGA GCGCGGGCGC
ATCGTCGGCG TCGGCGTCGC CACGCCGTTC GAGTTGTGGA ACTGGGCGGA GGAAGTCGGC
GCACCGCGGA ACGAGATGGA CAGGTGGCGC GACTTCGATC TGCAGGCGGC GGTCTCCTCG
CGAATCTCAC ATCCCGTCTT TCTGCAGAAT GACGGGACCA GCGCCTGCGG TGCCGAACTC
GCCTTCGGCG TCGGCGCCAG CTATCCGGAC TTTGTCTATT TCTACATAGG CTCCTTCATC
GGCGGCGGTG TCGTCATCAA TTCCGCGCTT TTCTCCGGCC GAACCGGAAC CGCCGGTGCG
GTCGGCCCGC TGCCCGTTGC AGGCAAGGAC GGCAAGTCGA CGCAATTGCT GAAGATCGCC
TCGGTCTTCG TGCTGGAAAA ACTCCTGCGA GAACGCGGGA TGGACCCCCA GCCGCTCTGG
TACTCCGCCG ACGACTGGAT CGATTTCGGC GAACCGCTGG AGGTCTGGAT CCAGGATGCG
GGCGCGGCGC TTGCGCAGGC CGTCGTTTCC GCCGTCTCGA TCGTCGATTT TTCCGCGGTC
GTGATCGACG GCGGCTTCCC GCCTTGGGTT CGTGTGCGCC TTCTTGCGGC AACGCGCAAG
GCCCTCAATA CGCTCGACCT GCAGGGCGTC ACGCTTCCGG ACCTCGTGGA AGGCACCGTC
GGCAGCCACG CCCGTGCGAT CGGCGGTGCC AGCCTGCCGC TCTTTTCCCG CTATCTGCTG
GACACCAATG TCCTCTTCAA GGAGCTTTGA
 
Protein sequence
MSLTEGPHAG VNQPDVIDPS GGANQTRVRA YNERLVMSLV RRHGSLSKAE IARRSGLSAQ 
TVSVIMRSLE ADGLLVRGAP VRGRVGQPSI PMRLNPDAVY SFGVKIGRRS ADLVLMDFLG
TIRLHLHQIH TYPLPEDIVN FIVNGIDKLE RELGPGERGR IVGVGVATPF ELWNWAEEVG
APRNEMDRWR DFDLQAAVSS RISHPVFLQN DGTSACGAEL AFGVGASYPD FVYFYIGSFI
GGGVVINSAL FSGRTGTAGA VGPLPVAGKD GKSTQLLKIA SVFVLEKLLR ERGMDPQPLW
YSADDWIDFG EPLEVWIQDA GAALAQAVVS AVSIVDFSAV VIDGGFPPWV RVRLLAATRK
ALNTLDLQGV TLPDLVEGTV GSHARAIGGA SLPLFSRYLL DTNVLFKEL