Gene Smed_5008 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_5008 
Symbol 
ID5318657 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009620 
Strand
Start bp1525936 
End bp1526967 
Gene Length1032 bp 
Protein Length343 aa 
Translation table11 
GC content64% 
IMG OID640776790 
Productregulatory protein LacI 
Protein accessionYP_001313722 
Protein GI150377126 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1879] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.0284302 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGAGAC GGCCGACGAT TGCGGATCTG GCCCGCGCAT CAGGCGTGAG CGTTGCAACG 
GTCGATCGCG TGCTGAACGG CCGTCATCCG GTGCGCGAGG AAACGGCGCG CCGCGTCTAC
GATGCGGCGA AGGCGATCGG CTACCACGCG GTCGGCCTGC TTCGCCAACG GGTTTTCGAG
GACCTGCCGC AATTTCGGCT CGGCTTTCTC CTGCAGAAGC CGGAGCAATC ATTCTACAAG
GCGCTCGCAA AGGAGATCGA AAACGCGGCC CTTGCGGTGA CGCATGTGCG TGCCGTTCCG
CAGGTGGATT TCGTCGCAAG CTCCACGCCG CAAGGGATCA TCGAAAAGTT GAAGGCCATG
GCGGCGCGCA ATCAGGCGAT CGCCCTGGTG TCGCCGGATT ATCCGGCCGT GACCGCCGCG
GTCGAGGACC TCAGGGATCG CGGCATTCCC GTCGTCGCAC TGCTTTCCGA CTTTGCCGCC
GGCGTGCGCG AGGCCTATGT GGGCCTCAAC AATCAGAAGG TCGGCAGAAC GGCGGCATGG
ATGATCGCCA AGGCGGCGAA GCGTCCGGGA AAGGTCGCCG CCTTCGTCGG CAGCCACCGC
TTCCATGGAC ACGAGCTGCG CGAGATCGGC TTTCGTTCGT ATTTTCGCGA AAACGCACCC
GAGTTCGAAG TCCTCGACAC GATGGTGAAC CTCGACACGC CCGAGATCAC CCATGAGGCA
ACGCTCGATC TCCTGCAACG CCACCCCGAT GTCCTCGGTT TCTACGTCTG TGGCGGCGGC
ATGGAGGGTG CCATTTCGGC GATCCGGGAG GAAAGGCTCG AGGGCAAGCT GCTCGTGGTC
GTCAACGAGC TGACGCCGGA ATCGCGCGCG GCACTCGCCG ATGAAACATT GCTTATGGCG
ATCTCGACGC CCGCCTCGGC ATTGGCCCGG GAATCGGTGA GCCTGATGAT CGGCGCGATC
GACCGGGAGG CCGCGAGCGT GCCCGGCCAA ACCTTCCTGC CCTTCGACAT CTACACGCCC
GAGAACATCT GA
 
Protein sequence
MTRRPTIADL ARASGVSVAT VDRVLNGRHP VREETARRVY DAAKAIGYHA VGLLRQRVFE 
DLPQFRLGFL LQKPEQSFYK ALAKEIENAA LAVTHVRAVP QVDFVASSTP QGIIEKLKAM
AARNQAIALV SPDYPAVTAA VEDLRDRGIP VVALLSDFAA GVREAYVGLN NQKVGRTAAW
MIAKAAKRPG KVAAFVGSHR FHGHELREIG FRSYFRENAP EFEVLDTMVN LDTPEITHEA
TLDLLQRHPD VLGFYVCGGG MEGAISAIRE ERLEGKLLVV VNELTPESRA ALADETLLMA
ISTPASALAR ESVSLMIGAI DREAASVPGQ TFLPFDIYTP ENI