Gene Smed_4638 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_4638 
Symbol 
ID5319283 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009620 
Strand
Start bp1147533 
End bp1148732 
Gene Length1200 bp 
Protein Length399 aa 
Translation table11 
GC content62% 
IMG OID640776436 
Productbinding-protein-dependent transport systems inner membrane component 
Protein accessionYP_001313368 
Protein GI150376772 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1174] ABC-type proline/glycine betaine transport systems, permease component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.508179 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.272859 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGATTG GTATAATCCA GGAAAAAGAA CGCTTGGCAT TCCGTTTCGA CAAACTCGGG 
GTCATCATCT CGATACTCGC CCTTTATGGC GCGCTCCTTG CCCCTTTCGC GACATATCGC
GCAAACAGAA TCATTCAGGG CGAAGCGCGG GGTATAATGG AAGCTCTTCC TCCACACCTC
GGCTACGGGC TCCTCTTCCT CGTGGTCGCC GGCGCTGCAG TCGCGCTTCT GAGGACGCCG
GCCCTTTTCC GGATGGCCAT GGCGCTCGCT GCACTGGCGG CTCTCGCCGG TCTGATCGGA
ATTGCGGCAG ATCATCTGAC GCCGCCGGAG AATAGCTATG CACGGGTGTC CCCCGCATCC
GGCTTCTGGG TTCTGGTTTT CGCATTCTCG CTGCTGCTCA CCGATGCCCT GACTCGCCTT
AATCCAGGAC CCGGTCTGCG CCTTCTCGTA CTGGCGGGCG TTCTCGTTCT TGCAGGGTCA
ATGCTCGTCG CCGGCCGCTG GGACGGGCTT TCGATCATGA AGGAATATGC CAATCGGGCC
GATTTGTTCT GGGCCGAGGC TGGCCGCCAC GTGACGCTGG CGCTCGGCTC GCTGGCCGCC
GCCACCGTCG TCGGTTTGCC TCTAGGCATT CTGTGCCACC GGGTCGAACG GCTGCGGGCG
GGCGTGCTCA ACGTCCTGAA CGCCATCCAG ACGATCCCAT CCATTGCGCT TTTTGGTATT
TTGATCGCGC CGCTCGGCTG GATCGCCGCA AATATTCCCG GCGCTTCGGC GGTCGGCATT
CGCGGCATCG GTGCGGCCCC CGCCTTCGTC GCACTTTTCC TCTATTCTCT GCTTCCGGTC
GTTGCCAACA CAGTGGTGGG GCTCGCGGGT GTGCCGCGCG CAGCCAACGA CGCGGCCCGA
GGTATCGGCA TGACGGATCG GCAGCGCCTT GTTACGATCG AGTTCCCGTT GGCCTTTCCG
GTCATCCTGA CCGGCATTCG TATCGTGCTC GTGCAGAATA TCGGGCTCGC CACGATCGCC
GCTCTTATCG GCGGAGGCGG CTTCGGCGTC TTCGTCTTCC AAGGGGTCGG TCAGACGGCG
ATGGATCTGG TGCTGCTGGG CGCCATCCCG ACGGTCGTGC TGGCCTTCAC CGCGGCGATC
GTACTGGACG CATTGATCGA AATGACCGTG CCAAATAGCA GTCAGGGCAA TGCCGCATGA
 
Protein sequence
MAIGIIQEKE RLAFRFDKLG VIISILALYG ALLAPFATYR ANRIIQGEAR GIMEALPPHL 
GYGLLFLVVA GAAVALLRTP ALFRMAMALA ALAALAGLIG IAADHLTPPE NSYARVSPAS
GFWVLVFAFS LLLTDALTRL NPGPGLRLLV LAGVLVLAGS MLVAGRWDGL SIMKEYANRA
DLFWAEAGRH VTLALGSLAA ATVVGLPLGI LCHRVERLRA GVLNVLNAIQ TIPSIALFGI
LIAPLGWIAA NIPGASAVGI RGIGAAPAFV ALFLYSLLPV VANTVVGLAG VPRAANDAAR
GIGMTDRQRL VTIEFPLAFP VILTGIRIVL VQNIGLATIA ALIGGGGFGV FVFQGVGQTA
MDLVLLGAIP TVVLAFTAAI VLDALIEMTV PNSSQGNAA