Gene Smed_4142 details


Gene Information

Locus tag: Smed_4142
Symbol:
ID: 5319138
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009620
Strand:
Start bp: 613914
End bp: 615041
Gene Length: 1128 bp
Protein Length: 375 aa
Translation table: 11
GC content: 62%
IMG OID: 640775947
Product: binding-protein-dependent transport systems inner membrane component
Protein accession: YP_001312880
Protein GI: 150376284
COG category: [E] Amino acid transport and metabolism; [P] Inorganic ion transport and metabolism
COG ID: [COG1173] ABC-type dipeptide/oligopeptide/nickel transport systems, permease components
TIGRFAM ID:
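
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions on the replicon and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 613914
end_bp = 615041

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# A CDS should be a whole number of codons.
codons, remainder = divmod(gene_length, 3)

# Assumption: the last codon is a stop codon and is not part of the protein.
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 1128 0 375
```

Under those assumptions the result agrees with the listed Gene Length (1128 bp) and Protein Length (375 aa).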


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value: 0.697509
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCATCG AAGTAGGAAG TACGCTTCCG CTGGCCGTCG AACCCGAGGA AAAGCACCAC 
CACGAGAGCT ACTCCGCACT CGTATGGCGC CGATTGAAGC GCTCCTGGAC CGGCCTGCTC
GGCCTGATCC TCGTCTGCCT GCTGATCCTG ATGGCCGTCT TTGCCGATTT CCTGTCGCCG
GTGGATCCGA AGGTGACGGA TGTCGCCTTC GCGCAGCCCC AGACGATCAG TTTCCGCGAC
AAGGACGGCA ATTTCGTCTT TCCGCCGCGG AGCTATCCGG TGCGGGAGAC GGAGGAACTC
GATCCGATCA CCTTCCAGCC GATCATCGGC CCCGACTACG AAAATCCACA GGTGCTGGGC
TTCTTCGTCA AGGGCGCTCC GTACCGGCTC TTCGGGTTGA TCCCGGCCGA GCGCCATCTC
TTCGGCGCAG TCGACGGAAC GCCGGTGCAT CTGCTCGGCA CCGACAAATT CGGCCGGGAC
GTACTGTCCC GCATTCTCTA CGGCTCGCGC ATCTCGCTGA TGATTGCGCT CACCGTTGTC
TTCATCGTCA CAGTGGTCGG CACGACGGTC GGCATGGTTT CCGGCTATTT CGGCGGTCGC
TTCGACGCCT GGGTGCAGCG CTTCGTCGAA CTCGTGCTCG CCTTTCCGCA ATTGCCGCTC
TATCTGGCGC TCGCCTCGCT GATTCCGGTT ACAGCGCCGA CCAACGTCTT CCTCGCCTTC
GTCATCATCG TAATGTCGGC GCTCGGCTGG GCGCAGATGT CGCGCGAGGT GCGCGGCAAG
ACCCTGGCGC TTGCCCGGAT CGAATATGTG CGGGCGGCGA TCGCCATCGG CGCGACGGAT
CGGCGCATCA TCTTCCAGCA CATCTTCCCG AATGTGATGA GCCACGTCAT CGTCGCCGTG
ACGCTCGCCA TTCCGCAGGT GGTGCTGCTT GAATCCTTCC TCGGCTTCCT CGGCTTTGCG
GTCAAGCCGC CGCTGATCTC CTGGGGGCTG ATGTTGCAGG ACACGGCCAA TTATTCGGCG
ATCGGTTCCT ATCCCTGGAT CCTCTCGCCT GTCGCCTTCG TGCTCGTCAC CGTCTTTGCC
TTCAACGCGT TGGGCGACGG CTTGCGCGAC GCAATCGACC CCTATTGA
 
Protein sequence
MTIEVGSTLP LAVEPEEKHH HESYSALVWR RLKRSWTGLL GLILVCLLIL MAVFADFLSP 
VDPKVTDVAF AQPQTISFRD KDGNFVFPPR SYPVRETEEL DPITFQPIIG PDYENPQVLG
FFVKGAPYRL FGLIPAERHL FGAVDGTPVH LLGTDKFGRD VLSRILYGSR ISLMIALTVV
FIVTVVGTTV GMVSGYFGGR FDAWVQRFVE LVLAFPQLPL YLALASLIPV TAPTNVFLAF
VIIVMSALGW AQMSREVRGK TLALARIEYV RAAIAIGATD RRIIFQHIFP NVMSHVIVAV
TLAIPQVVLL ESFLGFLGFA VKPPLISWGL MLQDTANYSA IGSYPWILSP VAFVLVTVFA
FNALGDGLRD AIDPY
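
The relationship between the gene sequence and the protein sequence can be checked by translating the nucleotides codon by codon. The following is a minimal sketch, not the database's own pipeline: the helper names (`clean`, `gc_fraction`, `translate`) are hypothetical, and the codon table is the standard genetic code, whose codon-to-amino-acid assignments translation table 11 shares (table 11 differs in the set of permitted start codons, which this sketch does not model). Only the first and last rows of the gene sequence are used as examples.

```python
from itertools import product

# Standard codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (0.0 for an empty sequence)."""
    s = clean(seq)
    if not s:
        return 0.0
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last rows of the gene sequence above (both start in frame,
# because every full row holds 60 bases).
first_row = "ATGACCATCG AAGTAGGAAG TACGCTTCCG CTGGCCGTCG AACCCGAGGA AAAGCACCAC"
last_row = "TTCAACGCGT TGGGCGACGG CTTGCGCGAC GCAATCGACC CCTATTGA"

print(translate(first_row))  # MTIEVGSTLPLAVEPEEKHH
print(translate(last_row))   # FNALGDGLRDAIDPY
```

The two outputs match the start and end of the protein sequence listed above. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content field.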