Gene Smed_3338 details

Gene Information

Locus tag: Smed_3338
Symbol:
ID: 5324222
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009636
Strand:
Start bp: 3538714
End bp: 3539847
Gene Length: 1134 bp
Protein Length: 377 aa
Translation table: 11
GC content: 62%
IMG OID: 640792289
Product: inner-membrane translocator
Protein accession: YP_001328994
Protein GI: 150398527
COG category: [R] General function prediction only
COG ID: [COG4603] ABC-type uncharacterized transport system, permease component
TIGRFAM ID:
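
The Start bp, End bp, Gene Length, and Protein Length fields are mutually consistent. The following is a minimal sketch of that arithmetic, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 3538714
end_bp = 3539847

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp, protein_length_aa)  # 1134 377
```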


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.945315
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value: 0.666399
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCACCC CTTATGCGAA ACTCCCGGGA TGGGTGGAAT ATGGCCTCAT CCCGCTGATC 
AACCTTGCCG TCGCTTTTCT CGTCGCCGGT CTCGTCGTTC TTCTCGTCGG CGAAAACCCG
CTTGAAGCCG CCTATCATCT CATCAATGGT GCTTTCGGCC GCGGCGAATA TATCGGATTC
ACGCTCTATT ACGCAACGAC CTTCATCTTC ACCGGTCTCG CGGTGGCAGT CGCCTTTCAT
TCAGGGCTTT TCAATATAGG CGGCGAGGGT CAGGCCTATG TCGGGGGCAT CGGTGTGGCG
CTCGCTTGCC TCTGGCTCGA TCAAATGATG CCCTGGTATG TGGTCTTTCC GCTGGCCATC
GTCGGCTCCG TCTTCTTCGG CGCGCTCTGG GCGTTCCTGC CGGGCTGGCT GCAGGCCAGG
CGCGGCAGCC ATATCGTCAT CACCACCATC ATGTTCAATT TCATTGCATC GAGCCTGATG
GTCTACCTTC TGACGCGGGT GCTGAAACCA CTTGGCTCTA TGGCGCCGCA GACGCGGACG
TTCGCCGAGG GAGGGCAATT GCCGAAGCTC GACTGGCTGT TGTCGATCTT CGGGCTGAAT
ATCGGTACGG CACCGTTCAA CATCTCCTTC CTGCTGGCGC TGGCCGCAGC CTTCGCCGTA
TGGGTGCTGA TCTGGCGCAC CAGGCTCGGC TACGAAATGC GCACCATGGG GCACAGCCCG
TCGGCAGCAC GTTACGCCGG TATCAGCGAA AGCCGCATCA CCGTCGTCGC CATGATGATG
TCCGGCGGCC TCGCCGGCAT GATGGCGCTG AACCCGATCA TGGGCGAGCA GTTCCGCATG
CAGCTCGACT TCGTGCAAGG CGCGGGCTTC GTCGGCATCG CGGTGGCGCT GATGGGACGC
TCGCATCCGG GCGGCATCAT CCCCGCCGCG ATCCTCTTCG GCGTTCTCTA TCAGGGCGGC
GCCGAGATCG CCTTCGAGAT GCCTTCGATC TCGAGGGACA TGATCGTGAT CATCCAGGGT
CTCGTCATTC TCTTTGCCGG CGCGCTCGAG AACATGTTCC GGCCGGCGAT CACGCGCGTT
TTCGCGGCGC GCGGCCAGCG CGCGGCTGCG CTCGTGCAGA CCAAGGGAGC CTGA
 
Protein sequence
MSTPYAKLPG WVEYGLIPLI NLAVAFLVAG LVVLLVGENP LEAAYHLING AFGRGEYIGF 
TLYYATTFIF TGLAVAVAFH SGLFNIGGEG QAYVGGIGVA LACLWLDQMM PWYVVFPLAI
VGSVFFGALW AFLPGWLQAR RGSHIVITTI MFNFIASSLM VYLLTRVLKP LGSMAPQTRT
FAEGGQLPKL DWLLSIFGLN IGTAPFNISF LLALAAAFAV WVLIWRTRLG YEMRTMGHSP
SAARYAGISE SRITVVAMMM SGGLAGMMAL NPIMGEQFRM QLDFVQGAGF VGIAVALMGR
SHPGGIIPAA ILFGVLYQGG AEIAFEMPSI SRDMIVIIQG LVILFAGALE NMFRPAITRV
FAARGQRAAA LVQTKGA
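
The gene and protein sequences above can be cross-checked against each other. The sketch below strips the 10-base grouping, translates codons, and computes a GC fraction. It assumes that GC content means (G + C) / length, and it relies on translation table 11 assigning the same amino acids to codons as the standard code (the two differ in permitted start codons). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical and chosen for this example.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
               "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {"".join(c): aa
               for c, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}

def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence."""
    return "".join(seq.split()).upper()

def translate(dna: str) -> str:
    """Translate complete codons from the first base onward."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))

def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C (assumed GC content definition)."""
    return (dna.count("G") + dna.count("C")) / len(dna)

# First 30 bases and last 24 bases of the gene sequence shown above.
head = clean("ATGAGCACCC CTTATGCGAA ACTCCCGGGA")
tail = clean("CTCGTGCAGA CCAAGGGAGC CTGA")
print(translate(head))  # MSTPYAKLPG
print(translate(tail))  # LVQTKGA*
```

Passing the full gene sequence through `clean` and then `gc_fraction` would give a value to compare with the GC content field; the result is not stated here because it was not computed for this example.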