Gene Smed_4066 details

Gene Information

Locus tag: Smed_4066
Symbol:
ID: 5317895
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009620
Strand:
Start bp: 528304
End bp: 529494
Gene Length: 1191 bp
Protein Length: 396 aa
Translation table: 11
GC content: 66%
IMG OID: 640775873
Product: major facilitator transporter
Protein accession: YP_001312806
Protein GI: 150376210
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID: [TIGR00880] Multidrug resistance protein
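
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the reported protein length excludes the stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 528304
end_bp = 529494

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# A CDS is read in triplets; the final codon is assumed to be a stop codon
# that does not contribute an amino acid.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1191, matching "Gene Length"
print(protein_length)  # 396, matching "Protein Length"
```

Under these assumptions the computed values agree with the listed gene length of 1191 bp and protein length of 396 aa.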


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 26
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCTCTGG CAATCTTCGC ATTGACGATC GCGGCCTATG CGATCGGCAC CACCGAATTC 
GTGATCGTCG GTCTCTTGCC GACGGTCGCA ACCGACCTCG CGATCAGCCT GCCGCTTGCC
GGCCTCATCG TCAGCGTCTA CGCCCTCGGC GTCACGTTCG GGGCGCCGGT GCTCACGGCT
TTGTCAAGGC GGATCGAACG CAAGCCCCTG CTTCTCGGCC TGATGGCGCT CTTCATCGGC
GGCAACACGG TGGCCGCACT ATCCCCGAAC TACGAAATGC TGCTCGTCGC CCGCGTACTT
TCCGCCTTCG CTCACGGCGT CTTCTTCTCG GTCGGCGCGA CGATAGCCGC CGACCTCGTG
CCGGAAGACC GCCGCGCTTC GGCTATCGCC ATGATGTTCA TGGGCCTCAC GGTCGCGATC
GTCACGGGCG TGCCGATCGG GACCTATATC GGCCAGGCTT TCGGCTGGCG GGCCACCTTC
TGGGGCGTTG CTGCACTCGG CGTCGTAGCC TTTGCCGGAA TTGCCACCCT GCTGCCGGGC
TCGCTCACCA AAGCCGCTCC GGCGAGCCTT CTCGACCAGG TGCGAGTGCT CGGCTCCGGC
AGGCTGCTGT TGGTCTTCGC CATGACCGCA CTCGGCTATG GGGGCACCTT CGTCGCCTTC
ACCTTTCTGG CGCCGATCCT GCAGGAGGTC ACCGGCTTTT CCGAAAACAG CGTCAGCCTG
ATCCTGGTGC TCTACGGCGT CGCGATCGCC GGCGGCAACA TCGCCGGCGG GCGGATCGCC
AACACCAACC CCGTCAAGGC GCTCATCGGC CTTTTCCTTC TCCAGGCGCT CGTATTGGTT
ATCTTCAGCT TCACTGCCGC CTCGCCGGCA CTCACTCTGG TGACACTTAC CGCTCTCGGC
TTCCTGTCCT TCGCCAATGT GCCGGGCCTG CAGCTCTACG TGGTGCAGCT TGCCAAGGAG
CATCGCCCCG GTGCCGTCGA CGTCGCCTCC GCGCTCAACA TCGCGGCCTT CAATCTCGGC
ATCGCGCTTG GCGCCTGGCT CGGCGGCCGG GTGGTCGGCT CTCCGCTCGG ACTTGCCTCC
ACCCCCTGGG TCGGCGCAAT CCTCGTCTCA GGCGCGCTCC TGCTCACGCT TTGGAGCGGC
CTTCTCGACC GGCGCGGCGA GTGCGGCGAG CCTTTGGCTG CAGCAAACTG A
 
Protein sequence
MPLAIFALTI AAYAIGTTEF VIVGLLPTVA TDLAISLPLA GLIVSVYALG VTFGAPVLTA 
LSRRIERKPL LLGLMALFIG GNTVAALSPN YEMLLVARVL SAFAHGVFFS VGATIAADLV
PEDRRASAIA MMFMGLTVAI VTGVPIGTYI GQAFGWRATF WGVAALGVVA FAGIATLLPG
SLTKAAPASL LDQVRVLGSG RLLLVFAMTA LGYGGTFVAF TFLAPILQEV TGFSENSVSL
ILVLYGVAIA GGNIAGGRIA NTNPVKALIG LFLLQALVLV IFSFTAASPA LTLVTLTALG
FLSFANVPGL QLYVVQLAKE HRPGAVDVAS ALNIAAFNLG IALGAWLGGR VVGSPLGLAS
TPWVGAILVS GALLLTLWSG LLDRRGECGE PLAAAN
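
The relationship between the gene sequence and the protein sequence can be illustrated by translating codons directly. This is a minimal sketch, not the database's own pipeline: it uses the codon-to-amino-acid assignments of the standard genetic code, which translation table 11 shares (the two differ in permitted start codons, which does not matter here because this gene begins with ATG). The names `CODON_TABLE` and `translate` are hypothetical helpers, and only the first three blocks and the last line of the gene sequence are used as input.

```python
from itertools import product

# Codon assignments in the conventional T, C, A, G ordering; '*' marks a stop.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA (whitespace ignored) in frame, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and final 51 bases of the gene sequence, copied from the record.
first_blocks = "ATGCCTCTGG CAATCTTCGC ATTGACGATC"
last_line = "CTTCTCGACC GGCGCGGCGA GTGCGGCGAG CCTTTGGCTG CAGCAAACTG A"

print(translate(first_blocks))  # MPLAIFALTI
print(translate(last_line))     # LLDRRGECGEPLAAAN
```

Both outputs match the corresponding start and end of the listed protein sequence. The last line can be translated on its own because it begins at base 1141, which is in frame with the start codon.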