Gene Smed_4169 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_4169 
Symbol 
ID5319198 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009620 
Strand
Start bp643415 
End bp644620 
Gene Length1206 bp 
Protein Length401 aa 
Translation table11 
GC content64% 
IMG OID640775974 
Productmajor facilitator transporter 
Protein accessionYP_001312907 
Protein GI150376311 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.240515 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.577442 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACAGATA CAGCGACTAC CCTGGATGCC GCGCTCATAC CGGAAACCAA CGAGCGGATC 
GATCCGGCCT GGTCCGGTGT CGGCTCACTT GCGCTCGGCG TGTTCGGCCT CGTAACCGCA
GAATTCCTCC CCGTCAGCAT CCTCACCCCG ATGGCGGCCG ATCTCGGCAT CAGCACCGGC
ACGGCCGGCC AGGCGGTCAC GGCTACCGCC GTCGTCGGCG CGGTCGCGGG ACTGACCGTT
GCCATCGCCA CGCGCCGGTT CGACCGCCGG CTCGTGCTTT GGGGGTTTAC TCTCGCGCTG
ATTGTTTCCA GTCTCATCGC AGCTTTTGCG ACCAACCTGG CAATGCTCCT TACCGCACGG
GTCATTCTTG GCGTTGGGCT CGGCGGCTTC TGGTCGATGA TGGCGGCAAT CGCCTTGCGT
CTCGTGCCGA TGTCTCTCGT GCCGCGCGCT ATGTCGATCG TCTTCACCGG GGTCTCGGTG
GCAACGGTCT GTGCCGCGCC GATCGGCGCT TACATCGGCG ATCTCTGGGG TTGGCGCGCC
GCATTTTTGA TGGCCGCGGC AGTCGGCGCC GTGACGCTGA TCGTGCAGAT GGTCAGCATA
CCCCGGCTGC CGCCGCAATC CGCCCCGAGT TTGAAGCTTC TGTTCGATCT CCTGGGCAGG
CCGAGCATCC GTATCGGGAT CATCACGGTT CTGCTATTCG TCTCGGGTCA CTTTGCCGGT
TTCACCTATG TGCGTCCGTT CCTGGAGAAA GTGCCGGAAT TCGGTATCGA GGCGATCTCG
CTGATCCTGC TCGCCTACGG CATAGGCGGC TTCTTCGGCA ACATCGCCGG CGGGCTCATC
ATCGAACGCA GCATCACGGC GGCCGTTGGT CTCGCAACGC TGCTTATCGC GGCCATGGCA
TTTCTCCTTG TGACCTTCGG TTCGCTGGAT GTCGTTTCGG CCACGGCGGT AACCATCTGG
GGCTTTGCCT TCGGAGCCTT GCCGGTCGCC GTCCAGACCT GGATGGTGCG CGCCGCTCCC
GAACATGCGG AAAGCACAGG CGGCCTCATC GTCGCGACAT TCCAGGTCGG CATCGCCAGC
GGCGCCGTGC TCGGCGGGCT CTTCGTGGAC GCTTTCGGCC CGCTCGGCGC CATCACCTAC
TGCGCCGCCG CGACATTTGC CAGCGCGCTT TATGTCGCGA CTTCCCGGCG GAGCATCGGT
GTTTGA
 
Protein sequence
MTDTATTLDA ALIPETNERI DPAWSGVGSL ALGVFGLVTA EFLPVSILTP MAADLGISTG 
TAGQAVTATA VVGAVAGLTV AIATRRFDRR LVLWGFTLAL IVSSLIAAFA TNLAMLLTAR
VILGVGLGGF WSMMAAIALR LVPMSLVPRA MSIVFTGVSV ATVCAAPIGA YIGDLWGWRA
AFLMAAAVGA VTLIVQMVSI PRLPPQSAPS LKLLFDLLGR PSIRIGIITV LLFVSGHFAG
FTYVRPFLEK VPEFGIEAIS LILLAYGIGG FFGNIAGGLI IERSITAAVG LATLLIAAMA
FLLVTFGSLD VVSATAVTIW GFAFGALPVA VQTWMVRAAP EHAESTGGLI VATFQVGIAS
GAVLGGLFVD AFGPLGAITY CAAATFASAL YVATSRRSIG V