Gene Smed_4451 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_4451 
Symbol 
ID5318603 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009620 
Strand
Start bp935103 
End bp936311 
Gene Length1209 bp 
Protein Length402 aa 
Translation table11 
GC content62% 
IMG OID640776253 
Productmajor facilitator transporter 
Protein accessionYP_001313186 
Protein GI150376590 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.915935 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCAATCG GGACCGACAC CATGACGCTG ACCTACAACG AGAACATCCA TAGCAGGCCC 
AGGGCCTGGG GCGCCGTGTT CTCCATGACA TTGTGCGTAT TCGTGCTGAT CGCTTCGGAG
TTCATGCCCG TGAGCCTGCT GACGCCGATC GCCGCTGATC TCGGTGTTTC AGAAGGGAGT
GCGGGCCAGG CAATCTCCAT CTCAGGCATC TTCGCGGTTT TCACCAGCCT CTTCATTGCC
GCGCTGACCC GGCGGCTCGA TCGGCGCGTG GTCGTGCTGG CCTTGACGTT TCTGCTGATG
CTGTCTGGGG TCGCGGTTAC CTTTGCGCCT TCCTATCCCA TGCTGATGCT GGGTCGCGCG
CTGCTTGGAA TTTCCATCGG CGGCTTCTGG TCGATGTCCA CGTCGATCGT GATGCGTCTC
GTCTCCCGCG ACCAGGTGCC GAAGGCACTT GCACTGCTCA ATGCGGGCAA TGCGATCGCG
GCCACCATCT CCGCACCCCT GGGAAGTCTC CTCGGGTCCT ATATCGGGTG GCGCGGCGCC
TTCTTTCTCG TGGTTCCCGT GGGCTTGCTT GCGCTTATCT GGCAATGGAT CAGCCTGCCG
ACGCTTTCGC CCCGGCGCGA TGGCGCATCC CGGAACGTCC TCCGGCTGCT GGCGCGTCCA
CCCGTCGCAT TGGGCATGGC AGCAATCCTG CTGCTGTTCA TGGGGCAGTT TGCTTTCTTC
ACTTATCTGC GGCCGTTCCT GGAGCAGGTG ACCCACCTTG GCATCGAGAC GCTCTCGCTC
ATGCTTCTCG TGATGGGATT GTCGGGAGTG GCCGGAACAT CGCTCGTCGG CCGGCTGCTG
ACTCATCGCC TGTTCAGCAT TCTCATCGTC ATTCCGTTCC TCATGGCCTG TATCGCTTTG
GCAATGATTG GCATCGGTGA GATGAGGACC CCTGTCGTCA TGTCCCTCAT CGGTTGGGGT
TTTCTCGGGA CCGCAGCGCC GGTGGCCTGG GGCACCTGGC TGAGCCGTGT TCTTGCCGAT
GATGCTGAGG CAGGCGGTGG ACTCCAGGTC GCTGTGATCC AGCTCGCCAT CACCGCCGGA
GCGTCTCTCG GCGGACTTCT CTTCGACGCT CTCGGCTGGT GGTCGACCTT CTCGCTCAGC
GCCCTGCTTC TGTTCGGCTC TTCCCTGGCG TCCTTCGCCG CATGGCTTTC GGCCAGGAGA
GCATCATGA
 
Protein sequence
MPIGTDTMTL TYNENIHSRP RAWGAVFSMT LCVFVLIASE FMPVSLLTPI AADLGVSEGS 
AGQAISISGI FAVFTSLFIA ALTRRLDRRV VVLALTFLLM LSGVAVTFAP SYPMLMLGRA
LLGISIGGFW SMSTSIVMRL VSRDQVPKAL ALLNAGNAIA ATISAPLGSL LGSYIGWRGA
FFLVVPVGLL ALIWQWISLP TLSPRRDGAS RNVLRLLARP PVALGMAAIL LLFMGQFAFF
TYLRPFLEQV THLGIETLSL MLLVMGLSGV AGTSLVGRLL THRLFSILIV IPFLMACIAL
AMIGIGEMRT PVVMSLIGWG FLGTAAPVAW GTWLSRVLAD DAEAGGGLQV AVIQLAITAG
ASLGGLLFDA LGWWSTFSLS ALLLFGSSLA SFAAWLSARR AS