Gene Smed_0474 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_0474 
Symbol 
ID5321308 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009636 
Strand
Start bp512451 
End bp513647 
Gene Length1197 bp 
Protein Length398 aa 
Translation table11 
GC content62% 
IMG OID640789409 
Productmajor facilitator transporter 
Protein accessionYP_001326166 
Protein GI150395699 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.812694 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAACGG TCACGTCCGC GGGCAGCATC AGCCCCGAAA AGACAGCATT CTCGGTCATT 
TTGGCAGTCA GCTTCTGCCA CATGCTGAAC GACATCATGC AATCGCTGCT GACGGCGCTC
TATCCTTTGC TCAAAGAAAA CTACGCTCTC GATTTCGTCC AGATCGGCCT TTTGACATTC
ACGTTCCAGG TGACGGCTTC GATGCTGCAG CCGCTCGTCG GCATCGTGAC GGATCGTTGG
GCCCTGCCCT ATTCGCTGCC CTTCGCCATG CTCTCGACCT GCATGGGCCT GCTGCTCCTC
GCCAATGCCG ACCACTTCTG GATGCTGCTT GTCGCAGCGA GCCTGATCGG CATCGGATCG
GCGATCTTCC ATCCGGAATC CTCCCGCGTG GCCCGCCTCG CCTCGGGCGG TCGCCACGGC
TTGGCGCAGT CGCTGTTCCA GGTCGGCGGC AATGCCGGAA GCGCGCTGGG GCCGCTGCTG
GCCGCTTTCA TTGTTCTCCC TTTCGGACAG GGCAGCCTCG GCTGGTTTTC CGTCGTCGCG
ATCACGGGGT TCTTCGTGCT CTCCTGGGTG AGCACGTGGT ACGTGCGCCA CCGTCGCTCG
ACCATGAGCC GCCCTGTCCC GAGCCGGGTG CTGCCGCTGC CGAAGACGCG CGTCATGTGG
ACGATTGCGA TCCTCGTCCT GCTGACGGCG ACGAAGAACG TCTACCTCAC CAGCATCTCG
AGCTACTTCA CCTTCTTCGC CATTGAAAAA TTCGGCACGA GCGTACAGCA GGCGCAGTTG
ATGCTTTTCC TGTTCCTCGG CTCGGCAGCC GCAGGCACGT TCCTCGGAGG CCCGATCGGC
GACCGGTTCG GCGCGCGCTT CGTCATCTGG TTCTCGATCC TCGGCGTCGT GCCGTTCACC
CTTCTGCTGC CCTATGCAAA CCTTTTCTGG ACGGGCGTGC TGAGCGTCAT CATCGGCCTT
ATCTTTTCGT CGGCCTTCTC GGCGATCGTC GTCTTCGCGC AGGAACTCGT TCCGGGCAGG
GTGGGGCTCA TCGCCGGTGT CTTCTTCGGC TTTGCCTTCG GCGCCGGCGG CATGGGCGCC
GCGGTTCTCG GGGTCGTCGC CGACCGGCAG GGAATCGAGT TCGTCTACCT GATCTGCTCG
TATCTCCCGC TCCTCGGACT GCTCACCATA TTTTTGCCGA AGCTGCCTGC GCGATAG
 
Protein sequence
MATVTSAGSI SPEKTAFSVI LAVSFCHMLN DIMQSLLTAL YPLLKENYAL DFVQIGLLTF 
TFQVTASMLQ PLVGIVTDRW ALPYSLPFAM LSTCMGLLLL ANADHFWMLL VAASLIGIGS
AIFHPESSRV ARLASGGRHG LAQSLFQVGG NAGSALGPLL AAFIVLPFGQ GSLGWFSVVA
ITGFFVLSWV STWYVRHRRS TMSRPVPSRV LPLPKTRVMW TIAILVLLTA TKNVYLTSIS
SYFTFFAIEK FGTSVQQAQL MLFLFLGSAA AGTFLGGPIG DRFGARFVIW FSILGVVPFT
LLLPYANLFW TGVLSVIIGL IFSSAFSAIV VFAQELVPGR VGLIAGVFFG FAFGAGGMGA
AVLGVVADRQ GIEFVYLICS YLPLLGLLTI FLPKLPAR