Gene Smed_0801 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_0801 
Symbol 
ID5321638 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009636 
Strand
Start bp861962 
End bp863197 
Gene Length1236 bp 
Protein Length411 aa 
Translation table11 
GC content61% 
IMG OID640789738 
Productmajor facilitator transporter 
Protein accessionYP_001326492 
Protein GI150396025 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value0.419426 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCCAGA TCCGCCCGCT CATCCCGCTG CTCGTCACCG CAGGCATACT GATCGGCGGC 
AACGGGCTGC AGGGCACCTT CATTTCATTG AGAGCGCTGG ACGAGGGGTT CTCGACCTCG
CTCATCGGCG TGGTCGGCGC CGGCTACAAT ATCGGGTTCG CGATCGGCTG CATCTACGTC
ACCCGCATCC TCCGCGCGAT CGGTCACATC CGCACCTTTT CGGCAATGGC GGCCATAGCC
TCGGCTGCCG CGATCTCCAT GGTTCTCATT ATCGATCCCT GGTTCTGGTT CCTGATGCGA
CTCGTCGCCG GGATCTGCTT CGCAAGCCTC TTCGCCACGG TGGAGAGCTG GTTGAATGCC
AGCGTCACCA ACGCCAACAG GGGACGCACA TTGTCGGTCT ACCGTCTGGT CGATCTCGGT
TCGGTCACAG CGGCGCAATA CGCCATACCC GGCATCGGCA TCGGCGGGTT TGAGCTCTTT
GCGATCATTT CCATGGCGCT GACGCTCTCG CTCGTGCCGA TTTCCTTCGC CGACAGATCG
AGCCCGGTCA CTCCGGAAGC GATCCGATTC GACGTCAAGA CGCTCTGGAA CATCTCGCCG
CTGGCCACCA TTGGCTGCAT CGTCGTGGGC CTGACCAATG CCGCATTCCG CTCGCTCGGC
CCGATCTATG CGCAGGAGAT CGGGCTTTCG GTAACGGCAA TCGCGACCTT CATGAGCGCG
GGCATCATCG GCGGCGTCGT GTTGCAATAT CCCCTAGGCT ACTACTCCGA CCGGATCGAC
CGCAGGCTGA TCATCCTGCT CGCAACCTTC GGCTCCCTGC TTGCGGGCCT CTTCCTCGCC
TTCGGCGCCG GCAGCGACGA GTGGCTGAAC TTCGCCGGTA TCTTTGCCTT CGGCGCCTTC
GCTATGCCGC TATTTTCGCT ATGCTCGGCA CAAGCCAACG ACCATGCGGC TGAAGGCCAG
CATGCGCTGG TTTCGGCAGG CATGCTCTTC TTCTGGTCGC TCGGAGCTAT TATCGGGCCG
CTCTTCGCAT CCTTCCTGCT CGAGATATTC GCCCCGCAGG TGCTTTTCAT CTACACGGCC
GCGATCCTGG GGGCTTTCAT GCTCTACACA CTCTTGCGCA TGACTGCGCG TAAGCCGGTT
CCAACCGAGG AACGGTCGAT GCGCTTTCGC AATCTCCTTC GCACATCGTC CTTCTTCAAC
AAGCTTGCCG GCGGCCACGC GCGAAAAGAG CCGTGA
 
Protein sequence
MSQIRPLIPL LVTAGILIGG NGLQGTFISL RALDEGFSTS LIGVVGAGYN IGFAIGCIYV 
TRILRAIGHI RTFSAMAAIA SAAAISMVLI IDPWFWFLMR LVAGICFASL FATVESWLNA
SVTNANRGRT LSVYRLVDLG SVTAAQYAIP GIGIGGFELF AIISMALTLS LVPISFADRS
SPVTPEAIRF DVKTLWNISP LATIGCIVVG LTNAAFRSLG PIYAQEIGLS VTAIATFMSA
GIIGGVVLQY PLGYYSDRID RRLIILLATF GSLLAGLFLA FGAGSDEWLN FAGIFAFGAF
AMPLFSLCSA QANDHAAEGQ HALVSAGMLF FWSLGAIIGP LFASFLLEIF APQVLFIYTA
AILGAFMLYT LLRMTARKPV PTEERSMRFR NLLRTSSFFN KLAGGHARKE P