Gene Smal_3972 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmal_3972 
Symbol 
ID6474856 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStenotrophomonas maltophilia R551-3 
KingdomBacteria 
Replicon accessionNC_011071 
Strand
Start bp4468495 
End bp4470156 
Gene Length1662 bp 
Protein Length553 aa 
Translation table11 
GC content64% 
IMG OID642733175 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_002030354 
Protein GI194367744 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value0.727747 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCAGCA CACCCAGCAC CGCACACAAA GCGGGCACCC TGACCCAGGG CCACAAGAAG 
GTCATTTTCG CGTCGAGCCT CGGCACGGTG TTCGAGTGGT ATGACTTCTT CCTGTACGGC
TCGCTTGCAG CGATCATCGC CAAGCAGTTC TTCAGTGGCG TCAATGAAAC CACGGGCATG
ATCTTCGCCC TGCTGGCGTT CGCTGCCGGC TTCTTCGTGC GTCCGTTCGG CGCGGCCTTC
TTCGGCAGCC TCGGCGACCG CATCGGCCGC AAGTACACCT TCCTGGTCAC GATCCTGATC
ATGGGCATCT CGACCTTCCT GGTCGGCGTG CTGCCCAACT ACGCTTCGAT CGGTTTCGCC
GCACCGGTGA TCCTGATCAT CCTGCGCCTG GCCCAGGGCC TGGCGATGGG CGGCGAGTAC
GGCGGTGCCG CCACCTACGT GGCCGAACAC GCACCGGACG ACAAGCGCGG CCTGTACACC
AGCTTCATCC AGTGCACGGC CACGCTCGGC CTGTTCATGT CGCTGCTGAT CATCCTGGCC
TGCCGCTACT TCCTCGGCAA CGAAGCCTTC GAAGCCTGGG GCTGGCGTAT TCCGTTCCTG
GTCTCGATCC TGCTGCTGGG CGTGTCGGTG TGGATCCGCC TGCAGCTGAG CGAGTCGCCG
CTGTTCCAGC AGATGAAGTC CGAGGGCAAG GGTTCCAAGA CGCCGTTCCG TGACAGTCTG
AAGGGCGGCA ACCTGAAGCT GATGCTGCTG GTCCTGCTGG GTGCTGCGGC CGGCCAGGCG
GTGGTGTGGT ACGGCGGCCA GTTCTACGCG CTGTTCTTCC TCAGCAGCAT GCTGAAGGTC
GATGCCACCA CGTCCTACCT GCTGATCGCC GCCGCGCTGG CGCTGGGCGT GCCGTTCTTC
ATCTTCTTCG GCTGGCTGTC CGACCGTATC GGCCGCAAGA AGATCATCCT GGCCGGCTGC
CTGCTGGCCG CCGTCACCTA CATCCCGATC TTCAAGGGCC TGACCCACTT CGCCAACCCG
GCCATCGAAG AAGCCCGCAC CAACTCGCCG GCGCTGGTGG TTGCCGATCC GAACACCTGT
TCGTTCCAGT TCGATCCGGT CGGCCTGCGC AAGTTCACCA GCTCCTGCGA CGTCGCTACC
GCTGCGCTGA CCAAGGCCGG TGTGCCGTAT GACGTGCAGC CCGCCGCCGC CGGTTCGCTG
GCGATGGTGA ACGTGGGCAG CGCCAGCGTC ACCTCGTATG AGGCTGCTGG CCTGACCAAG
GAAGACGGCA AGGCCAAGGC CGATGCGTTC GGTGCGGAAC TGAAGACCGC CCTGACCACC
GCCGGCTACC CGGCCAAGGC GGATGGCGCC CGCATCAACA TCGCCGGCAC CATCTTCATG
CTGTGGCTGC TGGTGCTGTA CGTGACCATG GTCTACGGCC CGATCGCCGC TTACCTGGTC
GAACTGTTCC CGACCCGCAT CCGCTACACC TCGATGTCGC TGCCGTACCA CATCGGCAAC
GGCTGGTTCG GTGGCTTCCT GCCGGCGATC TCGTTCGCGC TGGTGGCCGG TACCGGCAAC
CTGTACTACG GCCTGTGGTA CCCGATCATC ATCGCGCTGA TGTCGGTGGT GATCGGCGGC
CTGTTCCTGC GCGAGACCAA GGACGTGGAT ATCACCAAGT AA
 
Protein sequence
MSSTPSTAHK AGTLTQGHKK VIFASSLGTV FEWYDFFLYG SLAAIIAKQF FSGVNETTGM 
IFALLAFAAG FFVRPFGAAF FGSLGDRIGR KYTFLVTILI MGISTFLVGV LPNYASIGFA
APVILIILRL AQGLAMGGEY GGAATYVAEH APDDKRGLYT SFIQCTATLG LFMSLLIILA
CRYFLGNEAF EAWGWRIPFL VSILLLGVSV WIRLQLSESP LFQQMKSEGK GSKTPFRDSL
KGGNLKLMLL VLLGAAAGQA VVWYGGQFYA LFFLSSMLKV DATTSYLLIA AALALGVPFF
IFFGWLSDRI GRKKIILAGC LLAAVTYIPI FKGLTHFANP AIEEARTNSP ALVVADPNTC
SFQFDPVGLR KFTSSCDVAT AALTKAGVPY DVQPAAAGSL AMVNVGSASV TSYEAAGLTK
EDGKAKADAF GAELKTALTT AGYPAKADGA RINIAGTIFM LWLLVLYVTM VYGPIAAYLV
ELFPTRIRYT SMSLPYHIGN GWFGGFLPAI SFALVAGTGN LYYGLWYPII IALMSVVIGG
LFLRETKDVD ITK