Gene Information |
Locus tag | SeD_A3953 |
Symbol | |
ID | 6873586 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853 |
Kingdom | Bacteria |
Replicon accession | NC_011205 |
Strand | - |
Start bp | 3788001 |
End bp | 3789218 |
Gene Length | 1218 bp |
Protein Length | 405 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 642786912 |
Product | major facilitator superfamily transporter |
Protein accession | YP_002217540 |
Protein GI | 198243187 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
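The coordinate and length fields above are related by simple arithmetic, which the following minimal Python sketch checks. It assumes the coordinates are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 3788001
end_bp = 3789218

# Assumption: coordinates are 1-based and inclusive,
# so the span covers end - start + 1 bases.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon, which is
# not counted as an amino acid in the protein length.
protein_length = gene_length // 3 - 1

print(gene_length)     # 1218
print(protein_length)  # 405
```

Both results agree with the Gene Length and Protein Length fields in the record.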
| |
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.515945 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 96 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCGAAC CCGTCGCTGA ACCGGCGCTA AACGGATTGC GCCTCAATCT GCGTATTGTC TCCATTGTGA TGTTTAACTT TGCCAGCTAC CTGACCATCG GCCTGCCGCT CGCCGTCTTG CCCGGCTATG TGCATGATGC GATGGGATTC AGCGCGTTCT GGGCGGGGCT TATTATCAGC CTGCAATACT TCGCCACTCT GTTAAGCCGT CCCCATGCCG GGCGGTATGC GGATGTATTA GGGCCGAAAA AAATCGTTGT CTTTGGCTTA TGCGGCTGTT TTTTAAGCGG ACTCGGCTAC CTGCTGGCGG ATATCGCCAG CGCCTGGCCG ATGATCAGTT TGTTGCTACT GGGGCTGGGT CGCGTGATTT TGGGGATTGG GCAAAGTTTT GCCGGCACCG GTTCGACACT ATGGGGCGTC GGCGTCGTCG GGTCGTTGCA TATTGGTCGG GTCATCTCCT GGAACGGTAT CGTCACCTAC GGCGCAATGG CGATGGGCGC GCCGCTGGGC GTGCTGTGTT ATGCCTGGGG CGGGTTACAG GGACTGGCGC TAACGGTGAT GGGCGTGGCG CTGTTGGCGG TACTGTTAGC CCTTCCACGT CCGTCGGTGA AGGCGAACAA AGGCAAGCCG CTGCCGTTTC GCGCGGTGCT GGGGCGTGTC TGGCTGTATG GTATGGCGTT GGCGCTGGCC TCGGCAGGGT TTGGCGTCAT CGCGACGTTT ATTACCTTAT TTTATGATGC TAAAGGCTGG GATGGCGCCG CCTTTGCGCT CACGTTATTT AGCGTCGCGT TTGTCGGCAC GCGTTTGCTG TTCCCTAACG GCATCAATCG TTTAGGCGGG TTGAATGTCG CCATGATCTG CTTTGGCGTG GAGATTATTG GTCTGTTACT TGTGGGGACG GCAGCCATGC CGTGGATGGC AAAAATCGGC GTTTTACTCA CGGGGATGGG GTTTTCGCTG GTCTTTCCGG CGCTGGGCGT GGTGGCCGTC AAAGCCGTGC CGCCGCAGAA CCAGGGCGCG GCGCTGGCGA CCTATACCGT CTTTATGGAT ATGTCTTTGG GGATTACCGG GCCGCTGGCG GGGCTGGTGA TGACCTGGGC GGGCGTGCCG GTGATTTATC TGGCGGCAGC CGGGCTGGTA ACGATGGCGC TATTGCTGAC TTGGCGCTTA AAAAAACGGC CTCCGTCTGC ACTGCCGGAG GCCGCATCAT CATCGTAA
|
Protein sequence | MPEPVAEPAL NGLRLNLRIV SIVMFNFASY LTIGLPLAVL PGYVHDAMGF SAFWAGLIIS LQYFATLLSR PHAGRYADVL GPKKIVVFGL CGCFLSGLGY LLADIASAWP MISLLLLGLG RVILGIGQSF AGTGSTLWGV GVVGSLHIGR VISWNGIVTY GAMAMGAPLG VLCYAWGGLQ GLALTVMGVA LLAVLLALPR PSVKANKGKP LPFRAVLGRV WLYGMALALA SAGFGVIATF ITLFYDAKGW DGAAFALTLF SVAFVGTRLL FPNGINRLGG LNVAMICFGV EIIGLLLVGT AAMPWMAKIG VLLTGMGFSL VFPALGVVAV KAVPPQNQGA ALATYTVFMD MSLGITGPLA GLVMTWAGVP VIYLAAAGLV TMALLLTWRL KKRPPSALPE AASSS
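The sequences above are printed in space-separated blocks of ten characters. As a hedged illustration of how the gene sequence relates to the protein sequence and to the GC content field, the sketch below strips the spacing, computes a GC percentage, and translates codons. The helper names (`clean`, `gc_percent`, `translate`) are hypothetical. The codon table is built from the standard 64-codon ordering, and translation table 11 uses the same amino acid assignments as the standard code, differing only in its permitted start codons. Only the first two and last two blocks of the gene sequence are used here, so the GC value printed is for that fragment and not for the whole gene.

```python
BASES = "TCAG"
# Amino acids for all 64 codons, in TCAG order (first, second, third base).
# '*' marks a stop codon. Table 11 shares these assignments with the standard code.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing and normalize to upper case."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons from the first base; trailing bases are ignored."""
    seq = clean(seq)
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First two and last two blocks of the gene sequence in the record.
head = "ATGCCCGAAC CCGTCGCTGA"
tail = "GCCGCATCAT CATCGTAA"

print(translate(head))   # MPEPVA
print(translate(tail))   # AASSS*
print(gc_percent(head))  # 65.0
```

The translated fragments match the start (MPEPVA...) and end (...AASSS) of the listed protein sequence. That suggests the gene sequence is given in the coding orientation even though the Strand field is "-".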