Gene Information |
Locus tag | SeD_A1983 |
Symbol | |
ID | 6872321 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853 |
Kingdom | Bacteria |
Replicon accession | NC_011205 |
Strand | + |
Start bp | 1915356 |
End bp | 1916591 |
Gene Length | 1236 bp |
Protein Length | 411 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 642785099 |
Product | major facilitator family transporter |
Protein accession | YP_002215765 |
Protein GI | 198243701 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 37 |
Fosmid unclonability p-value | 0.0000795412 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGTCTCAGA ACAAGGCTCG CAACATGCCA TATTTGCTGG CTGTTATCTG CATTTATTTT AGTTACTTTC TCCACGGCAT GAGTGTTATT ACACTAGCCC AGAACATGAC CTCCCTTGCA CAGAAATTCT CCACGGATAG TGCCGGTATC GCCTATTTAA TCTCTGGCAT TGGTCTTGGC CGTCTGGTCA GTATTTTATT CTTTGGCGTA CTGTCCGATA AATTTGGCCG TCGGGCAATA ATACTGCTTG GCGCCGTACT ATATATGCTA TTTTTCTTCG GTATTCCCGC CAGTCCTAAC CTGATGATCG CTTTCATATT AGCGGTGTGT GTCGGCGTGG CGAACTCCGC GCTGGATACC GGCGGATACC CTGCATTAAT GGAGTGTTTT CCCAAAGCAT CGGGCTCGGC AGTTATTCTG GTTAAAGCGA TGGTCTCTTT TGGGCAAATG ATTTATCCCC TTATTGTCAG CGCCTTGTTA GTCAACCATA TCTGGTACGG CTACGCGGTG GTAATCCCTG GTATCCTTTT CGTCCTCATC ACGTTGATGC TGTTGAAAAG CCGTTTTCCC AGCCAACTTG TCGATGCCAG TATTGCGAAA GAATTACCCC AGATGAACAG TACTCCCCTC GTCTGGCTGG AAGGCGTAGC TTCCGTTTTA TTTGGCGTCG CCGCGTTCTC AACCTTCTAT GTGATTGTGG TCTGGATGCC TAAATATGCG ATGGCCTTCG CCGGAATGGC GGAATCCGAC GCGCTGAAAA CCATCTCTTA TTACAGTATG GGATCGTTGG TTTGCGTGTT TATTTTTGCC GCATTGCTGA AAAAAATGGT TCGCCCCATC TGGGCCAATG TTTTCAATGC CGGGCTGGCG ACACTCACCG CTGCGGCAAT TTACCTGTAT CCCTCTCCAC TGATCTGTAA TGCTGGCGCC TTCGTGATTG GTTTTTCCGC TGCTGGAGGT ATTTTACAAT TAGGCGTATC GGTAATGTCG GAATTTTTCC CTAAGAGTAA AGCTAAAGTC ACCAGTATAT ATATGATGAT GGGGGGCGTA GCTAACTTTC TTATCCCACT GATTACCGGT TATCCCTCTA CTATTGGCCT GCAATATATC ATTTTGTTAG ATTTTGCCTT TGCACTACTG ACATTTATCA CCGCCATTAT TGTATTTATT CGCTATTATC GCGTATTTAA GATCCCGCAA AACGATGTCC GATTTGGCGA GCGTTATTTC CAGTAA
|
Protein sequence | MSQNKARNMP YLLAVICIYF SYFLHGMSVI TLAQNMTSLA QKFSTDSAGI AYLISGIGLG RLVSILFFGV LSDKFGRRAI ILLGAVLYML FFFGIPASPN LMIAFILAVC VGVANSALDT GGYPALMECF PKASGSAVIL VKAMVSFGQM IYPLIVSALL VNHIWYGYAV VIPGILFVLI TLMLLKSRFP SQLVDASIAK ELPQMNSTPL VWLEGVASVL FGVAAFSTFY VIVVWMPKYA MAFAGMAESD ALKTISYYSM GSLVCVFIFA ALLKKMVRPI WANVFNAGLA TLTAAAIYLY PSPLICNAGA FVIGFSAAGG ILQLGVSVMS EFFPKSKAKV TSIYMMMGGV ANFLIPLITG YPSTIGLQYI ILLDFAFALL TFITAIIVFI RYYRVFKIPQ NDVRFGERYF Q
|
| |
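
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The following is a minimal Python sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes one terminal stop codon (the gene sequence above ends in TAA). The function names are illustrative only and are not part of any database API.

```python
# Values copied from the record above.
START_BP = 1915356
END_BP = 1916591
GENE_LENGTH_BP = 1236
PROTEIN_LENGTH_AA = 411


def span_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive coordinate range (assumed convention)."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons should hold if the record is internally consistent.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

The same two checks apply to any unspliced, non-pseudo CDS record laid out like this one. A spliced gene or a pseudogene would need different handling.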