Gene Information |
Locus tag | SeAg_B4701 |
Symbol | |
ID | 6793227 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Agona str. SL483 |
Kingdom | Bacteria |
Replicon accession | NC_011149 |
Strand | + |
Start bp | 4593649 |
End bp | 4595085 |
Gene Length | 1437 bp |
Protein Length | 478 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 642778774 |
Product | sugar transporter |
Protein accession | YP_002149336 |
Protein GI | 197249228 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | [TIGR00879] MFS transporter, sugar porter (SP) family |
|
|
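The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive coordinates and that the CDS ends in a stop codon that is not counted in the protein length; the function names are illustrative and not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(4593649, 4595085)
print(length_bp)                  # 1437
print(protein_length(length_bp))  # 478
```

Under these assumptions the result agrees with the listed Gene Length (1437 bp) and Protein Length (478 aa).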
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.32702 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCTCAGA GAAGTAAGTA CAATTCGGCC TATGTGTACG TCCTGTGTTG TATTGCGGCG CTGGCTGGAT TGATGTTTGG TTATTCAACG GCGGTGATTA CCGGGGTGGT ATTGCCTTTA CAGCAGTATT ACCAACTGAC GCCAACCGAG ACCGGATGGG CCGTTTCCAG TATCGTGATT GGTTGTATCA TCGGCGCGCT GGTCGGTGGA AAAATTGCCG ATAAACTGGG GCGTAAACCT GCGCTTCTGA TCATTGCGAT CATTTTTATC GCTTCTTCCT TAGGGGCGGC GATGAGTGAA TCGTTCATGA TCTTCTCCCT TTCCCGCATT GTGTGTGGTT TTGCGGTTGG GATGGCCGGA ACGGCATCCA CCATGTATAT GTCTGAACTG GCGCCTGCTG AAATTCGCGG CAAAGCGCTG GGCATTTACA ATATCTCCGT GGTATCTGGC CAGGTTATCG TGTTTATAGT CAACTATCTG ATAGCAAAAG GAATGCCTGC TGATGTGCTG GTTTCCCAGG GCTGGAAGAC TATGCTTTTT GCCCAAGTGG TACCCTCCAT TGCGATGTTA GCGATTACGC TTTTCCTACC CGAATCACCG GCATGGTGCG CCCGTAACAA CCGCAGCCAA GCCCGTTCGA TAAAGGTGCT TACCCGGATC TACAGTGGAT TAACGGCCAC AGATGTGGCC GCTATTTTTG ACAGCATGAA AGAAACCGTA CGTCCACAGG ACAACGTCGC CGGGGGAGAA CGCACCAACC TGAAAAGCTC GCCGGTGCTC CGCTATATTC TGTTGGTTGG ATGCTGTATC GCCGTTTTGC AACAGTTCAC AGGCGTTAAC GTAATGAACT ATTATGCGCC GCTGGTGTTG CAGAACAGCA GTACCGAAGT GGTTATGTTC CAGACCATTT TTATCGCGGT ATGTAATGTG GTGGGCAGTT TTATCGGCAT GATCCTGTTC GACCGCTATG GCCGTATACC GATTATGAAA ATTGGTACCA TCGGCTCAAT TGTCGGCCTG TTGATCGCGT CATACGGTTT GTACACCCAC GATACAGGCT ACATTACCAT CTTTGGCATC CTGTTTTTTA TGCTGCTGTT TGCCGTCAGC TGGAGCGTTG GCGCATGGGT ACTGATTTCT GAGGTTTTCC CTGAAAAGAT AAAAGGTTTT GGGATGGGGC TGGCGGTGAG CCTGATGTGG ATAGCCAACT TCCTCATCTC ACTGTTGTTC CCGGTCATAA ATGATAACGC CTGGCTGCAG GAGACCTTCG GCGGCGCTTT CTCGATGTGG ATTTTTGTCG TCTTTAATTT GGTCTGCTAT GTCTTTATTT CTCGTTATGT GCCGGAAACA AAAGGGGTGC CGCTAACAGA AATTGAACGG CTGGCCGAGA ACAAGCTGCG TGAAATTCAG GGGAAACGTC GCGATGTAAT AGCCTGA
|
Protein sequence | MSQRSKYNSA YVYVLCCIAA LAGLMFGYST AVITGVVLPL QQYYQLTPTE TGWAVSSIVI GCIIGALVGG KIADKLGRKP ALLIIAIIFI ASSLGAAMSE SFMIFSLSRI VCGFAVGMAG TASTMYMSEL APAEIRGKAL GIYNISVVSG QVIVFIVNYL IAKGMPADVL VSQGWKTMLF AQVVPSIAML AITLFLPESP AWCARNNRSQ ARSIKVLTRI YSGLTATDVA AIFDSMKETV RPQDNVAGGE RTNLKSSPVL RYILLVGCCI AVLQQFTGVN VMNYYAPLVL QNSSTEVVMF QTIFIAVCNV VGSFIGMILF DRYGRIPIMK IGTIGSIVGL LIASYGLYTH DTGYITIFGI LFFMLLFAVS WSVGAWVLIS EVFPEKIKGF GMGLAVSLMW IANFLISLLF PVINDNAWLQ ETFGGAFSMW IFVVFNLVCY VFISRYVPET KGVPLTEIER LAENKLREIQ GKRRDVIA
|
| |
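The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It assumes the codon-to-amino-acid assignments of the standard genetic code, which, as far as I know, translation table 11 shares (the two differ in which codons are accepted as starts). The helper names `clean`, `translate`, and `gc_percent` are hypothetical, and whitespace is stripped because the record prints the sequence in blocks of ten.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing used in the record and upper-case the bases."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three and last three blocks of the gene sequence, copied from the record.
print(translate("ATGTCTCAGA GAAGTAAGTA CAATTCGGCC"))  # MSQRSKYNSA
print(translate("GGGAAACGTC GCGATGTAAT AGCCTGA"))     # GKRRDVIA
print(gc_percent("ATGC"))                             # 50.0
```

The two translated fragments match the first ten and last eight residues of the protein sequence listed above. Passing the full gene sequence to `gc_percent` would give a value to compare against the reported GC content of 50%.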