Gene SeD_A1983 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeD_A1983 
Symbol 
ID6872321 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Dublin str. CT_02021853 
KingdomBacteria 
Replicon accessionNC_011205 
Strand
Start bp1915356 
End bp1916591 
Gene Length1236 bp 
Protein Length411 aa 
Translation table11 
GC content47% 
IMG OID642785099 
Productmajor facilitator family transporter 
Protein accessionYP_002215765 
Protein GI198243701 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones37 
Fosmid unclonability p-value0.0000795412 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGTCTCAGA ACAAGGCTCG CAACATGCCA TATTTGCTGG CTGTTATCTG CATTTATTTT 
AGTTACTTTC TCCACGGCAT GAGTGTTATT ACACTAGCCC AGAACATGAC CTCCCTTGCA
CAGAAATTCT CCACGGATAG TGCCGGTATC GCCTATTTAA TCTCTGGCAT TGGTCTTGGC
CGTCTGGTCA GTATTTTATT CTTTGGCGTA CTGTCCGATA AATTTGGCCG TCGGGCAATA
ATACTGCTTG GCGCCGTACT ATATATGCTA TTTTTCTTCG GTATTCCCGC CAGTCCTAAC
CTGATGATCG CTTTCATATT AGCGGTGTGT GTCGGCGTGG CGAACTCCGC GCTGGATACC
GGCGGATACC CTGCATTAAT GGAGTGTTTT CCCAAAGCAT CGGGCTCGGC AGTTATTCTG
GTTAAAGCGA TGGTCTCTTT TGGGCAAATG ATTTATCCCC TTATTGTCAG CGCCTTGTTA
GTCAACCATA TCTGGTACGG CTACGCGGTG GTAATCCCTG GTATCCTTTT CGTCCTCATC
ACGTTGATGC TGTTGAAAAG CCGTTTTCCC AGCCAACTTG TCGATGCCAG TATTGCGAAA
GAATTACCCC AGATGAACAG TACTCCCCTC GTCTGGCTGG AAGGCGTAGC TTCCGTTTTA
TTTGGCGTCG CCGCGTTCTC AACCTTCTAT GTGATTGTGG TCTGGATGCC TAAATATGCG
ATGGCCTTCG CCGGAATGGC GGAATCCGAC GCGCTGAAAA CCATCTCTTA TTACAGTATG
GGATCGTTGG TTTGCGTGTT TATTTTTGCC GCATTGCTGA AAAAAATGGT TCGCCCCATC
TGGGCCAATG TTTTCAATGC CGGGCTGGCG ACACTCACCG CTGCGGCAAT TTACCTGTAT
CCCTCTCCAC TGATCTGTAA TGCTGGCGCC TTCGTGATTG GTTTTTCCGC TGCTGGAGGT
ATTTTACAAT TAGGCGTATC GGTAATGTCG GAATTTTTCC CTAAGAGTAA AGCTAAAGTC
ACCAGTATAT ATATGATGAT GGGGGGCGTA GCTAACTTTC TTATCCCACT GATTACCGGT
TATCCCTCTA CTATTGGCCT GCAATATATC ATTTTGTTAG ATTTTGCCTT TGCACTACTG
ACATTTATCA CCGCCATTAT TGTATTTATT CGCTATTATC GCGTATTTAA GATCCCGCAA
AACGATGTCC GATTTGGCGA GCGTTATTTC CAGTAA
 
Protein sequence
MSQNKARNMP YLLAVICIYF SYFLHGMSVI TLAQNMTSLA QKFSTDSAGI AYLISGIGLG 
RLVSILFFGV LSDKFGRRAI ILLGAVLYML FFFGIPASPN LMIAFILAVC VGVANSALDT
GGYPALMECF PKASGSAVIL VKAMVSFGQM IYPLIVSALL VNHIWYGYAV VIPGILFVLI
TLMLLKSRFP SQLVDASIAK ELPQMNSTPL VWLEGVASVL FGVAAFSTFY VIVVWMPKYA
MAFAGMAESD ALKTISYYSM GSLVCVFIFA ALLKKMVRPI WANVFNAGLA TLTAAAIYLY
PSPLICNAGA FVIGFSAAGG ILQLGVSVMS EFFPKSKAKV TSIYMMMGGV ANFLIPLITG
YPSTIGLQYI ILLDFAFALL TFITAIIVFI RYYRVFKIPQ NDVRFGERYF Q