Gene SeD_A1853 details


Gene Information

Locus tag: SeD_A1853
Symbol:
ID: 6871629
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853
Kingdom: Bacteria
Replicon accession: NC_011205
Strand:
Start bp: 1794145
End bp: 1795398
Gene Length: 1254 bp
Protein Length: 417 aa
Translation table: 11
GC content: 55%
IMG OID: 642784983
Product: inner membrane transport protein YnfM
Protein accession: YP_002215651
Protein GI: 198241768
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
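
The gene length and protein length in this record follow from the start and end coordinates. The short Python sketch below redoes that arithmetic. It assumes the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information record.
start_bp = 1794145
end_bp = 1795398

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not part of the protein.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints: 1254 417
```

Both results agree with the listed Gene Length (1254 bp) and Protein Length (417 aa).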


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 66
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAGTCGTA CAACTATTCT CGATGACGCC ACGGCGAGCG ATATCGATGA ACAACGCCAT 
TCTCAGCCGG TTCAATTTAT TAAACGCGGT ACAGCGCCTT TCATGCGCGT CACGCTGGCC
CTTTTTTCCG CGGGCCTGGC GACTTTCGCG CTCCTTTATT GCGTTCAGCC TATACTACCG
GTACTTTCCC AGGAGTTTGG CGTCTCCCCC GCCAGCAGCA GCGTTTCACT TTCTATTTCT
ACCGCCATGC TGGCTATCGG CTTACTTTTT ACCGGCCCGC TTTCTGATGC GATAGGCCGT
AAACCGGTGA TGGTCACCGC CTTGTTATTA GCCTCCTGCT GCACATTGTT ATCAACCATG
ATGACCAGCT GGCATGGTAT TCTGATTATG CGCGCGCTGA TCGGTCTGTC GTTAAGCGGC
GTCGCCGCGG TGGGAATGAC CTACCTGAGC GAAGAAATCC ATCCCAGCTT TGTCGCTTTC
TCTATGGGGC TTTATATTAG CGGTAATTCC ATCGGCGGGA TGAGCGGGCG TTTATTAAGC
GGCGTCATGA CCGACTTTTT TAACTGGCGC ATCGCGCTGG CGGCCATCGG ATGCTTTGCG
CTGGCGTCCG CGCTGATGTT CTGGAAGATT TTACCCGCCT CGCAACATTT CCGTCCCACG
TCGCTACGGC CTAAAACGCT ATTTATTAAT TTCCGTCTCC ACTGGCGCGA TCGGGGGTTA
CCTCTACTGT TTGCAGAAGG TTTTTTACTG ATGGGCGCGT TCGTCACCCT GTTTAACTAC
ATTGGTTATC GCCTGATGCT CTCCCCCTGG GAATTAAGCC AGGCCGTGGT CGGACTGTTA
TCCGTAGCCT ATCTCACCGG CACATGGAGC TCGCCGAAAG CTGGCGCCAT GACCTCACGC
TATGGCCGGG GGCCGGTGAT GCTGTTCTCT ACCGCAGTAA TGCTGGGTGG GCTTTTACTG
ACGCTTTTTA CCTCGCTTTG GCTGATTTTT GCTGGGATGC TGCTTTTTTC CGCCGGCTTT
TTCGCCGCGC ACTCCGTGGC CAGTAGCTGG ATTGGCCCGC GTGCGCGTCG GGCGAAGGGA
CAGGCCTCGT CACTTTATCT TTTCAGCTAT TATCTGGGGT CCAGTATTGC CGGGACGCTA
GGCGGTGTAT TCTGGCATAG TTACGGCTGG AACGGCGTAG GTGGTTTTAT CGCGCTAATG
CTGGTGCTGG CCATCCTGGT CGGGACGCGG TTACATCACC GTCTTCATGC CTGA
 
Protein sequence
MSRTTILDDA TASDIDEQRH SQPVQFIKRG TAPFMRVTLA LFSAGLATFA LLYCVQPILP 
VLSQEFGVSP ASSSVSLSIS TAMLAIGLLF TGPLSDAIGR KPVMVTALLL ASCCTLLSTM
MTSWHGILIM RALIGLSLSG VAAVGMTYLS EEIHPSFVAF SMGLYISGNS IGGMSGRLLS
GVMTDFFNWR IALAAIGCFA LASALMFWKI LPASQHFRPT SLRPKTLFIN FRLHWRDRGL
PLLFAEGFLL MGAFVTLFNY IGYRLMLSPW ELSQAVVGLL SVAYLTGTWS SPKAGAMTSR
YGRGPVMLFS TAVMLGGLLL TLFTSLWLIF AGMLLFSAGF FAAHSVASSW IGPRARRAKG
QASSLYLFSY YLGSSIAGTL GGVFWHSYGW NGVGGFIALM LVLAILVGTR LHHRLHA
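
The gene sequence starts with GTG rather than ATG, yet the protein starts with M. This matches translation table 11, which allows alternative start codons that are read as methionine. As a rough check, the Python sketch below translates the first 30 nucleotides of the gene sequence. It is not a full table 11 implementation: the `translate` helper is hypothetical, it uses the standard codon assignments (which table 11 shares), and it simply assumes the first codon is a valid start.

```python
# Standard codon assignments (shared by translation table 11), in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a CDS, stopping at the first stop codon.

    Assumption: the first codon is a valid start codon and is read as
    methionine, as for the GTG start of this gene.
    """
    dna = dna.replace(" ", "").upper()
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append("M" if pos == 0 else amino_acid)
    return "".join(protein)


# First 30 nucleotides of the gene sequence in this record.
gene_start = "GTGAGTCGTA CAACTATTCT CGATGACGCC"
print(translate(gene_start))  # prints: MSRTTILDDA
```

The output matches the first ten residues of the protein sequence, MSRTTILDDA.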