Gene SeD_A4918 details

Gene Information

Locus tag: SeD_A4918
Symbol:
ID: 6874738
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853
Kingdom: Bacteria
Replicon accession: NC_011205
Strand:
Start bp: 4754167
End bp: 4755351
Gene Length: 1185 bp
Protein Length: 394 aa
Translation table: 11
GC content: 56%
IMG OID: 642787794
Product: major facilitator family transporter
Protein accession: YP_002218387
Protein GI: 198244661
COG category:
COG ID:
TIGRFAM ID:
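
The coordinate and length fields above are related by simple arithmetic. The sketch below is a minimal illustration, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names are hypothetical and not part of any database API.

```python
# Values copied from the record above.
START_BP = 4754167
END_BP = 4755351


def gene_length(start, end):
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len_bp):
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, "bp;", length_aa, "aa")
```

Under these assumptions the computed values agree with the listed Gene Length (1185 bp) and Protein Length (394 aa).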


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.844872
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 57
Fosmid unclonability p-value: 0.222303
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTTCATG CTCAGAGCGC TTCACCTCGC CATCCCATTT TTACGGCGCT TTTCGGTATG 
ATGGTATTGA CGCTGGGGAT GGGAGTTGGC CGCTTTCTGT ATACGCCCAT GCTGCCGGTA
ATGCTGGCGG AAAAGCAGCT AACGTTTAAT CAGCTCTCCT GGATTGCCAG CGCCAATTAT
GCAGGGTACC TGGCGGGAAG CTTACTGTTT TCCTTTGGCC TTTTTCATCT CCCCTCTCGT
CTACGCCCCA TGCTGCTGGC TTCGGCGGTC GCCACCGGTA TTCTTATTCT GTCTATGGCG
ATATTTACTC AGCCCGCGGT CGTCATGTTG GTGCGCTTCC TGGCAGGCGT CGCGAGTGCG
GGAATGATGA TTTTTGGATC AATGATTGTG TTGCATCATA CCCGCCATCC GTTTGTGATC
GCCGCGCTCT TTTCCGGCGT CGGGGCCGGG ATTGCACTGG GTAACGAATA TGTCATTGGC
GGTTTACATT ATGCGCTTTC GGCCCATTCG CTATGGCTGG GGGCCGGGGC GCTCGCCGGT
ATATTGCTAT TAATCGTGGC AATGTTGATT CCCCCCCGCG CTCATGCGCT GCCGCCTGCG
CCTTTAGCAA GAATCGAAAA TCAGCCTATG CCCTGGTGGC AACTGGCGCT GTTGTACGGT
TTCGCCGGAT TCGGCTACAT CATTGTCGCT ACCTATTTGC CGCTGATGGC GAAAAGCGCG
GGCTCTCCGC TACTCACGGC GCACCTTTGG TCGTTGGTCG GCCTGGCGAT CATTCCCGGC
TGCTTCGGCT GGCTGTGGGC GGCAAAACAT TGGGGTGTCC TGCCATGCCT GACCGCGAAC
CTGTTGATCC AGAGCGCCTG CGTGCTGCTG TCTCTCGCCA GCGACTCGTT GTTGCTGTTA
ATACTGAGCA GTATTGGTTT TGGCGCCACG TTTATGGGCA CAACCTCGCT GGTGATGCCG
CTAGCCCGAC AGCTCAGCGC GCCGGGCAAT ATTAATTTAT TAGGCCTGGT GACGCTAACG
TATGGTATTG GGCAAATTCT CGGTCCGCTC GCCGCCAGTC TGTCAGGCAA TGGCGCGTCG
GCAATTATCA ACGCCACGCT TTGCGGCGCC GCGGCGCTCT TTTTTGCCGC GCTGATCAGC
GCCGCGCAGC AGATAAAACA AAAACGGTTT GTGATACGTG AATAA
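
A GC content figure such as the one listed under Gene Information can be recomputed from a sequence printed in blocks of ten bases. The following is a minimal sketch, assuming that whitespace and line breaks are only layout and that GC content means the percentage of G and C bases; `gc_percent` is a hypothetical helper name.

```python
def gc_percent(seq):
    """Percentage of G and C bases, ignoring whitespace and letter case."""
    cleaned = "".join(seq.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in cleaned if base in "GC")
    return 100.0 * gc_count / len(cleaned)


# Usage: the first row of the gene sequence above. Pass the full sequence
# to obtain a value comparable to the GC content field of the record.
first_row = "ATGGTTCATG CTCAGAGCGC TTCACCTCGC CATCCCATTT TTACGGCGCT TTTCGGTATG"
print(round(gc_percent(first_row), 1))
```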
 
Protein sequence
MVHAQSASPR HPIFTALFGM MVLTLGMGVG RFLYTPMLPV MLAEKQLTFN QLSWIASANY 
AGYLAGSLLF SFGLFHLPSR LRPMLLASAV ATGILILSMA IFTQPAVVML VRFLAGVASA
GMMIFGSMIV LHHTRHPFVI AALFSGVGAG IALGNEYVIG GLHYALSAHS LWLGAGALAG
ILLLIVAMLI PPRAHALPPA PLARIENQPM PWWQLALLYG FAGFGYIIVA TYLPLMAKSA
GSPLLTAHLW SLVGLAIIPG CFGWLWAAKH WGVLPCLTAN LLIQSACVLL SLASDSLLLL
ILSSIGFGAT FMGTTSLVMP LARQLSAPGN INLLGLVTLT YGIGQILGPL AASLSGNGAS
AIINATLCGA AALFFAALIS AAQQIKQKRF VIRE
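
The protein sequence can be derived from the gene sequence by codon-wise translation. The sketch below is one way to do this without external libraries. It assumes the amino-acid assignments of translation table 11 match the standard genetic code (table 11 differs in its permitted start codons, which this sketch does not handle) and that the sequence is given in the coding orientation; `translate` and `CODON_TABLE` are hypothetical names.

```python
from itertools import product

# Codons enumerated in T, C, A, G order, paired with the amino-acid string
# of the standard genetic code ("*" marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq):
    """Translate a coding sequence, ignoring whitespace; drop a final stop."""
    cleaned = "".join(seq.split()).upper()
    if len(cleaned) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = "".join(
        CODON_TABLE[cleaned[i:i + 3]] for i in range(0, len(cleaned), 3)
    )
    # Alternative start codons (e.g. GTG, TTG) are NOT converted to M here.
    return protein[:-1] if protein.endswith("*") else protein


# Usage: the first 30 bases of the gene sequence above.
print(translate("ATGGTTCATG CTCAGAGCGC TTCACCTCGC"))  # MVHAQSASPR
```

The printed residues match the first block of the protein sequence listed above.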