Gene SeD_A4694 details

Gene Information

Locus tag: SeD_A4694
Symbol:
ID: 6873344
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853
Kingdom: Bacteria
Replicon accession: NC_011205
Strand:
Start bp: 4558585
End bp: 4560015
Gene Length: 1431 bp
Protein Length: 476 aa
Translation table: 11
GC content: 53%
IMG OID: 642787592
Product: melibiose:sodium symporter
Protein accession: YP_002218190
Protein GI: 198241954
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2211] Na+/melibiose symporter and related transporters
TIGRFAM ID: [TIGR00792] sugar (Glycoside-Pentoside-Hexuronide) transporter
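
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the original record: it assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names are hypothetical.

```python
# Values copied from the record above.
START_BP = 4558585
END_BP = 4560015
GENE_LENGTH_BP = 1431
PROTEIN_LENGTH_AA = 476


def span_length(start: int, end: int) -> int:
    """Length of a closed interval [start, end] (assumed 1-based, inclusive)."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Amino-acid count for a CDS, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Both checks hold for this record under the stated assumptions.
coordinates_consistent = span_length(START_BP, END_BP) == GENE_LENGTH_BP
protein_consistent = expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
```

Here 4560015 - 4558585 + 1 = 1431 bp, and 1431 / 3 = 477 codons, of which one is the stop codon, leaving 476 amino acids.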


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.772252
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 75
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCATCT CTCTGACAAC AAAGCTGAGT TACGGGTTCG GTGCGTTTGG TAAGGATTTC 
GCCATCGGCA TTGTGTATAT GTACCTGATG TATTACTACA CCGATGTGGT GGGACTTTCG
GTCGGCCTCG TCGGCACCCT CTTTCTGGTC GCGCGAATCT GGGATGCGAT AAACGATCCC
ATCATGGGCT GGATTGTCAA CGCCACGCGT TCGCGGTGGG GGAAATTTAA GCCGTGGATA
TTGATCGGCA CCTTAACCAA TTCGCTGGTG CTTTTCCTGC TGTTCAGCGC CCATCTTTTT
GAGGGAACCG CGCAGGTTGT ATTTGTCTGC GTAACCTACA TCCTGTGGGG CATGACGTAT
ACCATTATGG ATATCCCATT TTGGTCGCTG GTGCCGACCA TTACGCTTGA TAAGCGAGAA
CGCGAACAAC TGGTGCCGTT CCCGCGTTTC TTCGCCAGTC TGGCTGGCTT CGTCACTGCC
GGTATAACGC TGCCGTTTGT GAACTACGTT GGTGGAGCGG ATCGTGGGTT CGGCTTTCAG
ATGTTTACGC TGGTACTGAT TGCGTTTTTT ATCGCCTCGA CTATCGTGAC ATTACGCAAC
GTACATGAGG TGTACTCCTC CGACAACGGT GTAACGGCGG GCCGCCCACA TCTGACGTTA
AAAACGATCG TTGGATTGAT ATACAAAAAC GATCAGCTCT CTTGCCTGTT GGGAATGGCG
CTGGCGTATA ACATTGCCTC CAATATTATC AATGGCTTTG CGATCTACTA CTTCACCTAT
GTGATTGGCG ATGCCGATCT TTTTCCCTAT TACCTTTCTT ACGCCGGCGC GGCGAATCTG
CTGACGCTGA TTGTCTTCCC CCGGCTGGTG AAAATGTTAT CGCGGCGGAT ATTGTGGGCG
GGCGCCTCCG TGATGCCCGT TCTGAGTTGC GCAGGGCTCT TCGCGATGGC GTTGGCGGAT
GTCCATAATG CCGCTTTAAT CGTGGCGGCG GGTATTTTCC TGAATATCGG GACCGCGCTC
TTTTGGGTGC TTCAGGTGAT CATGGTGGCG GATACGGTCG ATTATGGGGA ATTTAAGCTC
AATATTCGCT GCGAGAGTAT CGCTTATTCC GTACAGACGA TGGTCGTGAA GGGCGGCTCG
GCGTTTGCGG CGTTCTTTAT CGCTTTGGTG CTGGGGCTGA TTGGCTACAC GCCGAACGTG
GCGCAGTCTG CGCAAACCCT GCAGGGGATG CAGTTTATTA TGATTGTCCT GCCGGTACTG
TTTTTCATGA TGACGTTGGT TCTCTACTTC CGCTACTACC GTTTGAACGG CGATATGCTG
CGCAAGATTC AGATCCACCT GCTGGATAAA TACCGGAAAA CGCCGCCATT CGTCGAACAG
CCGGATAGCC CGGCGATTTC TGTGGTAGCG ACCAGCGATG TAAAAGCGTG A
 
Protein sequence
MSISLTTKLS YGFGAFGKDF AIGIVYMYLM YYYTDVVGLS VGLVGTLFLV ARIWDAINDP 
IMGWIVNATR SRWGKFKPWI LIGTLTNSLV LFLLFSAHLF EGTAQVVFVC VTYILWGMTY
TIMDIPFWSL VPTITLDKRE REQLVPFPRF FASLAGFVTA GITLPFVNYV GGADRGFGFQ
MFTLVLIAFF IASTIVTLRN VHEVYSSDNG VTAGRPHLTL KTIVGLIYKN DQLSCLLGMA
LAYNIASNII NGFAIYYFTY VIGDADLFPY YLSYAGAANL LTLIVFPRLV KMLSRRILWA
GASVMPVLSC AGLFAMALAD VHNAALIVAA GIFLNIGTAL FWVLQVIMVA DTVDYGEFKL
NIRCESIAYS VQTMVVKGGS AFAAFFIALV LGLIGYTPNV AQSAQTLQGM QFIMIVLPVL
FFMMTLVLYF RYYRLNGDML RKIQIHLLDK YRKTPPFVEQ PDSPAISVVA TSDVKA
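
The relationship between the gene sequence and the protein sequence above can be illustrated with a short translation routine. This is a sketch under stated assumptions rather than the method used to produce the record: it uses the standard codon-to-amino-acid assignments, which translation table 11 shares for internal codons (table 11 differs in its permitted start codons, which does not matter here because the gene starts with ATG). The names `CODON_TABLE` and `translate` are hypothetical.

```python
# Standard codon assignments in TCAG order; assumed valid for table 11
# internal codons.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence, as grouped in the record.
first_ten = translate("ATGAGCATCT CTCTGACAAC AAAGCTGAGT")  # "MSISLTTKLS"

# Final line of the gene sequence, which begins in frame and ends in TGA.
last_sixteen = translate(
    "CCGGATAGCC CGGCGATTTC TGTGGTAGCG ACCAGCGATG TAAAAGCGTG A"
)  # "PDSPAISVVATSDVKA"
```

Both fragments agree with the corresponding ends of the listed protein sequence (`MSISLTTKLS` at the start, `PDSPAISVVA TSDVKA` at the end).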