Gene SeAg_B4563 details


Gene Information

Locus tag: SeAg_B4563
Symbol:
ID: 6795232
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Agona str. SL483
Kingdom: Bacteria
Replicon accession: NC_011149
Strand:
Start bp: 4465950
End bp: 4467380
Gene Length: 1431 bp
Protein Length: 476 aa
Translation table: 11
GC content: 53%
IMG OID: 642778649
Product: melibiose:sodium symporter
Protein accession: YP_002149215
Protein GI: 197247890
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2211] Na+/melibiose symporter and related transporters
TIGRFAM ID: [TIGR00792] sugar (Glycoside-Pentoside-Hexuronide) transporter
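
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive and that the gene length includes the stop codon. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp, end_bp):
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def expected_protein_length_aa(length_bp):
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1  # subtract the stop codon


length = gene_length_bp(4465950, 4467380)
print(length)                              # 1431
print(expected_protein_length_aa(length))  # 476
```

Both values agree with the Gene Length and Protein Length fields of the record.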


Plasmid Coverage information

Num covering plasmid clones: 41
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGCATCT CTCTGACAAC AAAGCTGAGT TACGGGTTCG GTGCGTTTGG TAAGGATTTC 
GCCATCGGCA TTGTGTATAT GTACCTGATG TATTACTACA CCGATGTGGT GGGACTCTCG
GTCGGCCTCG TCGGCACCCT CTTTCTGGTC GCGCGAATCT GGGATGCGAT AAACGATCCC
ATCATGGGCT GGATTGTCAA CGCCACGCGT TCGCGGTGGG GGAAATTTAA GCCGTGGATA
TTGATCGGCA CCTTAACCAA TTCGCTGGTG CTTTTCCTGC TGTTCAGCGC CCATCTTTTT
GAGGGAACCG CGCAGGTTGT ATTTGTCTGC GTAACCTACA TCCTGTGGGG CATGACGTAT
ACCATTATGG ATATCCCATT TTGGTCGCTG GTGCCGACCA TTACGCTTGA TAAGCGAGAA
CGCGAACAAC TGGTGCCGTT CCCGCGTTTC TTCGCCAGCC TGGCTGGCTT CGTCACTGCC
GGTATAACGC TGCCGTTTGT GAACTACGTT GGCGGAGCGG ATCGTGGGTT CGGCTTTCAG
ATGTTTACGC TGGTACTGAT TGCGTTTTTT ATCGCCTCGA CTATCGTGAC ATTACGCAAC
GTCCATGAGG TGTACTCCTC CGACAACGGT GTAACGGCGG GCCGCCCACA TCTGACGTTA
AAAACGATCG TTGGATTGAT ATACAAAAAC GATCAGCTCT CTTGCCTGTT GGGAATGGCG
CTGGCGTATA ACATTGCCTC TAATATTATC AATGGCTTTG CGATCTACTA CTTCACCTAT
GTGATTGGCG ATGCCGATCT TTTTCCCTAT TACCTTTCTT ACGCCGGCGC GGCGAATCTG
CTGACGCTGA TTGTCTTCCC CCGGCTGGTG AAAATGTTAT CGCGACGGAT ATTGTGGGCG
GGCGCCTCCG TGATGCCCGT TCTGAGTTGC GCAGGGCTCT TCGCGATGGC GTTGGCGGAT
GTCCATAATG CCGCTTTAAT CGTGGCGGCG GGTATTTTCC TGAATATCGG GACCGCGCTC
TTTTGGGTGC TTCAGGTGAT CATGGTGGCG GATACGGTCG ATTATGGGGA ATTTAAGCTC
AATATTCGCT GCGAGAGTAT CGCTTATTCC GTACAGACGA TGGTTGTGAA GGGCGGCTCG
GCGTTTGCGG CGTTCTTTAT CGCTCTGGTG CTGGGGCTGA TTGGCTACAC GCCGAACGTG
GCGCAGTCTG CGCAAACCCT GCAGGGGATG CAGTTTATTA TGATTGTCCT GCCGGTACTG
TTTTTCATGA TGACGTTGGT TCTCTACTTC CGCTACTACC GTTTGAACGG CGACATGCTG
CGCAAGATTC AGATCCACCT GCTGGATAAA TACCGGAAAA CGCCGCCATT CGTCGAACAG
CCGGATAGCC CGGCGATTTC TGTGGTAGCG ACCAGCGATG TAAAAGCGTG A
 
Protein sequence
MSISLTTKLS YGFGAFGKDF AIGIVYMYLM YYYTDVVGLS VGLVGTLFLV ARIWDAINDP 
IMGWIVNATR SRWGKFKPWI LIGTLTNSLV LFLLFSAHLF EGTAQVVFVC VTYILWGMTY
TIMDIPFWSL VPTITLDKRE REQLVPFPRF FASLAGFVTA GITLPFVNYV GGADRGFGFQ
MFTLVLIAFF IASTIVTLRN VHEVYSSDNG VTAGRPHLTL KTIVGLIYKN DQLSCLLGMA
LAYNIASNII NGFAIYYFTY VIGDADLFPY YLSYAGAANL LTLIVFPRLV KMLSRRILWA
GASVMPVLSC AGLFAMALAD VHNAALIVAA GIFLNIGTAL FWVLQVIMVA DTVDYGEFKL
NIRCESIAYS VQTMVVKGGS AFAAFFIALV LGLIGYTPNV AQSAQTLQGM QFIMIVLPVL
FFMMTLVLYF RYYRLNGDML RKIQIHLLDK YRKTPPFVEQ PDSPAISVVA TSDVKA
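
The relationship between the gene sequence and the protein sequence can be reproduced with a short script. This is a minimal sketch, not the method used to build this record. It assumes the sequence blocks are whitespace-separated as printed, and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (table 11 differs in which codons may act as starts, which does not matter here because this gene begins with ATG). The helper names `clean`, `gc_fraction` and `translate` are hypothetical.

```python
# Codon table in the conventional TCAG ordering (same assignments for
# NCBI translation tables 1 and 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used in the display format."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate in frame 1, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First display line of the gene sequence above (60 bases).
first_line = ("ATGAGCATCT CTCTGACAAC AAAGCTGAGT "
              "TACGGGTTCG GTGCGTTTGG TAAGGATTTC")
print(translate(first_line))  # MSISLTTKLSYGFGAFGKDF
```

The printed residues match the first 20 residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow the Protein Length and GC content fields to be checked in the same way.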