Gene SeAg_B4710 details


Gene Information

Locus tag: SeAg_B4710
Symbol:
ID: 6795537
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Agona str. SL483
Kingdom: Bacteria
Replicon accession: NC_011149
Strand:
Start bp: 4603521
End bp: 4604687
Gene Length: 1167 bp
Protein Length: 388 aa
Translation table: 11
GC content: 61%
IMG OID: 642778783
Product: major facilitator superfamily transporter
Protein accession: YP_002149345
Protein GI: 197247375
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
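
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are inclusive (an assumption, though it is consistent with the reported gene length) and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the record; variable names are illustrative.
start_bp = 4603521
end_bp = 4604687
reported_gene_length = 1167
reported_protein_length = 388

# Assumption: coordinates are inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# A CDS is a whole number of codons; the last codon is assumed to be a
# stop codon, which does not contribute a residue to the protein.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 1167 0 388
print(gene_length == reported_gene_length)
print(protein_length == reported_protein_length)
```

Both comparisons hold for this record: 1167 bp is 389 codons, which is 388 residues plus one stop codon.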


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.019834
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCAGCTC TCACACAACA TTCCACATCG ATCACGCTGT TTCGCATCAG CGCCGCGATG 
TTTCTCAACT ATCTGACTAT TGGCATCCCG TTAGTCATGC TGCCGCTGTA CGTTCAGCAG
CAGTTGCATC TGAGCGATCT GTTGATCGGT ATCGCGGTAG GCAGCCAGTT TATCGCCACC
TTGCTTACCC GGGGCGCCGC GGGTCGAAAA GCCGACACGT CAGGCGGACG TCGGACGGTT
ATCACCGGTC AGTTCTACTG CGCCGCATCA GGTCTGCTGA TGCTGGTCAG TCTAATCGCT
CACCCGGTTC CGCTGCTCGC ATGGGCCATA CTGATTGTTG GTCGTGTGCT GCTGGGCATC
GGTGAAAGTT TCATTTTGAC CGGTAATTTA ACCTGGGGGA TGTGGCTGGC GGGCTCGACC
CACGCGGGTC AGGTTATTTC CTGGAACGGC ATGGCGACCT ACGGTGCGCT GGCTATCGGC
GCGCCGCTGG GGCTGTCGCT TTACGCCAGG GCCGGCCTGG CGCTTCCGGC GCTCCTCGTC
GTGCTGTTGC CGATCATCGC CAGCGGAGTC ATCTATGGCA TTCCGGGGAA TATCCCAACA
GCCCGTCCCC GCGTGCCAGT GCTACGGGTC GTCGGCCTGG TGTGGCGGCC CGGAACAGGG
CTTGTGCTCC AGGGCATTGG TTTTGCCACC CTAAGCGCGT TTACCGCCCT CTGGTTCAAC
GAGCGTCACT GGGATAACAC CGGTTTTGCG ATGACGCTGT TTGGCATTGC GTTTATCGCT
GTCCGCTTTT TCTGCGCCAA ATTTCCCGAC CGTTACGGCG GCGCCACGGT CGCAACCTTT
TCACTACTGG TAGAGGGGAC CGGACTGGCG GTAATGTGGG CCGCTCCGTC AGCAGGCGCT
GCGCTGATTG GCGCGGCGAT TACCGGCTGC GGCTGTTCGC TGATGTTCCC ATCGCTGGGC
GTAGAGGTTG TGCGCAGAGT GCCACCGGAG ATTCGTGGTA CGGCCCTCGG CGTCTGGTCA
GCGTTTCAGG ATCTGGCCTA CGGATTTACC GGACCAATAG CGGGCCTGCT GACGCCGTTT
ATCGGCTATC AGCAGGTCTT TCTTCTGGCC GCGGCCTGCG CGCTGCTGGG CGCGGCGGTT
GTCCATCTCC TGCTGCGGCA ACATTAA
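
The record reports a GC content of 61% for this gene. A figure of that kind could be recomputed from the sequence blocks along the following lines. This is a minimal sketch: the function name gc_percent is hypothetical, and the example input is a toy string rather than the gene itself.

```python
def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence.

    Whitespace (the spaces and line breaks that separate the 10-base
    blocks in the record) is ignored.
    """
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# Toy input: 6 of the 8 bases are G or C.
print(gc_percent("ATGC GGCC"))  # 75.0
```

Pasting the full gene sequence into the function would give the value to compare against the rounded figure in the record.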
 
Protein sequence
MPALTQHSTS ITLFRISAAM FLNYLTIGIP LVMLPLYVQQ QLHLSDLLIG IAVGSQFIAT 
LLTRGAAGRK ADTSGGRRTV ITGQFYCAAS GLLMLVSLIA HPVPLLAWAI LIVGRVLLGI
GESFILTGNL TWGMWLAGST HAGQVISWNG MATYGALAIG APLGLSLYAR AGLALPALLV
VLLPIIASGV IYGIPGNIPT ARPRVPVLRV VGLVWRPGTG LVLQGIGFAT LSAFTALWFN
ERHWDNTGFA MTLFGIAFIA VRFFCAKFPD RYGGATVATF SLLVEGTGLA VMWAAPSAGA
ALIGAAITGC GCSLMFPSLG VEVVRRVPPE IRGTALGVWS AFQDLAYGFT GPIAGLLTPF
IGYQQVFLLA AACALLGAAV VHLLLRQH
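
The relationship between the gene sequence and the protein sequence can be sketched as a codon-by-codon translation. Translation table 11 (bacterial, archaeal and plant plastid) uses the same codon-to-amino-acid assignments as the standard code and differs in its set of permitted start codons; since this gene starts with ATG, the sketch below does not handle alternative start codons. The names CODON_TABLE and translate are illustrative and do not come from the record.

```python
from itertools import product

# Codon assignments in the conventional TCAG ordering (TTT, TTC, TTA, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a nucleotide sequence, stopping at the first stop codon.

    Whitespace between sequence blocks is ignored. Alternative start
    codons are not special-cased (an assumption that is safe for ATG).
    """
    bases = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        amino_acid = CODON_TABLE[bases[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence, then the final three blocks.
print(translate("ATGCCAGCTC TCACACAACA TTCCACATCG"))  # MPALTQHSTS
print(translate("GTCCATCTCC TGCTGCGGCA ACATTAA"))     # VHLLLRQH
```

The two outputs match the first block and the last block of the protein sequence listed above.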