Gene SeAg_B1458 details

Gene Information

Locus tag: SeAg_B1458
Symbol:
ID: 6793797
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Agona str. SL483
Kingdom: Bacteria
Replicon accession: NC_011149
Strand:
Start bp: 1417806
End bp: 1419455
Gene Length: 1650 bp
Protein Length: 549 aa
Translation table: 11
GC content: 54%
IMG OID: 642775701
Product: peptide transport periplasmic protein SapA
Protein accession: YP_002146337
Protein GI: 197249778
COG category: [E] Amino acid transport and metabolism
COG ID: [COG4166] ABC-type oligopeptide transport system, periplasmic component
TIGRFAM ID:
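
The coordinate and length fields in this record are mutually consistent, which can be checked with a few lines of arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length; neither convention is stated in the record itself, and the variable names are illustrative.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1417806
end_bp = 1419455

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and adds no residue.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints "1650 549"
```

Both results agree with the Gene Length (1650 bp) and Protein Length (549 aa) fields.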


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value: 0.483006
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCGCCTGG TTTTATCGTC TCTGATCGTG ATAGCGGGTC TACTGAGTAG TCAGGCTACG 
GCTGCGACTG CGCCCGAACA AACTGCGAGT GCAGATATTC GCGATAGCGG CTTTGTGTAT
TGTGTTAGCG GGCAGGTCAA CACCTTTAAT CCGCAAAAAG CGAGCAGCGG CCTCATCGTC
GATACCCTGG CCGCCCAGTT ATATGATCGC CTGTTGGATG TCGATCCCTA TACTTATCGT
TTAGTCCCAG AGCTGGCAGA AAGCTGGGAA GTGCTGGATA ACGGGGCAAC GTACCGTTTT
CACCTGCGCC GCGACGTTTC CTTTCAAAAA ACCGCCTGGT TTACGCCGAC CCGAAAACTC
AATGCTGATG ATGTCGTCTT TACCTTTCAG CGGATTTTCG ATCGTCGACA TCCGTGGCAT
AACATCAACG GCAGTAGCTT CCCCTACTTT GATAGCCTAC AGTTCGCCGA CAATGTAAAA
AGCGTGCGTA AGCTGGACAA TAACACCGTT GAGTTTCGCC TGACGCAGCC AGACGCCTCC
TTTTTATGGC ATCTGGCCAC ACACTACGCT TCCGTCATGT CCGCTGAGTA CGCCGCGCAG
CTTAGCCGAA AAGATCGTCA GGAACTGCTA GACCGCCAAC CGGTTGGCAC CGGGCCTTTC
CAGCTTTCGG AGTACCGTGC CGGGCAGTTT ATTCGTCTCC AGCGCCACGA TGGGTTTTGG
CGCGGCAAAC CGCTGATGCC GCAAGTGGTG GTGGATTTAG GCTCCGGCGG TACCGGGCGT
TTATCGAAAT TACTGACCGG TGAATGCGAT GTTCTGGCCT GGCCCGCCGC CAGCCAGCTA
ACTATTTTAC GCGACGATCC GCGTTTACGT CTGACGTTGC GCCCGGGGAT GAATATCGCC
TATCTGGCCT TTAACACCGA TAAGCCGCCG TTGAATAATC CCGCAGTGCG CCATGCGCTG
GCCTTATCGA TCAACAACCA GCGTCTGATG CAGTCGATTT ATTACGGCAC GGCGGAAACC
GCAGCCTCCA TTTTACCGAG AGCCTCATGG GCTTACGATA ACGATGCCAA AATTACGGAG
TACAATCCGG AAAAATCGCG CGAACAGCTA AAAGCGCTGG GCATTGAGAA TCTTACGCTG
CATCTCTGGG TGCCGACCAG TTCTCAGGCC TGGAACCCAA GTCCGCTAAA AACGGCGGAG
CTTATTCAGG CGGATATGGC GCAGGTTGGC GTAAAAGTGG TCATTGTGCC GGTTGAAGGT
CGTTTTCAGG AGGCGCGCCT GATGGATATG AATCACGATC TGACCTTATC CGGCTGGGCC
ACGGACAGCA ACGATCCGGA TAGCTTTTTC AGACCACTGT TAAGCTGTGC GGCCATCAAT
TCGCAAACCA ATTTCGCCCA CTGGTGTAAC CCTGAATTTG ACAGCGTGCT GCGTAAAGCA
CTGTCGTCGC AGCAGTTGGC TTCGCGCATA GAAGCATATG AGGAAGCGCA GAATATCCTG
GAGAAAGAGC TGCCGATACT GCCGCTGGCA TCATCACTAC GCCTGCAGGC TTACCGCTAC
GATATTAAAG GGCTGGTGTT AAGCCCGTTC GGCAATGCGT CTTTTGCCGG CGTCTCCCGC
GAAAAACACG AAGAGGTGAA AAAACCATGA
 
Protein sequence
MRLVLSSLIV IAGLLSSQAT AATAPEQTAS ADIRDSGFVY CVSGQVNTFN PQKASSGLIV 
DTLAAQLYDR LLDVDPYTYR LVPELAESWE VLDNGATYRF HLRRDVSFQK TAWFTPTRKL
NADDVVFTFQ RIFDRRHPWH NINGSSFPYF DSLQFADNVK SVRKLDNNTV EFRLTQPDAS
FLWHLATHYA SVMSAEYAAQ LSRKDRQELL DRQPVGTGPF QLSEYRAGQF IRLQRHDGFW
RGKPLMPQVV VDLGSGGTGR LSKLLTGECD VLAWPAASQL TILRDDPRLR LTLRPGMNIA
YLAFNTDKPP LNNPAVRHAL ALSINNQRLM QSIYYGTAET AASILPRASW AYDNDAKITE
YNPEKSREQL KALGIENLTL HLWVPTSSQA WNPSPLKTAE LIQADMAQVG VKVVIVPVEG
RFQEARLMDM NHDLTLSGWA TDSNDPDSFF RPLLSCAAIN SQTNFAHWCN PEFDSVLRKA
LSSQQLASRI EAYEEAQNIL EKELPILPLA SSLRLQAYRY DIKGLVLSPF GNASFAGVSR
EKHEEVKKP
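
The gene and protein sequences above are printed in space-separated blocks of ten characters. As a rough sketch of how the two relate, the code below strips that whitespace, translates the DNA codon by codon, and computes a GC percentage. It assumes the codon assignments of the standard genetic code, which translation table 11 shares (table 11 differs in which codons may act as start codons). The helper names `clean`, `translate`, and `gc_percent` are hypothetical and not part of any database tooling.

```python
# Codon table built from the conventional T, C, A, G ordering.
# Assumption: standard codon assignments, which table 11 also uses.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used for display, and uppercase."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate DNA in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_percent(dna):
    """Percentage of bases that are G or C."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence, copied from the record.
first_line = "ATGCGCCTGG TTTTATCGTC TCTGATCGTG ATAGCGGGTC TACTGAGTAG TCAGGCTACG"
print(translate(first_line))  # prints "MRLVLSSLIVIAGLLSSQAT"
```

The translated fragment matches the first twenty residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_percent` would be the way to compare against the complete protein sequence and the GC content field.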