Gene SeSA_A1817 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeSA_A1817 
Symbol 
ID6516008 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Schwarzengrund str. CVM19633 
KingdomBacteria 
Replicon accessionNC_011094 
Strand
Start bp1756853 
End bp1758502 
Gene Length1650 bp 
Protein Length549 aa 
Translation table11 
GC content54% 
IMG OID642746918 
Productpeptide transport periplasmic protein SapA 
Protein accessionYP_002114721 
Protein GI194734041 
COG category[E] Amino acid transport and metabolism 
COG ID[COG4166] ABC-type oligopeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.690019 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.173891 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCCTGG TTTTATCGTC TCTGATCGTG ATAGCGGGTC TACTGAGTAG TCAGGCTACG 
GCTGCGGCTG CGCCCGAACA AACTGCGAGT GCAGATATTC GCGATAGCGG CTTTGTGTAT
TGTGTTAGCG GGCAGGTCAA CACCTTTAAT CCGCAAAAAG CGAGCAGCGG CCTCATCGTC
GATACCCTGG CCGCCCAGTT ATATGATCGC CTGTTGGATG TCGATCCCTA TACTTATCGT
TTAGTCCCAG AGCTGGCAGA AAGCTGGGAA GTGCTGGATA ACGGGGCAAC GTACCGTTTT
CACCTGCGCC GCGACGTTTC CTTTCAAAAA ACCGCCTGGT TTACGCCGAC CCGAAAACTC
AATGCTGATG ATGTTGTCTT TACCTTTCAG CGGATTTTTG ATCGTCGGCA TCCGTGGCAT
AACATCAACG GCAGTAGCTT CCCCTACTTT GATAGCCTAC AGTTCGCCGA CAATGTAAAA
AGCGTGCGTA AGCTGGACAA TAACACCGTT GAGTTTCGCC TGACGCAGCC AGACGCCTCC
TTTTTATGGC ATCTGGCTAC ACACTACGCT TCCGTCATGT CCGCTGAGTA CGCCGCGCAG
CTTAGCCGAA AAGATCGTCA GGAACTGCTA GACCGCCAAC CGGTCGGCAC CGGGCCTTTC
CAGCTTTCGG AGTACCGTGC CGGGCAGTTT ATTCGTCTCC AGCGCCACGA TGGTTTTTGG
CGCGGCAAAC CGCTGATGCC GCAAGTAGTA GTGGATTTAG GCTCCGGCGG TACCGGGCGT
TTATCAAAAT TACTGACCGG TGAATGCGAT GTTCTGGCCT GGCCCGCCGC CAGCCAGCTA
ACTATTTTAC GCGACGATCC GCGTTTGCGC CTGACGTTGC GCCCGGGGAT GAATATCGCC
TATCTGGCCT TTAACACCGA TAAGCCGCCG TTGAATAATC CCGCAGTGCG CCATGCGCTG
GCCTTATCGA TCAACAACCA GCGCCTGATG CAGTCGATTT ATTACGGCAC GGCAGAAACC
GCAGCCTCCA TTTTACCGAG AGCCTCATGG GCTTACGATA ACGATGCCAA AATTACGGAG
TACAATCCGG AAAAATCGCG CGAACAGCTA AAAGCGCTGG GCATTGAGAA TCTTACGCTG
CATCTCTGGG TGCCGACTAG TTCTCAGGCC TGGAACCCAA GTCCGCTAAA AACGGCGGAG
CTTATTCAGG CGGATATGGC GCAGATTGGC GTAAAAGTGG TCATTGTGCC GGTTGAAGGT
CGTTTCCAGG AGGCGCGCCT GATGGATATG AATCACGATC TGACCTTATC CGGCTGGGCC
ACGGACAGCA ACGATCCGGA TAGCTTTTTC AGACCGCTGT TAAGCTGTGC AGCCATCAAT
TCGCAAACCA ATTTCGCCCA CTGGTGTAAC CCTGAATTTG ACAGCGTGCT GCGTAAAGCA
CTGTCGTCGC AGCAGTTGGC TTCGCGCATA GAAGCGTATG AGGAAGCGCA GAATATCCTG
GAGAAAGAGC TGCCGATACT GCCGCTGGCA TCATCACTAC GCCTGCAGGC TTACCGCTAC
GATATTAAAG GGCTGGTGTT AAGCCCGTTC GGCAATGCGT CTTTTGCCGG CGTCTCCCGC
GAAAAACACG AAGAGGTGAA AAAACCATGA
 
Protein sequence
MRLVLSSLIV IAGLLSSQAT AAAAPEQTAS ADIRDSGFVY CVSGQVNTFN PQKASSGLIV 
DTLAAQLYDR LLDVDPYTYR LVPELAESWE VLDNGATYRF HLRRDVSFQK TAWFTPTRKL
NADDVVFTFQ RIFDRRHPWH NINGSSFPYF DSLQFADNVK SVRKLDNNTV EFRLTQPDAS
FLWHLATHYA SVMSAEYAAQ LSRKDRQELL DRQPVGTGPF QLSEYRAGQF IRLQRHDGFW
RGKPLMPQVV VDLGSGGTGR LSKLLTGECD VLAWPAASQL TILRDDPRLR LTLRPGMNIA
YLAFNTDKPP LNNPAVRHAL ALSINNQRLM QSIYYGTAET AASILPRASW AYDNDAKITE
YNPEKSREQL KALGIENLTL HLWVPTSSQA WNPSPLKTAE LIQADMAQIG VKVVIVPVEG
RFQEARLMDM NHDLTLSGWA TDSNDPDSFF RPLLSCAAIN SQTNFAHWCN PEFDSVLRKA
LSSQQLASRI EAYEEAQNIL EKELPILPLA SSLRLQAYRY DIKGLVLSPF GNASFAGVSR
EKHEEVKKP