Gene SNSL254_A1816 details


Gene Information

Locus tag: SNSL254_A1816
Symbol:
ID: 6483696
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Newport str. SL254
Kingdom: Bacteria
Replicon accession: NC_011080
Strand:
Start bp: 1780034
End bp: 1781683
Gene Length: 1650 bp
Protein Length: 549 aa
Translation table: 11
GC content: 54%
IMG OID: 642737192
Product: peptide transport periplasmic protein SapA
Protein accession: YP_002040944
Protein GI: 194443730
COG category: [E] Amino acid transport and metabolism
COG ID: [COG4166] ABC-type oligopeptide transport system, periplasmic component
TIGRFAM ID:
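
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the record does not state this, but the numbers are consistent with it) and that the CDS is complete and ends in a single stop codon. The function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields above.
start_bp = 1780034
end_bp = 1781683
gene_length_bp = 1650
protein_length_aa = 549


def span_length(start, end):
    """Length of a closed interval [start, end] (assumed 1-based, inclusive)."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Residues encoded by a complete CDS: number of codons minus one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


coords_ok = span_length(start_bp, end_bp) == gene_length_bp
protein_ok = expected_protein_length(gene_length_bp) == protein_length_aa
print(coords_ok, protein_ok)
```

Under these assumptions both checks pass: 1781683 - 1780034 + 1 = 1650 bp, and 1650 / 3 - 1 = 549 residues.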


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.440252
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value: 0.00000000112139
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGCGCCTGG TTTTATCGTC TCTGATCGTG ATAGCGGGTC TACTAAGTAG TCAGGCTACG 
GCTGCGGCTG CGCCCGAACA AACTGCGAGT GCAGATATTC GCGATAGCGG CTTTGTGTAT
TGTGTCAGCG GGCAGGTCAA CACCTTTAAT CCGCAAAAAG CGAGCAGCGG CCTTATCGTC
GATACCCTGG CAGCCCAGTT ATATGACCGC CTGTTGGATG TCGATCCCTA TACTTATCGT
TTAGTCCCAG AGCTGGCAGA AAGCTGGGAA GTGCTGGATA ACGGGGCAAC GTACCGTTTT
CACCTGCGCC GCGACGTTTC CTTTCAAAAA ACCGCCTGGT TTACGCCGAC CCGAAAACTC
AATGCTGATG ATGTCGTCTT TACCTTTCAG CGGATTTTTG ATCGTCGGCA TCCGTGGCAT
AACATCAACG GCAGTAGCTT CCCCTACTTT GATAGCCTAC AGTTCGCCGA CAATGTAAAA
AGCGTGCGTA AGCTGGACAA TAACACCGTT GAGTTTCGCC TGACGCAGCC AGACGCCTCC
TTTTTATGGC ATCTGGCTAC ACACTACGCT TCCGTCATGT CCGCTGAGTA CGCCGCGCAG
CTTAGCCGAA AAGATCGTCA GGAACTGCTA GACCGCCAAC CGGTCGGCAC CGGGCCTTTC
CAGCTTTCGG AGTACCGTGC CGGACAGTTT ATTCGTCTCC AGCGCCACGA TGGGTTTTGG
CGCGGCAAAC CGCTGATGCC GCAAGTGGTA GTGGATTTAG GCTCCGGCGG TACCGGGCGT
TTATCGAAAT TACTGACCGG TGAATGCGAT GTTCTGGCCT GGCCCGCCGC CAGCCAGCTA
ACTATTTTAC GCGACGATCC GCGTTTGCGC CTGACGTTGC GCCCGGGGAT GAATATCGCC
TATCTGGCCT TTAACACCGA TAAGCCGCCG TTGAATAATC CCGCAGTGCG CCATGCGCTG
GCCTTATCGA TCAACAACCA GCGTCTGATG CAGTCGATTT ATTACGGCAC GGCGGAAACC
GCAGCCTCCA TTTTACCGAG AGCCTCATGG GCTTACGATA ACGATGCCAA AATTACGGAG
TACAATCCGG AAAAATCGCG CGAACAGCTA AAAGCGCTGG GCATTGAGAA TCTTACGCTG
CATCTCTGGG TGCCGACCAG TTCTCAGGCC TGGAACCCAA GTCCGCTAAA AACGGCGGAG
CTTATTCAGG CGGATATGGC GCAGGTTGGC GTAAAAGTGG TCATTGTGCC GGTTGAAGGT
CGTTTTCAGG AGGCGCGCCT GATGGATATG AATCACGATC TGACCTTATC CGGCTGGGCC
ACGGACAGCA ACGATCCGGA TAGCTTTTTC AGACCGCTGT TAAGCTGTGC GGCCATCAAT
TCGCAAACCA ATTTCGCCCA CTGGTGTAAC CCTGAATTTG ACAGCGTGCT GCGTAAGGCA
CTGTCGTCGC AGCAGTTGGC TTCGCGCATA GAAGCGTATG AGGAAGCGCA GAATATCCTG
GAGAAAGAGC TGCCGATACT GCCGCTGGCA TCATCACTAC GCCTGCAGGC TTACCGCTAC
GATATTAAAG GGCTGGTGTT AAGCCCGTTC GGCAATGCGT CTTTTGCCGG CGTCTCCCGC
GAAAAACACG AAGAGGTGAA AAAACCATGA
 
Protein sequence
MRLVLSSLIV IAGLLSSQAT AAAAPEQTAS ADIRDSGFVY CVSGQVNTFN PQKASSGLIV 
DTLAAQLYDR LLDVDPYTYR LVPELAESWE VLDNGATYRF HLRRDVSFQK TAWFTPTRKL
NADDVVFTFQ RIFDRRHPWH NINGSSFPYF DSLQFADNVK SVRKLDNNTV EFRLTQPDAS
FLWHLATHYA SVMSAEYAAQ LSRKDRQELL DRQPVGTGPF QLSEYRAGQF IRLQRHDGFW
RGKPLMPQVV VDLGSGGTGR LSKLLTGECD VLAWPAASQL TILRDDPRLR LTLRPGMNIA
YLAFNTDKPP LNNPAVRHAL ALSINNQRLM QSIYYGTAET AASILPRASW AYDNDAKITE
YNPEKSREQL KALGIENLTL HLWVPTSSQA WNPSPLKTAE LIQADMAQVG VKVVIVPVEG
RFQEARLMDM NHDLTLSGWA TDSNDPDSFF RPLLSCAAIN SQTNFAHWCN PEFDSVLRKA
LSSQQLASRI EAYEEAQNIL EKELPILPLA SSLRLQAYRY DIKGLVLSPF GNASFAGVSR
EKHEEVKKP
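
The gene and protein sequences above are printed in space-separated blocks of ten. As a rough sketch of how the two relate, the snippet below strips the block spacing, computes GC content, and translates a nucleotide sequence. It assumes that the codon-to-amino-acid assignments of translation table 11 match the standard genetic code (the two tables differ in permitted start codons, which this sketch does not model). All helper names are hypothetical, and only the first line of the gene sequence is used as input.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G); '*' marks a stop.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove block spacing and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_percent(seq):
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First line of the gene sequence, copied from the record above.
first_line = "ATGCGCCTGG TTTTATCGTC TCTGATCGTG ATAGCGGGTC TACTAAGTAG TCAGGCTACG"
print(translate(first_line))  # MRLVLSSLIVIAGLLSSQAT
```

The translated excerpt matches the first twenty residues of the protein sequence above. Passing the full gene sequence to `gc_percent` would give a value to compare with the reported GC content of 54%.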