Gene SeD_A1640 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeD_A1640 
Symbol 
ID6872874 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Dublin str. CT_02021853 
KingdomBacteria 
Replicon accessionNC_011205 
Strand
Start bp1583278 
End bp1584927 
Gene Length1650 bp 
Protein Length549 aa 
Translation table11 
GC content54% 
IMG OID642784784 
Productpeptide transport periplasmic protein SapA 
Protein accessionYP_002215452 
Protein GI198243286 
COG category[E] Amino acid transport and metabolism 
COG ID[COG4166] ABC-type oligopeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value0.00000341731 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCGCCTGG TTTTATCGTC TCTGATCGTG ATAGCGGGTC TACTGAGTAG TCAGGCTACG 
GCTGCGGCTG CGCCCGAACA AACTGCGAGT GCAGATATTC GCGATAGCGG CTTTGTGTAT
TGTGTCAGCG GGCAGGTCAA CACCTTTAAT CCGCAAAAAG CGAGCAGCGG CCTCATCGTC
GATACCCTGG CAGCCCAGTT ATATGACCGC CTGTTGGATG TCGATCCCTA TACTTATCGT
TTAGTCCCAG AGCTGGCAGA AAGCTGGGAA GTGCTGGATA ACGGGGCAAC GTACCGTTTT
CACCTGCGCC GCGACGTTTC CTTTCAAAAA ACCGCCTGGT TTACGCCGAC CCGAAAACTC
AATGCTGATG ATGTCGTCTT TACCTTTCAG CGGATTTTTG ATCGTCGGCA TCCGTGGCAT
AACATCAACG GCAGTAGCTT CCCCTACTTT GATAGCCTAC AGTTCGCCGA CAATGTAAAA
AGCGTGCGTA AGCTGGACAA TAACACCGTT GAGTTTCGCC TGACGCAGCC AGACGCCTCC
TTTTTATGGC ATCTGGCTAC ACACTACGCT TCCGTCATGT CCGCTGAGTA CGCCGCGCAG
CTTAGCCGAA AAGATCGTCA GGAACTGCTA GACCGCCAAC CGGTCGGCAC CGGGCCTTTC
CAGCTTTCGG AGTACCGTGC CGGACAGTTT ATTCGTCTCC AGCGCCACGA TGGGTTTTGG
CGCGGCAAAC CGCTGATGCC GCAAGTGGTA GTGGATTTAG GCTCCGGCGG TACCGGGCGT
TTATCGAAAT TACTGACCGG TGAATGCGAT GTTCTGGCCT GGCCCGCCGC CAGCCAGCTA
ACTATTTTAC GCGACGATCC GCGTTTACGT CTGACGTTGC GCCCGGGGAT GAATATCGCC
TATCTGGCCT TTAACACCGA TAAGCCGCCG TTGAATAATC CCGCAGTGCG CCATGCGCTG
GCCTTATCGA TCAACAACCA GCGTCTGATG CAGTCGATTT ATTACGGCAC GGCGGAAACC
GCAGCCTCCA TTTTACCGAG AGCCTCATGG GCTTACGATA ACGATGCCAA AATTACGGAG
TACAATCCGG AAAAATCGCG CGAACAGCTA AAAGCGCTGG GCATTGAGAA TCTTACGCTG
CATCTCTGGG TGCCGACCAG TTCTCAGGCC TGGAACCCAA GTCCGCTAAA AACGGCGGAG
CTTATTCAGG CGGATATGGC GCAGGTTGGC GTAAAAGTGG TCATTGTGCC GGTTGAAGGT
CGTTTTCAGG AGGCGCGCCT GATGGATATG AATCACGATC TGACCTTATC CGGCTGGGCC
ACGGACAGCA ACGATCCGGA TAGCTTTTTC AGACCACTGT TAAGCTGTGC GGCCATCAAT
TCGCAAACCA ATTTCGCCCA CTGGTGTAAC CCTGAATTTG ACAGCGTGCT GCGTAAAGCA
CTGTCGTCGC AGCAGTTGGC TTCGCGCATA GAAGCATATG AGGAAGCGCA GAATATCCTG
GAGAAAGAGC TGCCGATACT GCCGCTGGCA TCATCACTAC GCCTGCAGGC TTACCGCTAC
GATATTAAAG GGCTGGTGTT AAGCCCGTTC GGCAATGCGT CTTTTGCCGG CGTCTCCCGC
GAAAAACACG AAGAGGTGAA AAAACCATGA
 
Protein sequence
MRLVLSSLIV IAGLLSSQAT AAAAPEQTAS ADIRDSGFVY CVSGQVNTFN PQKASSGLIV 
DTLAAQLYDR LLDVDPYTYR LVPELAESWE VLDNGATYRF HLRRDVSFQK TAWFTPTRKL
NADDVVFTFQ RIFDRRHPWH NINGSSFPYF DSLQFADNVK SVRKLDNNTV EFRLTQPDAS
FLWHLATHYA SVMSAEYAAQ LSRKDRQELL DRQPVGTGPF QLSEYRAGQF IRLQRHDGFW
RGKPLMPQVV VDLGSGGTGR LSKLLTGECD VLAWPAASQL TILRDDPRLR LTLRPGMNIA
YLAFNTDKPP LNNPAVRHAL ALSINNQRLM QSIYYGTAET AASILPRASW AYDNDAKITE
YNPEKSREQL KALGIENLTL HLWVPTSSQA WNPSPLKTAE LIQADMAQVG VKVVIVPVEG
RFQEARLMDM NHDLTLSGWA TDSNDPDSFF RPLLSCAAIN SQTNFAHWCN PEFDSVLRKA
LSSQQLASRI EAYEEAQNIL EKELPILPLA SSLRLQAYRY DIKGLVLSPF GNASFAGVSR
EKHEEVKKP