Gene SNSL254_A1747 details

Gene Information

Locus tag: SNSL254_A1747
Symbol:
ID: 6484831
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Newport str. SL254
Kingdom: Bacteria
Replicon accession: NC_011080
Strand:
Start bp: 1717646
End bp: 1718872
Gene Length: 1227 bp
Protein Length: 408 aa
Translation table: 11
GC content: 39%
IMG OID: 642737127
Product: secreted effector protein
Protein accession: YP_002040879
Protein GI: 194442907
COG category: [I] Lipid transport and metabolism; [R] General function prediction only
COG ID: [COG3240] Phospholipase/lecithinase/hemolysin
TIGRFAM ID:
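
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based inclusive coordinates and that the reported protein length excludes the stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1717646
end_bp = 1718872
gene_length_bp = 1227
protein_length_aa = 408

# Assumption: 1-based inclusive coordinates, so length = end - start + 1.
span = end_bp - start_bp + 1

# A coding sequence of N bases holds N / 3 codons; the final codon is the
# stop codon, which is assumed not to count toward the protein length.
codons, remainder = divmod(span, 3)
expected_protein_length = codons - 1

print("span matches gene length:", span == gene_length_bp)
print("length is a multiple of 3:", remainder == 0)
print("protein length matches:", expected_protein_length == protein_length_aa)
```

Under these assumptions the three fields agree with one another.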


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.343873
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 58
Fosmid unclonability p-value: 0.229892
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCATTGA GTGTTGGACA GGGTTATTTC ACATCATCTA TCAGTTCTGA AAAATTTAAT 
GCGATAAAAG AAAGCGTACG CCTTCCGGAA TTAAGTTTAT GGGAGAAAAT CAAAGCATAT
TTCTTTACCA CCCACCATGC AGAGGCGCTC GAATGTATCT TTAATCTTTA CCACCATCAG
GAACTGAATC TAACACCGGT ACAGGTTCGC GGAGCCTACA TCAAACTTCG AGCCTTAGCG
TCTCAGGGAT GTAAAGAACA GTTTATTATA GAATCACAGG AACACGCCGA TAAGTTGATT
ATTAAAGATG ATAATGGTGA AAATATTTTA TCTATTGAGG TTGAATGTCA TCCGGAAGCT
TTTGGTCTTG CCAAAGAAAT CAATAAATCA CATCCCAAGC CCAAAAATAT TTCTTTGGGT
GATATTACCA GACTGGTATT TTTTGGCGAC AGCTTGTCTG ACTCCTTAGG GCGTATGTTT
GAAAAAACAC ATCATATCTT ACCCTCCTAT GGTCAATACT TTGGCGGAAG GTTTACTAAT
GGATTTACCT GGACTGAGTT TTTATCATCT CCACACTTCT TAGGTAAAGA GATGCTTAAT
TTTGCTGAAG GGGGAAGTAC ATCGGCAAGC TATTCCTGCT TTAATTGCAT CGGTGACTTT
GTATCAAATA CGGACAGACA AGTCGCATCT TACACCCCTT CTCACCAGGA CCTGGCGATA
TTTTTATTGG GGGCTAATGA CTATATGACA CTACACAAAG ATAATGTAAT AATGGTCGTT
GAGCAACAAA TTGATGATAT TGAAAAAATA ATTTCCGGTG GAGTTAATAA TGTTCTGGTC
ATGGGGATTC CCGATTTGTC TTTAACACCC TATGGCAAGC ATTCTGATGA AAAAAGAAAA
CTTAAGGATG AAAGCATCGC TCACAATGCC CTGCTAAAAA CTAATGTTGA AGAATTAAAA
GAAAAATACC CCCAGCATAA AATATGCTAT TACGAGACTG CCGATGCATT TAAGGTGATA
ATGGAGGCGG CCAGTAATAT TGGTTATGAT ACGGAAAACC CTTATACTCA CCACGGCTAT
GTACATGTTC CCGGGGCTAA AGACCCTCAG CTAGATATAT GTCCGCAATA CGTCTTCAAC
GACCTTGTCC ATCCAACCCA GGAAGTCCAT CATTGTTTTG CCATAATGTT AGAAAGTTTT
ATAGCTCATC ATTATTCCAC TGAATAA
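
The GC content field reported in this record can in principle be recomputed from the gene sequence. The sketch below shows one way to do so, assuming GC content is defined as the fraction of G and C bases over all bases; `gc_fraction` is a hypothetical helper name, and only the first displayed line of the sequence is used here for brevity.

```python
def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases, ignoring whitespace."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


# First line of the gene sequence, in blocks of 10 bases as displayed.
first_line = "ATGCCATTGA GTGTTGGACA GGGTTATTTC ACATCATCTA TCAGTTCTGA AAAATTTAAT"
print(f"GC fraction of the first 60 bases: {gc_fraction(first_line):.1%}")
```

Passing the full 1227-base sequence to the same function would give the gene-wide value to compare with the reported figure.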
 
Protein sequence
MPLSVGQGYF TSSISSEKFN AIKESVRLPE LSLWEKIKAY FFTTHHAEAL ECIFNLYHHQ 
ELNLTPVQVR GAYIKLRALA SQGCKEQFII ESQEHADKLI IKDDNGENIL SIEVECHPEA
FGLAKEINKS HPKPKNISLG DITRLVFFGD SLSDSLGRMF EKTHHILPSY GQYFGGRFTN
GFTWTEFLSS PHFLGKEMLN FAEGGSTSAS YSCFNCIGDF VSNTDRQVAS YTPSHQDLAI
FLLGANDYMT LHKDNVIMVV EQQIDDIEKI ISGGVNNVLV MGIPDLSLTP YGKHSDEKRK
LKDESIAHNA LLKTNVEELK EKYPQHKICY YETADAFKVI MEAASNIGYD TENPYTHHGY
VHVPGAKDPQ LDICPQYVFN DLVHPTQEVH HCFAIMLESF IAHHYSTE
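
The protein sequence can be checked against the gene sequence by translating the latter codon by codon. The sketch below is a minimal illustration, not the database's own procedure: it uses the standard codon assignments, which translation table 11 shares for elongation codons, and it does not handle the alternative start codons that table 11 permits (the gene here starts with ATG, so that does not matter). The names `CODON_TABLE` and `translate` are illustrative.

```python
BASES = "TCAG"
# Amino acids for the 64 codons enumerated in TCAG order
# (first base varies slowest, third base fastest); "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 and last 27 bases of the gene sequence; both are in frame
# because the last displayed line starts at base 1201.
gene_start = "ATGCCATTGA GTGTTGGACA GGGTTATTTC"
gene_end = "ATAGCTCATC ATTATTCCAC TGAATAA"
print(translate(gene_start))  # MPLSVGQGYF
print(translate(gene_end))    # IAHHYSTE
```

Both fragments reproduce the corresponding ends of the protein sequence listed above.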