Gene SNSL254_A0476 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSNSL254_A0476 
SymbolphnS 
ID6482449 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Newport str. SL254 
KingdomBacteria 
Replicon accessionNC_011080 
Strand
Start bp486545 
End bp487558 
Gene Length1014 bp 
Protein Length337 aa 
Translation table11 
GC content57% 
IMG OID642735897 
Product2-aminoethylphosphonate ABC transporter 2-aminoethylphosphonate binding protein 
Protein accessionYP_002039671 
Protein GI194443954 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1840] ABC-type Fe3+ transport system, periplasmic component 
TIGRFAM ID[TIGR03227] 2-aminoethylphosphonate ABC transporter, periplasmic 2-aminoethylphosphonate binding protein 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones64 
Fosmid unclonability p-value0.547307 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAACTTT CCCGACTTGC TCTGCTGTCT GTCTTCGCTC TCGCCAGCGC CCCGTCATGG 
GCGGAATCGG TGGTCACGGT GTACTCCATC GACGGGCTGC ACGATGGCGA TAACAGCTGG
TACCAGGTGC AGTTTGACGC GTTCACCAAA GCGACCGGCA TTACCGTACG CTATGTTGAA
GGCGGTGGTG GCGTGGTAGT GGAACGTCTG GCAAAAGAGC ATACGAATCC GCAGGCCGAC
GTGCTGGTAA CCGCGCCGCC ATTCATTCAG CGCGCCGCCG CCGAAAAGCT GCTGGCGAAC
TTTAACACCG ACGCCGCATC GGCTATCCCC GATGCCAACA ACCTTTATTC GCCGCTGGTA
AAGAACTATC TGAGCTTTAT CTACAACAGC AAGCTGCTGA AAACTGCCCC GGCGAGCTGG
CAGGATCTGC TTGACGGTAA CTTCAAAAAT AAACTCCAGT ATTCCACGCC AGGTCAGGCC
GCTGACGGCA CGGCGGTGAT GCTGCAGGCT TTCCACAGCT TCGGCAGTAA AGATGCCGGT
TTTGCGTATC TCGGCAAGCT GCAGGCCAAT AACGTCGGGC CATCTGCCTC TACCGGCAAG
CTAACCGCGC TGGTTAATAA AGGTGAAATC TACGTCGCTA ACGGCGACCT GCAAATGAAC
CTCGCGCAGA TGGAACGTAA CCCGAACGTG AAAATCTTCT GGCCGGCCAA CGACAAAGGC
GAGCGCAGCG CGCTGGCCAT CCCTTATGTC ATTGGCCTGG TCCAGGGGGC GCCGCAGAGT
GAAAATGGTA AAAAGCTGAT TAACTTCCTG CTGAGTAAAG AAGCGCAGAC TCGCGTCAGC
GAACTCTCCT GGGGAATGCC GGTACGCAGC GACGTGACGC CGAGCGACGA ACATTACAAG
ACCGCCACTG CCGCGTTAGA AGGCGTGCAG AGCTGGCAGC CAAATTGGGA TGACGTAGCC
GTTTCGCTGT CGGCAGATAT TAGCCGTTGG CACAAAGTGA CCGAAAGCGA GTAA
 
Protein sequence
MKLSRLALLS VFALASAPSW AESVVTVYSI DGLHDGDNSW YQVQFDAFTK ATGITVRYVE 
GGGGVVVERL AKEHTNPQAD VLVTAPPFIQ RAAAEKLLAN FNTDAASAIP DANNLYSPLV
KNYLSFIYNS KLLKTAPASW QDLLDGNFKN KLQYSTPGQA ADGTAVMLQA FHSFGSKDAG
FAYLGKLQAN NVGPSASTGK LTALVNKGEI YVANGDLQMN LAQMERNPNV KIFWPANDKG
ERSALAIPYV IGLVQGAPQS ENGKKLINFL LSKEAQTRVS ELSWGMPVRS DVTPSDEHYK
TATAALEGVQ SWQPNWDDVA VSLSADISRW HKVTESE