Gene SeD_A0470 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeD_A0470 
SymbolphnS 
ID6871560 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Dublin str. CT_02021853 
KingdomBacteria 
Replicon accessionNC_011205 
Strand
Start bp485462 
End bp486475 
Gene Length1014 bp 
Protein Length337 aa 
Translation table11 
GC content57% 
IMG OID642783696 
Product2-aminoethylphosphonate ABC transporter 2-aminoethylphosphonate binding protein 
Protein accessionYP_002214383 
Protein GI198241864 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1840] ABC-type Fe3+ transport system, periplasmic component 
TIGRFAM ID[TIGR03227] 2-aminoethylphosphonate ABC transporter, periplasmic 2-aminoethylphosphonate binding protein 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.301651 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones95 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAACTTT CCCGACTTGC TCTGCTGTCT GTCTTCGCTC TCGCCAGCGC CCCGTCATGG 
GCGGAATCGG TGGTCACGGT GTACTCCATC GACGGGCTGC ACGATGGCGA TAACAGCTGG
TACCAGGTGC AGTTTGACGC ATTCACCAAA GCGACCGGCA TTACCGTACG CTATGTTGAA
GGCGGTGGTG GCGTGGTAGT GGAACGTCTG GCAAAAGAGC GCACGAATCC ACAGGCCGAC
GTGCTGGTAA CCGCGCCGCC ATTCATTCAG CGCGCCGCCG CCAAAAAGCT GCTGGCGAAC
TTTAACACCG ACGCCGCATC GGCTATCCCC GATGCCAACA ACCTTTATTC GCCGCTGGTA
AAGAACTATC TGAGCTTTAT CTACAACAGC AAGCTGCTGA AAACTGCCCC GGCGAGCTGG
CAGGATCTGC TTGACGGTAA ATTCAAAAAT AAACTCCAGT ATTCCACGCC AGGTCAGGCC
GCTGACGGCA CGGCGGTGAT GCTGCAGGCT TTCCACAGCT TCGGCAGTAA AGATGCCGGT
TTTGCGTATC TCGGCAAGCT GCAGGCCAAT AACGTCGGGC CATCTGCCTC TACCGGCAAG
CTAACCGCGC TGGTTAATAA AGGTGAAATC TACGTCGCTA ACGGCGACCT GCAAATGAAC
CTCGCGCAGA TGGAACGTAA CCCGAACGTG AAAATCTTCT GGCCGGCCAA CGACAAAGGC
GAGCGCAGCG CGCTGGCCAT CCCTTATGTC ATTGGCCTGG TCCAGGGGGC GCCGCAGAGT
GAAAATGGTA AAAAGCTGAT TAACTTCCTG CTGAGTAAAG AAGCGCAGAC TCGCGTCAGC
GAACTCTCCT GGGGAATGCC AGTACGCAGC GACGTGACGC CGAGCGACGA ACATTACAAG
GCCGCCACTG CCGCGTTAGA AGGCGTGCAG AGCTGGCAGC CAAATTGGGA TGACGTAGCC
GTTTCGCTGT CGGCAGATAT TAGCCGTTGG CACAAAGTGA CCGAAAGCGA GTAA
 
Protein sequence
MKLSRLALLS VFALASAPSW AESVVTVYSI DGLHDGDNSW YQVQFDAFTK ATGITVRYVE 
GGGGVVVERL AKERTNPQAD VLVTAPPFIQ RAAAKKLLAN FNTDAASAIP DANNLYSPLV
KNYLSFIYNS KLLKTAPASW QDLLDGKFKN KLQYSTPGQA ADGTAVMLQA FHSFGSKDAG
FAYLGKLQAN NVGPSASTGK LTALVNKGEI YVANGDLQMN LAQMERNPNV KIFWPANDKG
ERSALAIPYV IGLVQGAPQS ENGKKLINFL LSKEAQTRVS ELSWGMPVRS DVTPSDEHYK
AATAALEGVQ SWQPNWDDVA VSLSADISRW HKVTESE