Gene SNSL254_A3066 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSNSL254_A3066 
Symbol 
ID6485699 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Newport str. SL254 
KingdomBacteria 
Replicon accessionNC_011080 
Strand
Start bp2982872 
End bp2983789 
Gene Length918 bp 
Protein Length305 aa 
Translation table11 
GC content54% 
IMG OID642738381 
Productperiplasmic chelated iron-binding protein YfeA 
Protein accessionYP_002042105 
Protein GI194446797 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0803] ABC-type metal ion transport system, periplasmic component/surface adhesin 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.354258 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones54 
Fosmid unclonability p-value0.0719754 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGAATC TACATCGTCT GAAAACACTC CTGATTGCCG GTATTGTCGC GATACTCGCG 
CTATCGCCAG CCTATGCAAA AGAGAAATTT AAAGTCATCA CCACGTTTAC CGTTATTGCC
GACATGGCGA AAAACGTGGC GGGCGACGCA GCGGAAGTCA GCTCAATTAC CAAGCCCGGC
GCTGAAATCC ATGAGTATCA GCCAACGCCC GGCGATATTA AACGAGCGCA GGGGGCACAG
CTTATCCTCG CGAATGGTCT GAACCTGGAG CGATGGTTCG CCCGCTTTTA TCAGCACCTT
TCCGGCGTGC CGGAAGTCGT CGTCTCCACC GGTGTCAAAC CGATGGGCAT TACCGAAGGC
CCGTATAACG GTAAACCGAA CCCGCACGCC TGGATGTCGG CAGAAAACGC GCTGATTTAT
GTCGATAACA TTCGCGACGC CCTGGTGAAG TACGATCCGG ATAATGCGCA GATCTATAAG
CAAAACGCCG AACGCTATAA AGCGAAAATT CGCCAGATGG CCGATCCGTT GCGTGCCGAA
CTGGAAAAAA TTCCCGCCGA TCAGCGCTGG CTGGTCACCA GTGAAGGCGC GTTCTCTTAC
CTGGCGCGCG ATAACGACAT GAAAGAGCTT TATCTCTGGC CAATTAACGC CGATCAACAG
GGGACGCCAA AACAGGTGCG TAAAGTGATT GATACCATTA AAAAGCACCA TATTCCCGCC
ATCTTTAGCG AGAGTACGGT TTCCGATAAA CCGGCCCGTC AGGTCGCGCG TGAATCCGGC
GCGCATTATG GCGGCGTACT GTATGTCGAT TCTCTGAGCG CCGCTGACGG CCCTGTGCCA
ACCTATCTGG ATCTGCTGCG CGTCACGACC GAAACCATCG TCAACGGCAT TAACGACGGA
CTGAGGAGTC AACAATGA
 
Protein sequence
MTNLHRLKTL LIAGIVAILA LSPAYAKEKF KVITTFTVIA DMAKNVAGDA AEVSSITKPG 
AEIHEYQPTP GDIKRAQGAQ LILANGLNLE RWFARFYQHL SGVPEVVVST GVKPMGITEG
PYNGKPNPHA WMSAENALIY VDNIRDALVK YDPDNAQIYK QNAERYKAKI RQMADPLRAE
LEKIPADQRW LVTSEGAFSY LARDNDMKEL YLWPINADQQ GTPKQVRKVI DTIKKHHIPA
IFSESTVSDK PARQVARESG AHYGGVLYVD SLSAADGPVP TYLDLLRVTT ETIVNGINDG
LRSQQ