Gene SNSL254_A4689 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSNSL254_A4689 
Symbol 
ID6482285 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Newport str. SL254 
KingdomBacteria 
Replicon accessionNC_011080 
Strand
Start bp4572666 
End bp4573928 
Gene Length1263 bp 
Protein Length420 aa 
Translation table11 
GC content51% 
IMG OID642739907 
Productinner membrane protein YjeH 
Protein accessionYP_002043588 
Protein GI194442613 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0531] Amino acid transporters 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones81 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGCCGG ATAATGAGCT GATGAATGAA CTAAAAAAAG AGCTTGGGTT GGTTCAGGGC 
GTTATCTTGT TAACCACATC ATTATTAGGG ACTGGCGTTT TTGCGCTTCC GGAGCTGGCT
GCGCTGGCGG CAGGCGACAT CAGCCTTTGG GCATGGCCGC TGCTCATTAT TTTGATTTTC
CCTATTGCGA TTGTCTTTGC TGTGCTGGGG CGTCACTTTC CACACGCTGG CGGCGTAGCG
CATTTTGTCG GGATGGCATT TGGCCCGCGT CTGCAACGAG TGATCAGTTG GCTGTTTTTA
TCAGTCATTC CTGTTAGTTT TCCCGCAGCC TTGCATATTG CGGTGGGTTT CGGCCAGGCG
CTGTTTGGCT GGCAAAGCGA ACAGCTCTTA TTTGGGGAGC TGGGTACTCT AGGTTTACTA
TGGTTTATGG GATCGCGCGG CGCCAGCTCC AGCGCCAACC TGCAAGCCAT TATCGCTGGA
TTAATTATCG CGCTTATTGC CGCCATTCTG TGGAAAGGCG CGATTAAACC TGCAGATATC
ACCTTCCCAG CCGCAAACGA AATCACCTTT TCCCGGCTGT GTACCGCCCT GGCGATCATG
TTCTGGGGAT TTGTGGGAAT TGAGGCTTTT ACGCATTTGT CGTCTGAATT TAAAAATCCT
GAACGTGATT TTCCGCGCGC ATTGATCATT GGCCTGATGC TGGCCGGCTC CATTTATTGG
ACCTGTACCG CCGTGGTGCT GCATTTTGGC GTCTATAGCG ACAAGATAGC GGCAACAGCA
TCGCTACCGC TTATTATTGT TCATCTCTTC GGTATCCAGG CGTTGTGGAT AGCCTGCATT
ATTGGTTATC TCACCTGCTT TGCCAGCCTG AATGTGTATG CTCAGAGTTT TGCGCGTCTG
ATATGGACGC AAATGCAATA TCAGCCCGAT CACTATCTGG CTCAACTCTC TCCCGGGCGC
CTTCCCTTGC ACGCGTTAAA CGTTATTCTG GCCTGTTGTT GCGTGAGTTC CCTGGTCGTC
TACGCCCTGA AGATTAACCT CAATGCGCTG ATCGTCTATG CTAACGGTAT TTTTATTATG
CTCTATCTGC TTTGTATGCT GGCGGGCTGT CGACTATTGA AAGGGCGCTG CTATGCACTG
GCGGTGACGG GTTGTCTACT GTGCCTGTTA TTGCTGGTAA TGCTGGGGTG GAAAAGCTTA
TATGCCATTA TCATGCTGGC AGCATTATGG CTGTTTTTAC CGAAGCGAAA ACGCATGGCG
TAA
 
Protein sequence
MPPDNELMNE LKKELGLVQG VILLTTSLLG TGVFALPELA ALAAGDISLW AWPLLIILIF 
PIAIVFAVLG RHFPHAGGVA HFVGMAFGPR LQRVISWLFL SVIPVSFPAA LHIAVGFGQA
LFGWQSEQLL FGELGTLGLL WFMGSRGASS SANLQAIIAG LIIALIAAIL WKGAIKPADI
TFPAANEITF SRLCTALAIM FWGFVGIEAF THLSSEFKNP ERDFPRALII GLMLAGSIYW
TCTAVVLHFG VYSDKIAATA SLPLIIVHLF GIQALWIACI IGYLTCFASL NVYAQSFARL
IWTQMQYQPD HYLAQLSPGR LPLHALNVIL ACCCVSSLVV YALKINLNAL IVYANGIFIM
LYLLCMLAGC RLLKGRCYAL AVTGCLLCLL LLVMLGWKSL YAIIMLAALW LFLPKRKRMA