Gene SNSL254_A2018 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSNSL254_A2018 
Symbol 
ID6482576 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Newport str. SL254 
KingdomBacteria 
Replicon accessionNC_011080 
Strand
Start bp1964514 
End bp1965428 
Gene Length915 bp 
Protein Length304 aa 
Translation table11 
GC content41% 
IMG OID642737377 
Producthypothetical protein 
Protein accessionYP_002041127 
Protein GI194442542 
COG category[E] Amino acid transport and metabolism
[G] Carbohydrate transport and metabolism
[R] General function prediction only 
COG ID[COG0697] Permeases of the drug/metabolite transporter (DMT) superfamily 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones45 
Fosmid unclonability p-value0.00230313 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCGCAAGG TGTCGATATC TATATTATTC ATGCTAGTTT CACTGACTTG GGGGACTACA 
TGGTTGGCCA TGCGAATTGC CGTTGAAACT ATCCCTCCAG TGTTTGCAAC CGGAATGAGG
TTTATGTTTG CAGCACCTTT TTTAATCATC ATTGCATGGT TAAGAAAAAA AACACTGTTG
TTTCCTCCCG GACAACGATT ATTCCAGTTT GTGATATGCA TCTTTTATTT TTGCATTCCT
TTCTCACTAA TGATTTATGG TGAAACCTAT GTCAACTCTG GGCTTGCTGC CATTATCTTT
GCGAATATGC CTGTGGCCGT TTTGATAGCA TCGGTTTTGT TTCTAAATGA AAAAGCGAAA
CTAATGCAGA TCGCGGGCTT AACTATCGCA ATCACTGCAT TGACGGGGAT ACTTCTTGAA
GAAACGAATA CAAGTACAGA GAGTCACTGG CAGGGTATCA CTGCGCTTAT TTCTGCTGTG
TTAATCCATG CCATAATATA TACACAATGT AAGAAAAGAA GTTGTACTGT CTCTGTTATC
ACATTTAATG CGCTCCCGTG CCTTTTAGCT GGGTTGATAC TTTCTGCGAC AGGATGGTTT
TTTGAAAGAC CACAAGTATC AACCTTCTCA GTACACTCAA TATTAGCTAC CCTGTATCTC
GGGGCTTTTG CCGGAGTTTT TGGTATCCTG TGCTACTTTG CGCTTCAGCA AAAGGCTAAT
GCCTTCCAGG CTTCGCTTGT ATTTCTTATC TTTCCGCTGA TTGCGGTAAG TCTGGAAGAC
TATATTTATG GATATGCTAT TTCAACACAC TCAATGCTGC TTATTATACC ATTAGTTATC
GGGATATTTC TTACTCTTGT CGCCAGAAAT ATTCCTGTAA CCAGCAGATG CCGGGATAAC
TCATCACAGA AATAA
 
Protein sequence
MRKVSISILF MLVSLTWGTT WLAMRIAVET IPPVFATGMR FMFAAPFLII IAWLRKKTLL 
FPPGQRLFQF VICIFYFCIP FSLMIYGETY VNSGLAAIIF ANMPVAVLIA SVLFLNEKAK
LMQIAGLTIA ITALTGILLE ETNTSTESHW QGITALISAV LIHAIIYTQC KKRSCTVSVI
TFNALPCLLA GLILSATGWF FERPQVSTFS VHSILATLYL GAFAGVFGIL CYFALQQKAN
AFQASLVFLI FPLIAVSLED YIYGYAISTH SMLLIIPLVI GIFLTLVARN IPVTSRCRDN
SSQK