Gene SNSL254_A4333 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSNSL254_A4333 
Symbol 
ID6484788 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Newport str. SL254 
KingdomBacteria 
Replicon accessionNC_011080 
Strand
Start bp4221215 
End bp4222249 
Gene Length1035 bp 
Protein Length344 aa 
Translation table11 
GC content53% 
IMG OID642739577 
Productrhamnose-proton symporter 
Protein accessionYP_002043271 
Protein GI194446545 
COG category 
COG ID 
TIGRFAM ID[TIGR00776] RhaT L-rhamnose-proton symporter family protein 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones50 
Fosmid unclonability p-value0.0224713 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTAACG CGATTACGAT GGGTATTTTC TGGCATTTGA TAGGGGCGGC CAGTGCAGCC 
TGCTTCTATG CCCCGTTCAA GCAAGTGAAA CAGTGGTCAT GGGAAACCAT GTGGTCAGTG
GGCGGCATCG TCTCATGGCT TATTCTGCCG TGGACAATTA GCGCTCTGTT ACTGCCTGAT
TTCTGGGCCT ATTATGGGCA GTTTAACCTC TCCACCCTTT TACCGGTTTT TCTGTTCGGC
GCCATGTGGG GCATCGGCAA TATTAATTAC GGTCTAACCA TGCGTTATCT CGGGATGTCG
ATGGGTATCG GCATCGCTAT CGGCATTACG CTTATCGTCG GCACGCTGAT GACGCCTATC
ATCAACGGTA ACTTCGATGT GTTAATCCAT ACCGAAGGGG GACGCATGAC GCTACTTGGC
GTTTTTGTCG CGCTGATCGG CGTCGGGATT GTGACGCGCG CCGGACAGTT AAAAGAACGC
AAAATGGGCA TTAAAGCGGA GGAGTTCAAT CTGAAGAAAG GGCTTCTGCT GGCAGTGATG
TGCGGTATTT TCTCGGCGGG GATGTCTTTT GCCATGAACG CCGCGAAACC GATGCATGAA
GCTGCTGCCG CGCTTGGCGT TGACCCGCTC TATGTCGCGC TGCCGAGTTA CGTGGTGATT
ATGGGCGGCG GCGCGCTGGT GAACCTCGGT TTCTGTTTTA TCCGCCTGGC AAAAGTGCAA
AATCTGTCGA TAAAAGCCGA CTTCTCGCTG GCAAGACCGT TGATTATCAG CAATATTCTG
TTGTCCGCGC TTGGCGGTCT GATGTGGTAT TTACAGTTCT TTTTCTATGC CTGGGGTCAC
GCGCGCATTC CCGCGCAATA TGACTACATG AGCTGGATGC TGCACATGAG CTTCTATGTG
CTGTGCGGGG GGCTTGTCGG TCTGGTGCTA AAAGAGTGGA AAAATGCTGG CCGCCGTCCC
GTTGCCGTAT TAAGCCTCGG CTGCGTGGTA ATTATTATCG CGGCGAATAT TGTCGGTTTA
GGCATGGCCA GTTAA
 
Protein sequence
MSNAITMGIF WHLIGAASAA CFYAPFKQVK QWSWETMWSV GGIVSWLILP WTISALLLPD 
FWAYYGQFNL STLLPVFLFG AMWGIGNINY GLTMRYLGMS MGIGIAIGIT LIVGTLMTPI
INGNFDVLIH TEGGRMTLLG VFVALIGVGI VTRAGQLKER KMGIKAEEFN LKKGLLLAVM
CGIFSAGMSF AMNAAKPMHE AAAALGVDPL YVALPSYVVI MGGGALVNLG FCFIRLAKVQ
NLSIKADFSL ARPLIISNIL LSALGGLMWY LQFFFYAWGH ARIPAQYDYM SWMLHMSFYV
LCGGLVGLVL KEWKNAGRRP VAVLSLGCVV IIIAANIVGL GMAS