Gene SNSL254_A4030 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSNSL254_A4030 
Symbol 
ID6482870 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Newport str. SL254 
KingdomBacteria 
Replicon accessionNC_011080 
Strand
Start bp3920056 
End bp3921438 
Gene Length1383 bp 
Protein Length460 aa 
Translation table11 
GC content55% 
IMG OID642739289 
Productputative transporter 
Protein accessionYP_002042999 
Protein GI194444395 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2211] Na+/melibiose symporter and related transporters 
TIGRFAM ID[TIGR00792] sugar (Glycoside-Pentoside-Hexuronide) transporter 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones68 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAGTG AAATTCTCTC CGTTAAGGAG AAGATTGGCT ATGGTATGGG CGATGCCGCC 
AGCCACATCA TCTTTGATAA TGTCATGTTA TATATGATGT TCTTCTATAC CGATATTTTC
GGTATTCCCG CTGGTTTTGT CGGCACCATG TTTTTACTGG CGCGTGCGCT TGATGCCATC
TCCGACCCTT GCATGGGCCT GCTGGCCGAC CGCACCCGCT CTCGCTGGGG CAAATTCCGG
CCCTGGGTGC TGTTTGGCGC GTTGCCGTTT GGTATCGTTT GTGTGCTGGC TTATAGCACG
CCGGACCTCA GTCTGAACGG CAAAATGATT TATGCCGCCA TCACCTACAC GTTGCTCACC
CTTCTGTACA CTGTGGTCAA CATCCCTTAC TGCGCGTTGG GGGGTGTAAT AACCAATGAC
CCAACGCAGC GTATCTCCCT GCAATCCTGG CGCTTTGTGC TGGCAACGGC GGGCGGAATG
CTCTCTACCG TACTGATGAT GCCTCTGGTG AAACTGATTG GCGGCGAGAA TAAGGCGCTG
GGCTTCCAGG GGGGTATCGC GGCGCTCTCG GTGGTGGCGT TCCTGATGCT GGCGTTCTGC
TTCTTTACCA CCAAAGAGCG CGTTGAAGCG CCTGCCACCC ATACCTCCAT GCGTGAAGAC
CTGCGTGATA TCTGGCACAA CGACCAGTGG CGCATAGTCG GCCTGCTCAC CATCCTGAAT
ATTCTGGCGG TATGCGTGCG CGGCGGGGCG ATGATGTATT ACGTCACCTG GATATTGGGC
AAACCGGGCG TGTTTGTCGC CTTCCTCACC ACCTATTGTG TCGGCAACCT GATTGGCTCG
GCGCTGGCAA AACCGTTGAC CGACTGGAAA TGCAAAGTGA GCGTTTTCTG GTGGACCAAC
GCCTTACTCG CAGTAATCAG CGTGGCGATG TTCTTCGTAC CGATGCACGC CACGATCGCT
ATGTTCGTCT TTATCTTCGT GATTGGCGTA TTGCACCAGT TAGTCACGCC TATCCAGTGG
GTGATGATGT CTGACACCGT CGACTACGGC GAATGGTGTA ACGGCAAACG CCTGACGGGG
ATCAGTTTTG CCGGCACGTT GTTCGTGCTG AAACTGGGTC TTGCCCTCGG CGGGGCGCTG
ATTGGCTGGA TGCTGGCAGG CGGCGGTTAC GACGCGGCGG CGAAAACGCA AAACAGCGCC
ACGATCAGCA TCATCATCGC TCTGTTCACT ATCGTTCCGG CCATCTGTTA TCTGCTGAGC
GCCGCGATCG CTAAACGCTA CTACACCCTG AAAAGCCCGT TCCTGAAAAC CATTCTGGAA
CAACTGGCGC AGGGCGCACA CCGCAACGAA CAAGAATTTA CCCATAAAGA ATTGCAGAAC
TAA
 
Protein sequence
MKSEILSVKE KIGYGMGDAA SHIIFDNVML YMMFFYTDIF GIPAGFVGTM FLLARALDAI 
SDPCMGLLAD RTRSRWGKFR PWVLFGALPF GIVCVLAYST PDLSLNGKMI YAAITYTLLT
LLYTVVNIPY CALGGVITND PTQRISLQSW RFVLATAGGM LSTVLMMPLV KLIGGENKAL
GFQGGIAALS VVAFLMLAFC FFTTKERVEA PATHTSMRED LRDIWHNDQW RIVGLLTILN
ILAVCVRGGA MMYYVTWILG KPGVFVAFLT TYCVGNLIGS ALAKPLTDWK CKVSVFWWTN
ALLAVISVAM FFVPMHATIA MFVFIFVIGV LHQLVTPIQW VMMSDTVDYG EWCNGKRLTG
ISFAGTLFVL KLGLALGGAL IGWMLAGGGY DAAAKTQNSA TISIIIALFT IVPAICYLLS
AAIAKRYYTL KSPFLKTILE QLAQGAHRNE QEFTHKELQN