Gene SNSL254_A1481 details


Gene Information

Locus tag: SNSL254_A1481
Symbol:
ID: 6486945
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Newport str. SL254
Kingdom: Bacteria
Replicon accession: NC_011080
Strand:
Start bp: 1449957
End bp: 1451303
Gene Length: 1347 bp
Protein Length: 448 aa
Translation table: 11
GC content: 48%
IMG OID: 642736869
Product: dicarboxylate/amino acid:cation (Na+ or H+) symporter family transporter
Protein accession: YP_002040623
Protein GI: 194445853
COG category: [R] General function prediction only
COG ID: [COG1823] Predicted Na+/dicarboxylate symporter
TIGRFAM ID:
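
The length fields in this record follow from the coordinates by simple arithmetic. The sketch below reproduces them, assuming the Start bp and End bp values are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by a CDS, assuming its last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(1449957, 1451303)
print(length)                     # 1347
print(protein_length_aa(length))  # 448
```

Both results agree with the Gene Length (1347 bp) and Protein Length (448 aa) fields listed for this locus.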


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.175987
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 34
Fosmid unclonability p-value: 0.00000339976
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGGTTGTTA CGCTCGCGTA TATCGCACTT TTTTTGGTTT TCTCGTGGGT CATCTTGAGA 
ATTAATCAAA AAAGCGATTC TCTGTCGAAA AGCGTTTTTA TCGCTATCTT TTTAGGGGCT
GTTATTGGTT TATCCCTACA TTTTATTTCA GCAAATCACA CTAAAACTAT TATCGAATGG
TACAGCATCG TCGGCAATGG TTACGTCCAC CTGCTAAAAC TGGTCGCTAT ACCGCTAATT
TTTATTTCTA TTCTTTCCGC CATCAATAAA CTGGAAAATA GTGCCGGCAT CGGAAAAATG
TCGCTGACGA TCGTCGGATG CATGTTATGC CTGGTGATGG CTGCTGGTTT TATCGGATTA
CTGACCGCCC ATGTACTGGG GCTTGACGCC AGCGCTTTTG TGCACATGCC GTCAATGTTA
ACTACAGAAG AAGTCAATAA GACTGCCGCG GTGTCGATTC CCCAGTTAGT GACATCGCTG
ATTCCGACTA ATATTTTTCT TGATCTTACG GGGGCCAGAA GTGTTTCCGT TATCGGCATC
GTTATCTTCA CGCTAATAGC GGGGATCGCC CTGTTAAAGG TCAAAAAAGA GGCGCCGGAA
GAAGGTCAGA AATTAAGCGC AGGCATTAAC GCTATTCAGA TATGGGTCAT GAAGATGGTA
CGTATCGTTA TTGCATTAAC GCCCTATGGC GTCATGGCAT TAATGACTAC CGTATTTTCA
TCATACCACT TTGAACAATT CGCCAGCTTA CTTGGTTTTA TCGGTGCCTG TTACATCGCG
ATTTTTATGA TGTTTATCGT GCATGCCATC TTGCTGATCC TCAGCGGTAA TAATCCAGCG
CGTTATTTCA GAATGGTGTG GCCCGTCTTA ACCTTTGCGT TTGTTTCCCG CAGCAGCGCA
GCCTCTATCC CGCTGGCCAT TTCCGCCCAG GAAAAATTTG GCGTACAAAG CACCATTGCC
AATATTTCCG CCTCGTTTGG CTCCAGTATG GGGCAAAATG GCTGCGCCGG GATTTATCCG
GCTATTATGG TAGCGATGAT TGCGCCCACC ATCGGCATTG ATCCGCTCTC GCTGCATTTT
CTGGCCGCGA TGTTGCCTGC CATTGCGCTA GGGTCTATTG GCGTAGCCGG CGTCGGCGGC
GGTGGTACGT TCGCGGCGCT GATTGTCCTG TCGACGTTGA ATTTTCCCGT TGCGCTGGTC
GGTATTTTTA TTGCCATCGA ACCTATCGTT GATATGGCCC GCACGGCACT GAACGTTAAC
GGATCGATGA TGTCAGGTGT GCTGGCTAAC CGTATTTTGA ATAATCATAC GGCTGACGAC
ATGCCAGCGG TTATTGACAG ACCTTAG
 
Protein sequence
MVVTLAYIAL FLVFSWVILR INQKSDSLSK SVFIAIFLGA VIGLSLHFIS ANHTKTIIEW 
YSIVGNGYVH LLKLVAIPLI FISILSAINK LENSAGIGKM SLTIVGCMLC LVMAAGFIGL
LTAHVLGLDA SAFVHMPSML TTEEVNKTAA VSIPQLVTSL IPTNIFLDLT GARSVSVIGI
VIFTLIAGIA LLKVKKEAPE EGQKLSAGIN AIQIWVMKMV RIVIALTPYG VMALMTTVFS
SYHFEQFASL LGFIGACYIA IFMMFIVHAI LLILSGNNPA RYFRMVWPVL TFAFVSRSSA
ASIPLAISAQ EKFGVQSTIA NISASFGSSM GQNGCAGIYP AIMVAMIAPT IGIDPLSLHF
LAAMLPAIAL GSIGVAGVGG GGTFAALIVL STLNFPVALV GIFIAIEPIV DMARTALNVN
GSMMSGVLAN RILNNHTADD MPAVIDRP
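
The two sequence blocks above are printed in groups of ten characters, so they need light cleaning before any downstream use. The following is a minimal sketch of how one might strip the spacing, compute a GC fraction, and translate the gene sequence. It assumes that translation table 11 assigns the same amino acids to codons as the standard table (differing only in which start codons are allowed); the helper names are hypothetical.

```python
from itertools import product

# Codon-to-amino-acid mapping in NCBI order (bases T, C, A, G).
# Assumption: table 11 uses the same amino acid assignments as the standard table.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate whole codons from the first base, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First and last lines of the gene sequence shown above.
first_line = "ATGGTTGTTA CGCTCGCGTA TATCGCACTT TTTTTGGTTT TCTCGTGGGT CATCTTGAGA"
last_line = "ATGCCAGCGG TTATTGACAG ACCTTAG"
print(translate(first_line))  # MVVTLAYIALFLVFSWVILR
print(translate(last_line))   # MPAVIDRP
```

The two translated fragments match the start and end of the protein sequence listed above. Passing the full gene sequence to `gc_fraction` would allow a check against the GC content field.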