Gene SNSL254_A3950 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSNSL254_A3950 
Symbol 
ID6484608 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Newport str. SL254 
KingdomBacteria 
Replicon accessionNC_011080 
Strand
Start bp3831034 
End bp3832311 
Gene Length1278 bp 
Protein Length425 aa 
Translation table11 
GC content52% 
IMG OID642739210 
Product2,3-diketo-l-gulonate trap transporter large permease protein yian 
Protein accessionYP_002042920 
Protein GI194442924 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1593] TRAP-type C4-dicarboxylate transport system, large permease component 
TIGRFAM ID[TIGR00786] TRAP transporter, DctM subunit 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones66 
Fosmid unclonability p-value0.788142 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCGTGG TGATATTTCT CTGCTGCCTG CTCGGCGGGA TCGCGATAGG TTTACCCATC 
GCCTGGTCGC TGCTGCTTTG CGGCGCTGCT CTGATGACAT ACCTGGATAT GTTTGACGTG
CAGATTATGG CGCAAACCCT GGTTAACGGC GCGGACAGTT TCTCCCTACT GGCCATTCCC
TTTTTTGTTT TGGCCGGTGA AATCATGAAC GCGGGCGGCC TGTCAAAGCG AATTGTCGAC
CTGCCGATGA AGCTGGTCGG CCATAAGCCC GGCGGCCTGG GCTACGTGGG CGTTATTGCG
GCAATGATTA TGGCCAGCCT TTCCGGCTCT GCGGTAGCAG ATACCGCTGC GGTCGCCGCG
CTGCTGGTGC CGATGATGCG CTCCGCAAAC TACCCGATCA ACCGCTCCGT TGGGTTAATC
GCTTCCGGCG GGATCATTGC GCCAATTATT CCCCCCTCGA TTCCTTTTAT TATCTTCGGC
GTTTCCAGCG GCTTGTCGAT CAGCAAGCTG TTTATGGCCG GGATCGCACC GGGCATCATG
ATGGGCGCGG CGCTTATGCT CACCTGGTGG TGGCAGGCCG GGCGATTAAA TCTCCCTTCT
CAGCCTAAAG CAACACCGCG TGAAATCTGG CAATCATTGG TTTCAGGTAT CTGGGCGCTG
TTTTTACCGG TGATTATTAT CGGCGGCTTC CGTTCCGGAC TTTTCACGCC AACGGAGGCA
GGGGCGGTTG CCGCGTTTTA CGCCCTCTTT GTCGCCGTGG TTATCTATCG GGAATTAACG
TTTTCCAGTC TCTACCACGT GCTGGTCAAT GCCGCCAAAA CGACGTCAGT CGTCATGTTT
CTGGTGGCCG CGGCTCAGGT ATCCGCCTGG CTGATTACGA TCGCGGAATT ACCCATGATG
GTGTCAGATT TGCTGCAGCC GCTGGTCGAC TCTCCGCGAC TCTTATTTAT CGTCATTATG
ATCTCAATTA TGGTCGTCGG TATGGTGATG GACTTAACGC CAACGGTGTT AATTCTTACC
CCTGTATTAT TGCCATTAGT TAAAGAAGCC AATATTGACC CGATTTATTT CGGCGTCATG
TTCATTATTA ACTGCTCTAT TGGATTAATC ACACCGCCCG TTGGCAACGT CCTCAACGTT
ATTTCTGGGG TAGCAAAATT GAAATTTGAT GACGCGGTAA GAGGCGTATT CCCTTACGTT
GTCGTACTGA TGTCGCTGCT GGTTTTATTT ATTTTTATTC CCGAGCTAAT TATCACACCG
CTTAAATGGA TTAATTAA
 
Protein sequence
MAVVIFLCCL LGGIAIGLPI AWSLLLCGAA LMTYLDMFDV QIMAQTLVNG ADSFSLLAIP 
FFVLAGEIMN AGGLSKRIVD LPMKLVGHKP GGLGYVGVIA AMIMASLSGS AVADTAAVAA
LLVPMMRSAN YPINRSVGLI ASGGIIAPII PPSIPFIIFG VSSGLSISKL FMAGIAPGIM
MGAALMLTWW WQAGRLNLPS QPKATPREIW QSLVSGIWAL FLPVIIIGGF RSGLFTPTEA
GAVAAFYALF VAVVIYRELT FSSLYHVLVN AAKTTSVVMF LVAAAQVSAW LITIAELPMM
VSDLLQPLVD SPRLLFIVIM ISIMVVGMVM DLTPTVLILT PVLLPLVKEA NIDPIYFGVM
FIINCSIGLI TPPVGNVLNV ISGVAKLKFD DAVRGVFPYV VVLMSLLVLF IFIPELIITP
LKWIN