Gene SNSL254_A2300 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSNSL254_A2300 
Symbol 
ID6485556 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Newport str. SL254 
KingdomBacteria 
Replicon accessionNC_011080 
Strand
Start bp2212921 
End bp2213967 
Gene Length1047 bp 
Protein Length348 aa 
Translation table11 
GC content55% 
IMG OID642737646 
Productpolysaccharide biosynthesis/export protein 
Protein accessionYP_002041388 
Protein GI194446455 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1596] Periplasmic protein involved in polysaccharide export 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.631767 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones63 
Fosmid unclonability p-value0.480899 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGGAAAG ATGTGATCAA ACAGCAAGAC GCTGACTTTG ATCTCGACCG GATGGTCAAT 
GTGTATCCGC TGACGCCACG GTTGGTTGAG CAATTACGCC CGCGGCCCAA TGTCGCGCAA
CCGAATATGT CGCTGGACCA GGAGATCGCC AGCTATCAGT ATCGCGTCGG GCCTGGCGAT
GTGCTGAATG TCACCGTCTG GGATCACCCG GAATTGACCA CGCCAGCAGG CCAGTACCGT
AGCTCAAGCG ATACCGGCAA CTGGGTACAG CCGGACGGCA CCATGTTTTA TCCCTACATT
GGCAAGGTTA GCGTCGTCGG TAAAACTTTG TCAGAGATTC GTAGCGATAT TACCGGGCGT
TTAGCGAAGT ACATTGCGGA CCCGCAGGTG GATGTCAATA TCGCCGCTTT CCGCTCGCAA
AAAGCGTACA TCTCCGGCCA GGTGAATAAA TCCGGTCAGC AGGCTATTAC TAACGTACCG
CTAACCGTCC TGGATGCGAT TAACGCTGCG GGCGGCCTGA CCGATATGGC GGACTGGCGC
AACGTCGTGT TGACGCACAA CGGCAAAGAA CAGCGCATTT CGCTACAGGC GCTGATGCAA
AATGGCGATC TTAGCCAGAA CCGCTTGCTC TACCCTGGCG ACATTCTGTA TGTGCCGCGC
AATGACGATC TGAAAGTCTT TGTCATGGGC GAAGTGAAAA AACAGAGCAC CCTCAAAATG
GATTTCAGCG GCATGACGCT CACCGAAGCA TTAGGCAATG CGGAAGGCAT TGATCTGACC
ACCTCCAACG CCAGCGGCAT TTTTGTGATT CGTCCGTTGA AAGGCGAGGG GGGACGCGGC
GGCAAGATCG CCAATATCTA CCAGCTTGAT ATGTCTGACG CCACGTCATT GGTGATGGCG
ACGGAATTCC GACTTCAGCC TTACGATGTG GTGTACGTCA CGACCGCGCC GGTTGCTCGC
TGGAACCGTC TGATCAATCA GTTGCTGCCA ACCATTAGCG GTGTCCGTTA TATGACGGAT
ACGGCCAGCG ACATTCATTC CTGGTAA
 
Protein sequence
MGKDVIKQQD ADFDLDRMVN VYPLTPRLVE QLRPRPNVAQ PNMSLDQEIA SYQYRVGPGD 
VLNVTVWDHP ELTTPAGQYR SSSDTGNWVQ PDGTMFYPYI GKVSVVGKTL SEIRSDITGR
LAKYIADPQV DVNIAAFRSQ KAYISGQVNK SGQQAITNVP LTVLDAINAA GGLTDMADWR
NVVLTHNGKE QRISLQALMQ NGDLSQNRLL YPGDILYVPR NDDLKVFVMG EVKKQSTLKM
DFSGMTLTEA LGNAEGIDLT TSNASGIFVI RPLKGEGGRG GKIANIYQLD MSDATSLVMA
TEFRLQPYDV VYVTTAPVAR WNRLINQLLP TISGVRYMTD TASDIHSW