Gene SeD_A2458 details

Gene Information

Locus tag: SeD_A2458
Symbol:
ID: 6874455
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853
Kingdom: Bacteria
Replicon accession: NC_011205
Strand:
Start bp: 2333134
End bp: 2334273
Gene Length: 1140 bp
Protein Length: 379 aa
Translation table: 11
GC content: 54%
IMG OID: 642785547
Product: polysaccharide biosynthesis/export protein
Protein accession: YP_002216205
Protein GI: 198242572
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1596] Periplasmic protein involved in polysaccharide export
TIGRFAM ID:
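
The start, end, and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the record does not say so explicitly) and that the final codon of the CDS is a stop codon; the function names are illustrative and not part of any database API.

```python
# Values copied from the record above.
start_bp = 2333134
end_bp = 2334273


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1  # drop the stop codon


length_bp = gene_length(start_bp, end_bp)
print(length_bp, protein_length(length_bp))  # 1140 379
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields.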


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 51
Fosmid unclonability p-value: 0.0550214
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATGAAAT CCAAAATGAA ATTGATGCCA TTATTGGCGT CGCTCAGCTT GATAAGTGGT 
TGCACAGTAC TTCCGGGCAG CAATATGTCT ACGATGGGGA AAGATGTGAT CAAACAGCAA
GACGCTGACT TTGATCTCGA CCGGATGGTC AATGTGTATC CGCTGACGCC ACGGCTGGTT
GAGCAATTAC GCCCGCGGCC CAATGTCGCG CAACCGAATA TGTCGCTGGA CCAGGAGATC
GCCAGCTATC AGTATCGCGT CGGGCCTGGC GATGTGCTGA ATGTCACCGT CTGGGATCAC
CCGGAATTGA CCACGCCAGC GGGCCAGTAC CGTAGCTCAA GCGATACCGG CAACTGGGTA
CAGCCGGACG GCACCATGTT TTATCCCTAC ATTGGCAAGG TTAGCGTCGT CGGTAAAACT
TTGTCAGAGA TTCGCAGCGA TATTACCGGG CGTTTAGCGA AGTACATCGC GGACCCGCAG
GTGGATGTCA ATATCGCCGC TTTCCGCTCG CAAAAAGCGT ATATCTCCGG CCAGGTGAAT
AAATCCGGTC AGCAGGCCAT TACTAACGTA CCGCTAACCG TCCTGGATGC GATTAACGCC
GCGGGCGGCC TGACCGATAT GGCGGACTGG CGCAACGTCG TGTTGACGCA CAACGGCAAA
GAACAGCGCA TTTCGCTACA GGCGCTGATG CAAAATGGCG ATCTTAGTCA GAACCGCTTG
CTCTACCCTG GCGACATTCT GTATGTGCCG CGCAATGACG ATCTGAAAGT CTTTGTCATG
GGCGAAGTGA AAAAACAGAG CACCCTCAAA ATGGATTTCA GCGGCATGAC GCTCACCGAA
GCATTGGGCA ATGCGGAAGG CATCGATCTG ACCACCTCCA ACGCCAGCGG CATTTTTGTG
ATTCGTCCGT TGAAAGGCGA GGGGGAACGC GGCGGCAAGA TCGCCAATAT CTACCAGCTT
GATATGTCTG ACGCCACGTC ATTGGTGATG GCGACGGAAT TCCGACTTCA GCCTTACGAT
GTGGTGTACG TCACGACCGC GCCGGTTGCT CGCTGGAACC GTCTGATCAA TCAGTTGCTG
CCAACCATTA GCGGTGTCCG TTATATGACG GATACGGCCA GCGACATTCA TTCCTGGTAA
 
Protein sequence
MMKSKMKLMP LLASLSLISG CTVLPGSNMS TMGKDVIKQQ DADFDLDRMV NVYPLTPRLV 
EQLRPRPNVA QPNMSLDQEI ASYQYRVGPG DVLNVTVWDH PELTTPAGQY RSSSDTGNWV
QPDGTMFYPY IGKVSVVGKT LSEIRSDITG RLAKYIADPQ VDVNIAAFRS QKAYISGQVN
KSGQQAITNV PLTVLDAINA AGGLTDMADW RNVVLTHNGK EQRISLQALM QNGDLSQNRL
LYPGDILYVP RNDDLKVFVM GEVKKQSTLK MDFSGMTLTE ALGNAEGIDL TTSNASGIFV
IRPLKGEGER GGKIANIYQL DMSDATSLVM ATEFRLQPYD VVYVTTAPVA RWNRLINQLL
PTISGVRYMT DTASDIHSW
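
The relationship between the two sequences above can be illustrated with a short script. This is only a sketch, not the method the database used: it builds the codon-to-amino-acid mapping of translation table 11 from the standard compact NCBI layout, strips the 10-base grouping used in this record, and translates up to the first stop codon. The helper names (`clean`, `gc_content`, `translate`) are hypothetical. Table 11 also permits alternative start codons that are read as M at the first position; that case is not handled here because this gene starts with ATG.

```python
# Codon table in the compact NCBI layout (base order T, C, A, G).
# The amino-acid assignments of table 11 match the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First line of the gene sequence in this record.
first_line = "ATGATGAAAT CCAAAATGAA ATTGATGCCA TTATTGGCGT CGCTCAGCTT GATAAGTGGT"
print(translate(first_line))  # MMKSKMKLMPLLASLSLISG
```

The printed residues match the first 20 residues of the protein sequence listed above. Passing the full gene sequence to `gc_content` would give a value to compare against the GC content field.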