Gene SeD_A0844 details


Gene Information

Locus tag: SeD_A0844
Symbol: tolB
ID: 6873726
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853
Kingdom: Bacteria
Replicon accession: NC_011205
Strand:
Start bp: 840724
End bp: 842019
Gene Length: 1296 bp
Protein Length: 431 aa
Translation table: 11
GC content: 56%
IMG OID: 642784039
Product: translocation protein TolB
Protein accession: YP_002214718
Protein GI: 198244972
COG category: [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG0823] Periplasmic component of the Tol biopolymer transport system
TIGRFAM ID: [TIGR02800] tol-pal system beta propeller repeat protein TolB
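
The coordinates and lengths listed in this record can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length; neither convention is stated in the record, and the function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one stop codon is excluded."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(840724, 842019)
print(length, protein_length(length))  # 1296 431
```

Under those assumptions the result agrees with the Gene Length and Protein Length fields of the record.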


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0211285
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 68
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATGAAGC AGGCATTACG AGTAGCATTT GGTTTTCTGA TGCTGTGGGC GGCGGTGCTG 
CACGCAGAAG TCCGTATCGA GATCACCCAG GGGGTGGACT CGGCGCGACC GATTGGCGTT
GTGCCTTTTA AATGGGCCGG GCCGGGCGCT GCGCCTGAAG ATATCGGCGG CATCGTGGCG
GCAGATTTAC GTAATAGCGG TAAATTTAAT CCGTTAGACC GGTCCCGACT GCCGCAGCAG
CCAGCCACCG CTCAGGAAGT TCAGCCTACC GCATGGTCTG CGCTGGGTAT TGATGCCGTC
GTCGTTGGGC AGGTAACGCC GAATCCGGAC GGTTCCTACA ATGTTGCTTA TCAGCTGGTT
GACACTGGCG GCGCGCCGGG GACTGTACTG GCGCAAAATT CTTATAAAGT GAACAAGCAG
TGGCTGCGTT ATGCAGGTCA TACCGCCAGT GACGAAGTCT TTGAAAAACT GACGGGCATT
AAGGGCGCGT TCCGTACTCG TATCGCCTAT GTGGTACAGA CTAATGGCGG TCAGTTCCCG
TATGAACTGC GTGTGTCGGA TTACGATGGT TACAATCAGT TTGTGGTGCA CCGTTCTCCG
CAGCCGTTGA TGTCTCCGGC GTGGTCTCCG GACGGCTCAA AACTGGCTTA CGTGACATTT
GAAAGCGGTC GCTCCGCGCT GGTTATCCAG ACGCTGGCAA ACGGCGCAGT GCGTCAGGTT
GCGTCCTTCC CGCGTCACAA CGGCGCGCCG GCCTTCTCGC CGGATGGGAC GAAACTGGCG
TTCGCGTTAT CGAAAACCGG AAGTCTGAAC CTGTACGTTA TGGATCTTGC TTCCGGCCAG
ATTCGTCAGA TAACGGACGG GCGTAGCAAC AATACGGAGC CGACCTGGTT CCCGGACAGC
CAGACTCTGG CCTTTACCTC TGACCAGGCT GGACGTCCGC AAGTGTATAA AATGAACATT
AACGGCGGTG CGGCGCAGCG TATTACCTGG GAAGGTTCGC AAAACCAGGA TGCGGATGTC
AGCAGCGACG GTAAATTTAT GGTAATGGTA AGCTCAAATA ACGGGCAGCA GCACATTGCC
AAACAAGATC TGGTGACGGG TGGCGTACAG GTTCTGTCGT CAACGTTCCT GGATGAAACG
CCAAGTCTGG CACCTAACGG CACGATGGTA ATCTACAGCT CTTCTCAGGG GATGGGATCT
GTGCTGAATT TGGTTTCTAC AGATGGGCGT TTCAAAGCGC GTCTTCCGGC AACTGATGGT
CAGGTGAAAT CGCCTGCCTG GTCGCCGTAT CTGTGA
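
The GC content field can in principle be recomputed from the gene sequence. The sketch below assumes GC content means the share of G and C among all A, C, G, and T characters, which may differ from how the database derived its rounded figure; `gc_percent` is a hypothetical helper. Whitespace between the 10-base blocks is ignored.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C among the A/C/G/T characters in seq."""
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotides found")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


# First two blocks of the gene sequence, with the spacing left in.
print(gc_percent("ATGATGAAGC AGGCATTACG"))  # 45.0
```

Passing the full gene sequence instead of the two-block excerpt would give the value to compare against the GC content field.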
 
Protein sequence
MMKQALRVAF GFLMLWAAVL HAEVRIEITQ GVDSARPIGV VPFKWAGPGA APEDIGGIVA 
ADLRNSGKFN PLDRSRLPQQ PATAQEVQPT AWSALGIDAV VVGQVTPNPD GSYNVAYQLV
DTGGAPGTVL AQNSYKVNKQ WLRYAGHTAS DEVFEKLTGI KGAFRTRIAY VVQTNGGQFP
YELRVSDYDG YNQFVVHRSP QPLMSPAWSP DGSKLAYVTF ESGRSALVIQ TLANGAVRQV
ASFPRHNGAP AFSPDGTKLA FALSKTGSLN LYVMDLASGQ IRQITDGRSN NTEPTWFPDS
QTLAFTSDQA GRPQVYKMNI NGGAAQRITW EGSQNQDADV SSDGKFMVMV SSNNGQQHIA
KQDLVTGGVQ VLSSTFLDET PSLAPNGTMV IYSSSQGMGS VLNLVSTDGR FKARLPATDG
QVKSPAWSPY L
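
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. As a minimal sketch: translation table 11 assigns the same amino acids to codons as the standard code (it differs in the permitted start codons), so a standard codon table is enough for a gene that starts with ATG, as this one does. The `translate` helper is hypothetical; it stops at the first stop codon and will raise `KeyError` on ambiguous bases such as N.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons, enumerated in TCAG order.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string, ignoring whitespace, up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence above.
print(translate("ATGATGAAGC AGGCATTACG AGTAGCATTT"))  # MMKQALRVAF
```

The printed residues match the first block of the protein sequence, and the final codons of the gene (TAT CTG TGA) translate to the closing residues Y and L followed by a stop.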