Gene SeD_A0843 details


Gene Information

Locus tag: SeD_A0843
Symbol: tolA
ID: 6873023
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853
Kingdom: Bacteria
Replicon accession: NC_011205
Strand:
Start bp: 839425
End bp: 840603
Gene Length: 1179 bp
Protein Length: 392 aa
Translation table: 11
GC content: 58%
IMG OID: 642784038
Product: cell envelope integrity inner membrane protein TolA
Protein accession: YP_002214717
Protein GI: 198244347
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG3064] Membrane protein involved in colicin uptake
TIGRFAM ID: [TIGR02794] TolA protein
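
The coordinate and length fields above are related by simple arithmetic, which can be checked with a short sketch. It assumes that Start bp and End bp are 1-based, inclusive positions and that the coding sequence ends with a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids encoded, assuming one trailing stop codon is excluded."""
    if gene_len_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length_bp = gene_length(839425, 840603)
print(length_bp)                  # 1179
print(protein_length(length_bp))  # 392
```

Both results agree with the Gene Length and Protein Length fields of this record.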


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0151621
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 68
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGTCAAAGG CAACCGAACA AAACGACAAG CTCAAACGGG CGATAATTAT TTCAGCCGTG 
CTGCATATCA TCTTATTTGC AGTGCTGATC TGGAGTTCGT TTGATGAGCA TATAGAGGCT
TCTGCCGGCG GCGGCGGTGG TTCCGCTATC GACGCGGTGA TGGTCGATCC TGGCGCCGTT
GTGCAGCAGT ACAACCGTCA GCAGGATCAA CAGGCCAGCG CCAGACGCGC GGAAGAAGAG
CGTAAAAAGC TGCAACAGCA GCAAGCAGAG GAGCTGCAGC AGAAGCAGGC TGCCGAACAG
GAGCGGTTGA AACAACTTGA GAAAGAACGT TTAGCGGCTC AGGAGCAGCA AAAGCAGGCT
GAAGAAGCGG CAAAACTGGC GCAACAGCAG CAGCAACAGG CCGAAGAAGC GGCGAAAGCG
GCGGCGGACG CGAAGAAAAA AGCGGAGGCC GAAGCGGCGA AAGCGGCGGC GGACGCGAAG
AAGAAAGCGG AGGCCGAAGC GGCGAAAGCG GCGGCGGACG CGAAGAAGAA AGCGGAAGCC
GAGGCGGTAA AAGCGGCAGC GGACGCGAAG AAGAAAGCGG AAGCCGAAGC GGCGAAAGCG
GCGGCGGAGG CGAAGAAGAA AGCGGAAGCC GAAGCGGCGA AAGCGGCGGC GGAGGCGAAG
AAGAAAGCGG ATGCCGAGGC GGCGAAAGCG GCGGCGGAGG CGAAGAAGAA AGCGGATGCC
GCGGCGGCGA AAGCAGCGGC GGACGCTAAG AAGAAAGCGG CTGCCGAAAA AGCGGCCGCC
GCAGAAGGCG TCGACGATCT GCTTGGCGAT CTCAGCTCGG GTAAGAATGC GCCGAAAACC
GGCGGCGGCG CGAAAGGGAA TGGTCAGCCA TCGAAAGATA GCGGTACATC GGGCGCTAAC
GGTGGGGCGA CAGGCGCTGA TATCAGCGCC TACGCGAAAC AGATTCAGGT CGCCATTCAG
AGCCGTCTGT ATGATGCGAG CCTGTATCAG GGCAAACAAT GTGTCTTGCA TATTAGCCTG
GCGCCGGATG GCTCATTAAA AAGCATTACG TCTGAGGGCG GCGATCCGGC GCTTTGTCAG
GCGGCGTTAA TGGCGGCAAA AACCGCGAAA ATTCCTAAAC CGCCAAGCCA GGCTGTTTAT
GAGAAAATAA AGGATGCCAA ACTAGACTTT AAACTGTAG
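
The GC content field can be recomputed from a sequence printed in this grouped layout. The following is a minimal sketch, assuming the sequence is pasted as text with the 10-base groups and line breaks left in place. The function name `gc_content` is hypothetical.

```python
def gc_content(seq_text: str) -> float:
    """Return the G+C fraction of a DNA sequence given as text.

    Whitespace (the 10-base grouping and line breaks) is ignored.
    """
    seq = "".join(seq_text.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First line of the gene sequence from this record, used only as a demo input.
first_line = "GTGTCAAAGG CAACCGAACA AAACGACAAG CTCAAACGGG CGATAATTAT TTCAGCCGTG"
print(f"{gc_content(first_line):.1%}")
```

The value reported in the record refers to the whole gene, so the figure for a single line need not match it. Passing the full gene sequence gives the gene-level value.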
 
Protein sequence
MSKATEQNDK LKRAIIISAV LHIILFAVLI WSSFDEHIEA SAGGGGGSAI DAVMVDPGAV 
VQQYNRQQDQ QASARRAEEE RKKLQQQQAE ELQQKQAAEQ ERLKQLEKER LAAQEQQKQA
EEAAKLAQQQ QQQAEEAAKA AADAKKKAEA EAAKAAADAK KKAEAEAAKA AADAKKKAEA
EAVKAAADAK KKAEAEAAKA AAEAKKKAEA EAAKAAAEAK KKADAEAAKA AAEAKKKADA
AAAKAAADAK KKAAAEKAAA AEGVDDLLGD LSSGKNAPKT GGGAKGNGQP SKDSGTSGAN
GGATGADISA YAKQIQVAIQ SRLYDASLYQ GKQCVLHISL APDGSLKSIT SEGGDPALCQ
AALMAAKTAK IPKPPSQAVY EKIKDAKLDF KL
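
The gene sequence begins with GTG while the protein begins with M, which is consistent with translation table 11 (bacterial, archaeal and plant plastid code): its amino-acid assignments match the standard code, but GTG and several other codons are accepted as initiators and read as methionine at the first position. The sketch below illustrates this with a hand-built codon table. The names `translate` and `START_CODONS` are hypothetical, the initiator set is written out here as an assumption based on the NCBI table, and the function is not a replacement for a tested library routine.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard code, shared by table 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Initiator codons assumed for table 11; each is read as M at position 0.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate(seq_text: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon.

    Whitespace is ignored. Ambiguous bases such as N raise KeyError.
    """
    seq = "".join(seq_text.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            protein.append("M")  # initiator codon is read as methionine
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break  # stop codon ends translation
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate("GTGTCAAAGG CAACCGAACA AAACGACAAG"))  # MSKATEQNDK
```

The output matches the first ten residues of the protein sequence above. Applying the same function to the complete gene sequence is one way to cross-check the two sequence blocks against each other.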