Gene Sbal195_1784 details


Gene Information

Locus tag: Sbal195_1784
Symbol:
ID: 5753520
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shewanella baltica OS195
Kingdom: Bacteria
Replicon accession: NC_009997
Strand:
Start bp: 2138054
End bp: 2139079
Gene Length: 1026 bp
Protein Length: 341 aa
Translation table: 11
GC content: 48%
IMG OID: 641288058
Product: protein TolA
Protein accession: YP_001554215
Protein GI: 160874899
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG3064] Membrane protein involved in colicin uptake
TIGRFAM ID: [TIGR02794] TolA protein
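
The coordinates and lengths listed above are mutually consistent, which a short sketch can confirm. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the final codon of the CDS is a stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 2138054
end_bp = 2139079

# Assumption: both coordinates are inclusive, so the span is end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# A CDS is read in triplets; the last triplet is assumed to be the stop codon,
# which does not contribute an amino acid.
codons = gene_length_bp // 3
protein_length_aa = codons - 1

print(gene_length_bp, protein_length_aa)  # 1026 341
```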


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000000349437
Plasmid hitchhiking: No
Plasmid clonability: unclonable
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGCAGATA AGTCAGATGT AACATTACCT CTGTCAATTT CAGCGGGTAT TCATATTGGC 
GTGATAATCA TTCTGGCGTT GGGGATCGAT TTCTCTCACA AACCTGAGCC TATGCAACAA
GTTAGTGCGC CAGCGGTTAA AGCCGTTATG GTTGATCAGC AAAAAGTCGC TAATCAAGTT
GAGAAACTTA AGCAAGAAAA GCGTGATGCC GAGCGCCGTG AGCAAGAACG TCAAGCTGAG
CTTGAGCGCA AAGCGCAAGA AGCGAAGCAA GCGCGCGAAC GTGAACAGGC TCAGATTAAA
CAGTTAGAGC AAGAACGTAA GCAGCAAGAA ATTGAAACCC AAAAGGCCAA TGAAGCGACC
AAGCTAGCCC AAGTGAAACA GCAGCAGGAA AAGGAAAAGG CAGTTAAAGC CGAAGCCGAC
CGTAAGCTGA AAGAGCAGGA GCGTAAAGTC GCTGAAGATG CCGCGCAGAA AGCAGCAGAA
AAACGCAAAG TGGAAGAAGC CGCTGCAGCA AAAGCGGAAA GTGACCGTAA GCTAAAGGAG
GCGGAAGCTA AGGCCAAAGC TGAAAAAGCC AAGGCAGATG CAGATGCGAA AGCCGAAGCT
AAGGCCAAAG CAGATGCTAA AGCCAAAGCG GACGCTAAGG CCAAAGCCGA CGCTGAAGCC
AAAGCCCGCG CCCAGCAGGA GCAAGAAATG GCCGATGCAT TAGCGGCGGA GCAGGCGGCG
TTGTCGCAAA CCATGAATAA GCAGATGCAG AGTGAAGTGA ATAAGTATAA GTCGATGATC
ATGTCGACAA TCCAGCGCAA TCTTATCGTT GATGAGTCAA TGCGTGGTAA AACTTGTCAA
GTTTCAGTGC GTTTAGCAAA TGATGGTTTT GTGATCAGCA GTCAAACTCA GGGCGGTGAT
CCTAACGTTT GTCGTGCAGC AAAAGCGGCG ATTCTGAAGG CGGGTAAATT GCCAGTGTCA
CCGGACCCTG CTGTTTATAA AGAATTAAAA GATATTAACT TAACCGTTTC ACCAACGTTT
AATTAA
 
Protein sequence
MADKSDVTLP LSISAGIHIG VIIILALGID FSHKPEPMQQ VSAPAVKAVM VDQQKVANQV 
EKLKQEKRDA ERREQERQAE LERKAQEAKQ AREREQAQIK QLEQERKQQE IETQKANEAT
KLAQVKQQQE KEKAVKAEAD RKLKEQERKV AEDAAQKAAE KRKVEEAAAA KAESDRKLKE
AEAKAKAEKA KADADAKAEA KAKADAKAKA DAKAKADAEA KARAQQEQEM ADALAAEQAA
LSQTMNKQMQ SEVNKYKSMI MSTIQRNLIV DESMRGKTCQ VSVRLANDGF VISSQTQGGD
PNVCRAAKAA ILKAGKLPVS PDPAVYKELK DINLTVSPTF N
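
The gene sequence begins with GTG while the protein begins with M. Under translation table 11, GTG is an accepted initiation codon and is read as methionine at the start of a CDS. The sketch below illustrates this on the first line of the gene sequence only. The function name `translate_cds` is hypothetical, and the treatment of the first codon as Met is an assumption that holds only when that codon is a valid initiator.

```python
from itertools import product

# Amino-acid string for NCBI translation table 11, with codons enumerated
# in T, C, A, G order (first base varies slowest, third base fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(dna: str) -> str:
    """Translate a CDS, ignoring whitespace and stopping at the first stop codon.

    Assumption: the first codon is a valid initiator (e.g. ATG, GTG, TTG),
    so it is emitted as 'M' regardless of its usual meaning.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append("M" if i == 0 else aa)
    return "".join(protein)


# First line of the gene sequence above (60 bases, 20 codons).
first_line = "GTGGCAGATA AGTCAGATGT AACATTACCT CTGTCAATTT CAGCGGGTAT TCATATTGGC"
prefix = translate_cds(first_line)
print(prefix)  # MADKSDVTLPLSISAGIHIG
```

The result matches the first 20 residues of the protein sequence. Passing the full gene sequence to the same function would be the natural way to check the remaining residues.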