Gene EcSMS35_0116 details

Gene Information

Locus tag: EcSMS35_0116
Symbol:
ID: 6144545
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 128593
End bp: 130374
Gene Length: 1782 bp
Protein Length: 593 aa
Translation table: 11
GC content: 49%
IMG OID: 641615017
Product: putative S-type colicin
Protein accession: YP_001742233
Protein GI: 170680765
COG category: [S] Function unknown
COG ID: [COG3157] Hemolysin-coregulated protein (uncharacterized)
TIGRFAM ID: [TIGR03344] type VI secretion system effector, Hcp1 family
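
The coordinate and length fields in this record can be cross-checked with a short calculation. The sketch below assumes 1-based inclusive coordinates and a CDS that ends in a single stop codon. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 128593
end_bp = 130374

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted as a residue.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1782 593
```

Both results agree with the listed Gene Length (1782 bp) and Protein Length (593 aa).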


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 41
Fosmid unclonability p-value: 0.305649
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGGCGATA TTGTTTACCT GAGAATAATC GGTGAGAAGC AGGGCGATAT TTCTTCAGGC 
TGTGGGACGT ATGCGTCGGT TGGTAATCGC TGGCAGGTTG GCCATGAAGA TGAAATTTTT
GCCTTTGCAC TCACCAACAC CATTACCAGT ACTGGTAAAG GCGTTAATCT GCAGGGGCTA
CAATTTTGCA AACTCATTGA TAAAAGCTCA CCGCTACTGT CTAATGCCAT CAATCAGAAT
GAGCGGTTAT TTATTGAAAT CGATTTGTAT CGTATAAATA AAAGCGGGCG CTGGGAACGG
TATTATTATA TTCAGCTAAG AAATGCTTCA TTAACTGCTA TTCATGTAAA CATTTCTGAC
AATAATCTTC CTACCGAATG TGTAACTGTC GATTATGACT ACATATTATG TAAACATCTA
ATAGCCAATA CGGAATTTGA CTGGTTGGCC TTTCCTGCTG GCCATAATAG CTTATTTATT
CCACCTAAAA ACCCACCTGC CAGTAATCTT AACCCTGAGC CTCTACCAGT TGTTAACCTT
CCACTCTCTC CACCAGCGGT TAAACCGGTC TATGCCAAAT CCTGTCTGAA GGAGAAGGGA
TGTACAGATG CCGGAACGGC AGAAGAACCC GCTGAAAACT TCGGGCAAGT AGCGATTTTT
GCTCTGCCAG TGGTTGATGA CTGCTGTGGA TACCACCATC CCGAGGCTAA CGATGTCGGG
CAACCCGCAG AAGCTCAAAC CATGCTACTG TTTCCGGGTA GCGTATTGGC GGCTCAAATA
TGGGGAAAAT GGTCGCTCAG TGGCATACTC AGTGCAACCC GCGGCTCTTA CATCGGTGCG
TTGGCATCTG CTTTGTATAT TCCCTCTGCG GGCGAGGGCA GTGCTCGCGT GCCCGGACGT
GATGAGTTCT GGTATGAGGA AGAACTGCAG CAGAAAGCAC TAGCAGGCAG TACCGCCACC
ACCCGGGTAC GTTTTTTCTG GGGAACTGAC ATTCACGGCA AGCCTCAGGT GTATGGTGTT
CATACGGGTG AAGGTACGCC GTATGAAAAC GTCCGCGTGG CGAACATGCA GTGGAACGAG
CAGACGCAGC GTTATGAATT TACCCCCGCT CACGATGTCG ATGGCCCCCT GATTACCTGG
ACGCCGGAAA ATCCGGAACA TGGGAATGTT CCGGGCCATA CCGGTAACGA CAGGCCGCCG
CTGGAGCAGC CCACCATTCT GGTGACGCCG ATTCCGGACG GCACTGATAC CTATAGCACG
CCGCCATTCC CGGTTCCTGA TCCGAAAGAA TTCAACGATT ATATTCTGGT TTTTCCGGCG
GGATCCGGTA TTAAGCCCAT CTATGTTTAC CTGAAGGAGG ATCCGCGAAA GCTGCCTGGT
GTTGTAACAG GGCGCGGCGT CCCGCTTTCA CCAGGAACTC GCTGGCTGGA TATGTCGGTA
TCCAATAACG GCAACGGCGC ACCAATCCCG GCACATATTG CTGATAAATT GCGCGGACGG
GAGTTTAAAA CCTTTGATGA GTTTCGCGAG GCGCTGTGGC TGGAGGTGAG TCAGGATCCG
GAGTTGATAG CGCAGTTTTC AAGTGGTAAC CAAACACGTA TAAAACAAGG ATTAACCGCA
AAAGCACCTA TTGACGGTTG GCATTATGGC CCTAAAGAAA TAGTTAAAAA ATTCCAGATA
CATCATCGTG TTGCAATTGA ATATGGCGGC AGCGTATACG ATATTGATAA TTTACGAATT
GTTACCCCCA GACTACACGA TGAGATTCAC TACAGGAGAT AA
 
Protein sequence
MGDIVYLRII GEKQGDISSG CGTYASVGNR WQVGHEDEIF AFALTNTITS TGKGVNLQGL 
QFCKLIDKSS PLLSNAINQN ERLFIEIDLY RINKSGRWER YYYIQLRNAS LTAIHVNISD
NNLPTECVTV DYDYILCKHL IANTEFDWLA FPAGHNSLFI PPKNPPASNL NPEPLPVVNL
PLSPPAVKPV YAKSCLKEKG CTDAGTAEEP AENFGQVAIF ALPVVDDCCG YHHPEANDVG
QPAEAQTMLL FPGSVLAAQI WGKWSLSGIL SATRGSYIGA LASALYIPSA GEGSARVPGR
DEFWYEEELQ QKALAGSTAT TRVRFFWGTD IHGKPQVYGV HTGEGTPYEN VRVANMQWNE
QTQRYEFTPA HDVDGPLITW TPENPEHGNV PGHTGNDRPP LEQPTILVTP IPDGTDTYST
PPFPVPDPKE FNDYILVFPA GSGIKPIYVY LKEDPRKLPG VVTGRGVPLS PGTRWLDMSV
SNNGNGAPIP AHIADKLRGR EFKTFDEFRE ALWLEVSQDP ELIAQFSSGN QTRIKQGLTA
KAPIDGWHYG PKEIVKKFQI HHRVAIEYGG SVYDIDNLRI VTPRLHDEIH YRR
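
The protein sequence corresponds to the gene sequence read with translation table 11 (the bacterial, archaeal and plant plastid code). A minimal sketch of that translation follows, assuming the gene sequence is given in the coding orientation. The function name `translate_cds` and its `initiator` parameter are hypothetical helpers for illustration, not part of this record or of any library.

```python
from itertools import product

# Codon-to-amino-acid assignments in the usual T, C, A, G ordering.
# Table 11 uses the same assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Initiation codons permitted by table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna, initiator=True):
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace (such as the 10-base spacing used in this record) is ignored.
    If `initiator` is true and the first codon is a table 11 start codon,
    it is translated as M.
    """
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3  # ignore a trailing partial codon
    protein = []
    for i in range(0, usable, 3):
        codon = seq[i:i + 3]
        if i == 0 and initiator and codon in START_CODONS:
            protein.append("M")
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence; GTG is read as the initiator M.
print(translate_cds("GTGGGCGATA TTGTTTACCT GAGAATAATC"))  # MGDIVYLRII
```

The output matches the first ten residues of the protein sequence. Translating the final bases `TACAGGAGAT AA` with `initiator=False` gives `YRR`, matching the last three residues before the TAA stop codon.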