Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_0116 |
Symbol | |
ID | 6144545 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 128593 |
End bp | 130374 |
Gene Length | 1782 bp |
Protein Length | 593 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 641615017 |
Product | putative S-type colicin |
Protein accession | YP_001742233 |
Protein GI | 170680765 |
COG category | [S] Function unknown |
COG ID | [COG3157] Hemolysin-coregulated protein (uncharacterized) |
TIGRFAM ID | [TIGR03344] type VI secretion system effector, Hcp1 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 41 |
Fosmid unclonability p-value | 0.305649 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGGCGATA TTGTTTACCT GAGAATAATC GGTGAGAAGC AGGGCGATAT TTCTTCAGGC TGTGGGACGT ATGCGTCGGT TGGTAATCGC TGGCAGGTTG GCCATGAAGA TGAAATTTTT GCCTTTGCAC TCACCAACAC CATTACCAGT ACTGGTAAAG GCGTTAATCT GCAGGGGCTA CAATTTTGCA AACTCATTGA TAAAAGCTCA CCGCTACTGT CTAATGCCAT CAATCAGAAT GAGCGGTTAT TTATTGAAAT CGATTTGTAT CGTATAAATA AAAGCGGGCG CTGGGAACGG TATTATTATA TTCAGCTAAG AAATGCTTCA TTAACTGCTA TTCATGTAAA CATTTCTGAC AATAATCTTC CTACCGAATG TGTAACTGTC GATTATGACT ACATATTATG TAAACATCTA ATAGCCAATA CGGAATTTGA CTGGTTGGCC TTTCCTGCTG GCCATAATAG CTTATTTATT CCACCTAAAA ACCCACCTGC CAGTAATCTT AACCCTGAGC CTCTACCAGT TGTTAACCTT CCACTCTCTC CACCAGCGGT TAAACCGGTC TATGCCAAAT CCTGTCTGAA GGAGAAGGGA TGTACAGATG CCGGAACGGC AGAAGAACCC GCTGAAAACT TCGGGCAAGT AGCGATTTTT GCTCTGCCAG TGGTTGATGA CTGCTGTGGA TACCACCATC CCGAGGCTAA CGATGTCGGG CAACCCGCAG AAGCTCAAAC CATGCTACTG TTTCCGGGTA GCGTATTGGC GGCTCAAATA TGGGGAAAAT GGTCGCTCAG TGGCATACTC AGTGCAACCC GCGGCTCTTA CATCGGTGCG TTGGCATCTG CTTTGTATAT TCCCTCTGCG GGCGAGGGCA GTGCTCGCGT GCCCGGACGT GATGAGTTCT GGTATGAGGA AGAACTGCAG CAGAAAGCAC TAGCAGGCAG TACCGCCACC ACCCGGGTAC GTTTTTTCTG GGGAACTGAC ATTCACGGCA AGCCTCAGGT GTATGGTGTT CATACGGGTG AAGGTACGCC GTATGAAAAC GTCCGCGTGG CGAACATGCA GTGGAACGAG CAGACGCAGC GTTATGAATT TACCCCCGCT CACGATGTCG ATGGCCCCCT GATTACCTGG ACGCCGGAAA ATCCGGAACA TGGGAATGTT CCGGGCCATA CCGGTAACGA CAGGCCGCCG CTGGAGCAGC CCACCATTCT GGTGACGCCG ATTCCGGACG GCACTGATAC CTATAGCACG CCGCCATTCC CGGTTCCTGA TCCGAAAGAA TTCAACGATT ATATTCTGGT TTTTCCGGCG GGATCCGGTA TTAAGCCCAT CTATGTTTAC CTGAAGGAGG ATCCGCGAAA GCTGCCTGGT GTTGTAACAG GGCGCGGCGT CCCGCTTTCA CCAGGAACTC GCTGGCTGGA TATGTCGGTA TCCAATAACG GCAACGGCGC ACCAATCCCG GCACATATTG CTGATAAATT GCGCGGACGG GAGTTTAAAA CCTTTGATGA GTTTCGCGAG GCGCTGTGGC TGGAGGTGAG TCAGGATCCG GAGTTGATAG CGCAGTTTTC AAGTGGTAAC CAAACACGTA TAAAACAAGG ATTAACCGCA AAAGCACCTA TTGACGGTTG GCATTATGGC CCTAAAGAAA TAGTTAAAAA ATTCCAGATA CATCATCGTG TTGCAATTGA ATATGGCGGC AGCGTATACG ATATTGATAA TTTACGAATT GTTACCCCCA GACTACACGA TGAGATTCAC TACAGGAGAT AA
|
Protein sequence | MGDIVYLRII GEKQGDISSG CGTYASVGNR WQVGHEDEIF AFALTNTITS TGKGVNLQGL QFCKLIDKSS PLLSNAINQN ERLFIEIDLY RINKSGRWER YYYIQLRNAS LTAIHVNISD NNLPTECVTV DYDYILCKHL IANTEFDWLA FPAGHNSLFI PPKNPPASNL NPEPLPVVNL PLSPPAVKPV YAKSCLKEKG CTDAGTAEEP AENFGQVAIF ALPVVDDCCG YHHPEANDVG QPAEAQTMLL FPGSVLAAQI WGKWSLSGIL SATRGSYIGA LASALYIPSA GEGSARVPGR DEFWYEEELQ QKALAGSTAT TRVRFFWGTD IHGKPQVYGV HTGEGTPYEN VRVANMQWNE QTQRYEFTPA HDVDGPLITW TPENPEHGNV PGHTGNDRPP LEQPTILVTP IPDGTDTYST PPFPVPDPKE FNDYILVFPA GSGIKPIYVY LKEDPRKLPG VVTGRGVPLS PGTRWLDMSV SNNGNGAPIP AHIADKLRGR EFKTFDEFRE ALWLEVSQDP ELIAQFSSGN QTRIKQGLTA KAPIDGWHYG PKEIVKKFQI HHRVAIEYGG SVYDIDNLRI VTPRLHDEIH YRR
|
| |