Gene Information |
Locus tag | ECH74115_0239 |
Symbol | |
ID | 6966763 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 254656 |
End bp | 255987 |
Gene Length | 1332 bp |
Protein Length | 443 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 643384310 |
Product | hypothetical protein |
Protein accession | YP_002268826 |
Protein GI | 209398751 |
COG category | [S] Function unknown |
COG ID | [COG3522] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | [TIGR03353] type VI secretion protein, VC_A0114 family |
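
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the stop codon; the function names are illustrative and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Protein length in aa for an unspliced CDS whose length includes the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(254656, 255987)
print(length)                  # 1332
print(protein_length(length))  # 443
```

Both values agree with the Gene Length (1332 bp) and Protein Length (443 aa) fields in the record.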
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 52 |
Fosmid unclonability p-value | 0.892408 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCAACCA CCCGCAACAA GGTGATGTGG CAGGAAGGGA TGCTGATGCG CCCACACCAT TTCCAGCAAC AGCAGCGTTA CAACGACTAC CTGGATAACC AGCGTTTCCG GGCCATGAAT GATTTATCCT GGGGATTTAC CGAACTCACC CTCAACAATG AACTGCTGGC GCAGGGTAAG ATCATGATTG ACAGCGCGTC AGGCACACTG CCTGACGGCA CCGTCTTTTC TATCCCCGAC CAGGACGCAC TGCCCGATCC GCTGCACCCG CAAAATTTTC CGGACGAAAG AAGCCGCAAT ATTTACCTCG CTCTGCCGGT CGCCAGTGAT GTGAGAAATG AAATCAGCGA CGGGCGGCGA ATCGGGCGTT ACCGGCTGAA TTATGCCGAT GTCCGGGATC TGCATTCAGA AGAAGGCGAC GCGCGAACAC TGACGCTGGG ACAACTGACG CCGCGCATTA TGAGCGGTGC AGAAGATATG AGTGCCTATA TTACGCTCCC GCTTTGCCGT ATCAGCGATC GCCATGCCGA CGGCTCCCTG ACGCTGGATG ACGATTTTAT CCCCTCCTGC CAGAATATTC AGGTCAGTAA GAAACTGCGT GTTTATCTCA AAGAGGTACA GGGGGCCATT GGCGGACGGG CAAGCGATCT GGCAAACCGT ATTGGCTCTC CGGCGCAGAG CGGCATCGCG GATGTGGCGG AATTTATGAT GTTGCAGTTG CTTAACCGTA ACCAGACCCG GTTTACCCAT CGCGCCCGCC GATCCCAGCT CCACCCGGAA GATTTCTACC TTGATCTTGC CGAGTTGCTG GGTGAACTGA TGACCTTTAC AGAGCCGTCA CGCCTGCCCT GCCCGCTTGA TGTGTATGAT CATCATGACC TGACCAAAAC ATTTAAAACA CTGTTACCGG AAGTCAAACG GGCGCTGCAT ACCGTACTGT CGCCAAGAGC GGTCAATCTG CCGCTGCATC TGCGTGACGG CATCTGGCAG GCCGATGTTC ATGACTCGGA ACTGCTACAG TCTGCCACCT TTGTACTGGC CGTGGCAGCA AATATGCCTG TCGATCAGAT CCAGCGTCAG TTTATCCAGC AGTCGAAAAT TTCCTCGCCG GAAAAAATCC GCAATATGGT CAGTGTGCAG ATCCCGGGTA TTCCTCTGCG CGCACTGATG GTGGCCCCCC GCCAGCTTCC CTACCATTCC GGGTTCAGCT ATTTCGAACT CGACAAGAGC GGCCAGGCCT GGACAGAAAT GGCTGCCGCC GGGGCCGTTG CACTGCATGT ATCCGGCAGT TTCCCGGATC TGAACATGCA ACTGTGGGCG ATAAGAGGGT AA
|
Protein sequence | MATTRNKVMW QEGMLMRPHH FQQQQRYNDY LDNQRFRAMN DLSWGFTELT LNNELLAQGK IMIDSASGTL PDGTVFSIPD QDALPDPLHP QNFPDERSRN IYLALPVASD VRNEISDGRR IGRYRLNYAD VRDLHSEEGD ARTLTLGQLT PRIMSGAEDM SAYITLPLCR ISDRHADGSL TLDDDFIPSC QNIQVSKKLR VYLKEVQGAI GGRASDLANR IGSPAQSGIA DVAEFMMLQL LNRNQTRFTH RARRSQLHPE DFYLDLAELL GELMTFTEPS RLPCPLDVYD HHDLTKTFKT LLPEVKRALH TVLSPRAVNL PLHLRDGIWQ ADVHDSELLQ SATFVLAVAA NMPVDQIQRQ FIQQSKISSP EKIRNMVSVQ IPGIPLRALM VAPRQLPYHS GFSYFELDKS GQAWTEMAAA GAVALHVSGS FPDLNMQLWA IRG
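
The gene and protein sequences above can be checked against each other by translating the nucleotide sequence. The following is a minimal sketch, assuming the gene sequence is listed in coding orientation (it begins with ATG and ends with the stop codon TAA) and that the space-separated blocks of ten are display formatting only. Translation table 11 uses the same amino acid assignments as the standard genetic code, so a single codon table is built here. The helper names `clean`, `translate` and `gc_fraction` are illustrative.

```python
# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: aa
    for (a, b, c), aa in zip(
        ((a, b, c) for a in BASES for b in BASES for c in BASES), AMINO
    )
}


def clean(seq: str) -> str:
    """Remove whitespace and upper-case a sequence copied from the record."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    Trailing bases that do not form a full codon are ignored.
    """
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


# First two blocks of the gene sequence from the record.
prefix = clean("ATGGCAACCA CCCGCAACAA")
print(translate(prefix))  # MATTRN
```

Running `translate(clean(...))` on the full Gene sequence field should reproduce the Protein sequence field once its spaces are removed, and `gc_fraction` on the same input can be compared with the reported GC content.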
|