Gene Information |
Locus tag | ECH74115_1086 |
Symbol | |
ID | 6968180 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 1114057 |
End bp | 1115892 |
Gene Length | 1836 bp |
Protein Length | 611 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 643385098 |
Product | hypothetical protein |
Protein accession | YP_002269597 |
Protein GI | 209399966 |
COG category | [S] Function unknown |
COG ID | [COG2989] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
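
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is only an illustration: the function names are hypothetical, and it assumes 1-based inclusive coordinates and an unspliced CDS ending in a single stop codon, neither of which the record states explicitly.

```python
# Values copied from the Gene Information table above.
START_BP = 1114057
END_BP = 1115892


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residue count, assuming an unspliced CDS with one terminal stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1  # the stop codon encodes no residue


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1836, matching "Gene Length"
print(protein_length(length_bp))  # 611, matching "Protein Length"
```

Under these assumptions both results agree with the Gene Length (1836 bp) and Protein Length (611 aa) fields.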
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.0169645 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 51 |
Fosmid unclonability p-value | 0.732672 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATGTGTG GTCGTCGGCT GTCGGCAATC AGTTTGTGCC TGGCCGTAAC ATTCGCTCCA CTGTTCAATG CGCAGGCCGA TGAGCCTGAA GTAATCCCTG GCGACAGCCC GGTGGCTGTC AGTGAACAGG GCGAGGCACT GCCGCAGGCG CAAGCCACGG CAATAATGGC GGGGATCCTG CCATTGCCTG AAGGTGCGGC AGAAAAAGCC CGCACGCAAA TCGAATCTCA ATTACCCGCA GGTTACAAGC CGGTTTATCT TAACCAGCTT CAACTGTTGT ATGCCGCACG CGATATGCAA CCCATGTGGG AAAACCGTGA TGCTGTTAAA GCCTTCCAGC AACAGCTGGC AGAGGTGGCG ATTGCCGGTT TCCAGCCGCA GTTTAATAAA TGGGTAGAGT TACTGACCGA TCCTGGTGTT AACGGGATGG CACGCGACGT GGTGCTCTCT GATGCGATGA TGGGCTATCT CCATTTCATT GCAAATATTC CGGTCAAAGG CACTCGCTGG CTATATAGCA GTAAACCTTA TGCGCTTGCA ACGCCGCCGC TTTCGGTGAT TAACCAATGG CAGCAGGCGC TGGATAAAGG TCAATTGCCT ACGTTTGTTG CAGGACTGGC ACCGCAGCAT CCGCAATATG CGGTGATGCA TGAATCGTTA CTGGCCTTAC TCAGTGACAC CAAACCGTGG CCCCAACTGA CCGGCAAAGC AACGTTGCGC CCAGGGCAGT GGAGTAACGA CGTACCGGCG TTGCGTGAAA TATTGCAACG CACAGGCATG TTGGACGGGG GGCCGAAAAT TACTCTACCT GGCGATGACA CGCCAACTGA CGCGGTAGTC AGCCCATCCG CTGTTACTGT TGAAACAGCA GAAACTAAGC CGATGGATAA GCAAACGACG TCTCGTAGTA AACCTGCGCC TGCCGTTCGC GCCGCCTACG ATAATGAACT GGTGGAAGCC GTTAAACGTT TTCAGGCATG GCAAGGATTG GGGGCAGATG GTGCTATTGG CCCGGCAACG CGTGACTGGT TAAACGTAAC GCCCGCCCAG CGTGCTGGCG TGTTGGCTCT CAACATCCAG CGATTGCGCT TGCTGCCAAC AGAGCTTTCT ACCGGGATCA TGGTTAACAT TCCGGCCTAT TCGCTGGTCT ACTATCAGAA CGGCAATCAG GTGCTGGATT CGCGAGTCAT TGTCGGTCGC CCCGATCGCA AAACGCCGAT GATGAGCAGT GCCCTTAACA ATGTAGTGGT AAACCCGCCG TGGAACGTAC CTCCAACTCT GGCACGCAAA GATATTCTGC CGAAAGTGCG CAACGATCCG GGATATCTCG AAAGCCATGG CTATACGGTG ATGCGCGGCT GGAACAGCAG AGAAGCGATT GACCCATGGC AGGTTGACTG GTCTACAATC ACGGCCTCGA ATTTACCGTT TCGCTTCCAA CAGGCTCCAG GCCCACGGAA CTCGCTGGGG CGCTATAAAT TCAATATGCC GAGTTCAGAG GCCATTTATT TGCATGACAC GCCGAACCAC AATCTGTTCA AGCGTGATAC ACGCGCATTG AGCTCAGGCT GTGTACGAGT GAATAAAGCT TCCGATCTGG CGAATATGCT GTTGCAGGAT GCAGGCTGGA ATGACAAACG TATTTCTGAT GCGCTGAAGC AGGGTGATAC ACGTTACGTC AATATTCGGC AGTCGATTCC GGTGAATCTC TACTACCTGA CGGCCTTTGT TGGTGCAGAT GGTCGTACCC AGTATCGTAC AGATATTTAC AATTATGATC TGCCTGCGCG ATCCAGCTCG 
CAAATCGTAT CGAAAGCGGA ACAATTAATC AGGTAA
|
Protein sequence | MMCGRRLSAI SLCLAVTFAP LFNAQADEPE VIPGDSPVAV SEQGEALPQA QATAIMAGIL PLPEGAAEKA RTQIESQLPA GYKPVYLNQL QLLYAARDMQ PMWENRDAVK AFQQQLAEVA IAGFQPQFNK WVELLTDPGV NGMARDVVLS DAMMGYLHFI ANIPVKGTRW LYSSKPYALA TPPLSVINQW QQALDKGQLP TFVAGLAPQH PQYAVMHESL LALLSDTKPW PQLTGKATLR PGQWSNDVPA LREILQRTGM LDGGPKITLP GDDTPTDAVV SPSAVTVETA ETKPMDKQTT SRSKPAPAVR AAYDNELVEA VKRFQAWQGL GADGAIGPAT RDWLNVTPAQ RAGVLALNIQ RLRLLPTELS TGIMVNIPAY SLVYYQNGNQ VLDSRVIVGR PDRKTPMMSS ALNNVVVNPP WNVPPTLARK DILPKVRNDP GYLESHGYTV MRGWNSREAI DPWQVDWSTI TASNLPFRFQ QAPGPRNSLG RYKFNMPSSE AIYLHDTPNH NLFKRDTRAL SSGCVRVNKA SDLANMLLQD AGWNDKRISD ALKQGDTRYV NIRQSIPVNL YYLTAFVGAD GRTQYRTDIY NYDLPARSSS QIVSKAEQLI R
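
The relationship between the two sequences above can be sketched with a small translation routine. This is a minimal illustration, not the database's own method: the names `CODON_TABLE` and `translate` are hypothetical, and the codon assignments are those of the standard genetic code, which translation table 11 shares (table 11 differs only in its permitted start codons). Only the first 30 and last 9 nucleotides of the gene sequence are used here.

```python
# Standard codon assignments in TCAG order (shared by translation table 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used above) is ignored.
    """
    seq = "".join(dna.split()).upper()
    residues = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":  # stop codon: end of the protein
            break
        residues.append(residue)
    return "".join(residues)


# First 30 nt and last 9 nt of the gene sequence listed above.
print(translate("ATGATGTGTG GTCGTCGGCT GTCGGCAATC"))  # MMCGRRLSAI
print(translate("ATC AGGTAA"))                        # IR
```

The first result matches the opening ten residues of the protein sequence, and the second matches its final two residues, with the terminal TAA read as the stop codon.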