Gene Information |
Locus tag | ECH74115_1569 |
Symbol | |
ID | 6969029 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 1534309 |
End bp | 1535430 |
Gene Length | 1122 bp |
Protein Length | 373 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 643385534 |
Product | cupin family protein |
Protein accession | YP_002270028 |
Protein GI | 209395797 |
COG category | [S] Function unknown |
COG ID | [COG2850] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.486119 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 0.00000357486 |
Fosmid hitchhiking | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGAATACC AACTCACTCT TAACTGGCCC GATTTTCTTG AACGTCACTG GCAGAAACGC CCGGTGGTGT TAAAACGCGG CTTTAATAAT TTTATTGACC CGATCTCTCC AGACGAGTTG GCGGGTCTGG CGATGGAAAG CGAAGTCGAC AGTCGACTGG TCAGTCACCA GGATGGAAAA TGGCAGGTCA GCCACGGTCC GTTCGAAAGC TACGATCATC TCGGTGAAAC TAACTGGTCA TTGTTAGTGC AAGCAGTGAA CCACTGGCAT GAGCCGACCG CCGCGCTGAT GCGACCGTTC CGTGAACTAC CGGACTGGCG TATTGATGAT CTGATGATCT CTTTTTCTGT ACCTGGCGGC GGCGTCGGCC CGCATCTCGA TCAGTACGAC GTGTTTATCA TTCAGGGTAC CGGACGTCGT CGCTGGCGAG TGGGCGAGAA GCTGCAAATG AAACAGCACT GCCCACATCC GGATCTGTTA CAGGTCGATC CGTTCGAAGC CATCATCGAT GAAGAGCTGG AGCCTGGCGA TATTCTTTAT ATTCCGCCAG GATTCCCGCA TGAAGGCTAC GCGCTGGAAA ATGCGATGAA CTATTCCGTG GGTTTTCGCG CGCCAAATAC GCGGGAATTA ATTAGCGGAT TTGCCGATTA TGTGCTGCAA CGCGAACTGG GCGGCAACTA CTACAGCGAT CCTGATGTTC CACCTCGCGC TCATCCTGCG GACGTTCTGC CGCAAGAGAT GGATAAACTG CGTGAGATGA TGCTCGAATT GATCAACCAG CCGGAACACT TTAAGCAATG GTTTGGCGAG TTTATATCCC AGTCACGTCA TGAACTGGAT ATCGCGCCGC CGGAGCCGCC TTATCAGCCA GATGAAATCT ACGATGCGCT GAAACAAGGT GATGTGCTGG TGCGCCTGGG TGGTCTGCGC GTATTGCGCA TTGGCGACGA CGTGTATACC AATGGTGAGA AGATCGATTC CCCGCACCGT CCGGCACTGG ATGCACTCGC CAGCAACATT GCGCTGACTG CGGAGAATTT TGGCGATGCG CTGGAAGATC CGTCATTCCT CGCGATGCTC GCGGCGCTGG TCAATAGCGG GTATTGGTTC TTCGAAGGGT AA
|
Protein sequence | MEYQLTLNWP DFLERHWQKR PVVLKRGFNN FIDPISPDEL AGLAMESEVD SRLVSHQDGK WQVSHGPFES YDHLGETNWS LLVQAVNHWH EPTAALMRPF RELPDWRIDD LMISFSVPGG GVGPHLDQYD VFIIQGTGRR RWRVGEKLQM KQHCPHPDLL QVDPFEAIID EELEPGDILY IPPGFPHEGY ALENAMNYSV GFRAPNTREL ISGFADYVLQ RELGGNYYSD PDVPPRAHPA DVLPQEMDKL REMMLELINQ PEHFKQWFGE FISQSRHELD IAPPEPPYQP DEIYDALKQG DVLVRLGGLR VLRIGDDVYT NGEKIDSPHR PALDALASNI ALTAENFGDA LEDPSFLAML AALVNSGYWF FEG
|
| |
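The lengths listed under Gene Information can be cross-checked against the coordinates. The sketch below assumes that Start bp and End bp are 1-based and inclusive (an assumption, though it matches the listed Gene Length) and that the CDS is complete and ends in a stop codon, as the gene sequence above does (TAA). The function names are illustrative and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Amino acid count for a complete, unspliced CDS that ends in a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # The final codon is the stop codon and is not translated.
    return cds_length_bp // 3 - 1


length_bp = gene_length(1534309, 1535430)
print(length_bp)                  # 1122
print(protein_length(length_bp))  # 373
```

Both values agree with the Gene Length (1122 bp) and Protein Length (373 aa) fields in the record.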