Gene Information |
Locus tag | ECH74115_3504 |
Symbol | |
ID | 6971614 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 3249970 |
End bp | 3251595 |
Gene Length | 1626 bp |
Protein Length | 541 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 643387306 |
Product | hypothetical protein |
Protein accession | YP_002271769 |
Protein GI | 209400018 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
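The coordinates and lengths in this record can be cross-checked with a short sketch. It assumes that Start bp and End bp are 1-based and inclusive, and that the gene length includes the terminal stop codon (the gene sequence in this record ends in TGA). The function names are hypothetical and are not part of any database API.

```python
# Values copied from the Gene Information table of this record.
start_bp = 3249970
end_bp = 3251595
reported_gene_length = 1626      # bp
reported_protein_length = 541    # aa


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residue count of a CDS, assuming the last codon is a stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


length = gene_length(start_bp, end_bp)
residues = protein_length(length)

# Compare the derived values with the ones reported in the record.
print("gene length matches:", length == reported_gene_length)
print("protein length matches:", residues == reported_protein_length)
```

The strand value does not affect either calculation, since both depend only on the span of the feature on the replicon.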
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.619572 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 53 |
Fosmid unclonability p-value | 0.985155 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCTATA TCGATATCAC CACGATGCGT GGGATGATGC CGCGCGTTGT GACATCCATG CTGCCCGAGC ATTCCGCTGT ACTGGCGGAG GACTGCCATT TCCGGTTTGG TGTTATTACA CCAGAACGTC AGATATCCGG GGTTGAGAAA ACATTCACAA TTAAGCCAAA AACAATTTTT CATTACCGTG ACGATTTCTG GTTTGCATGG CCGGATGTGG TGGATGTGAT CCGCAGTCCG ATCGCTCAGG ACCCCCACGG GCGTATTTAC TACACTGACG GGCGTTTTCC TAAAGTGACG GATGCGACTA TTGCCACAAA AGGGGACGGG AATCACCCGA CATCATCGTA TCGTCTGGGG ATCCCCGCGC CGACGACAGC TCCTGTCTGT ACTGTTCAGC AGGGCGGTGA TGTTTCTGAC GATAACCCGA ATGATGACGA AACCCGGTTT TATACTGAAA CCTTTGTCTC AGATTATGGT GAAGAAGGTC CGCCTGGTCC GGCGTCTCTG GAGGTAACAC TCCGTACTCC GGGGACTGCG GTACAACTGA CGCTGGCTCC GGTGCCATTG CAGAATGCCA ATATTAAACG TCGCCGGATT TATCGCTCTG CATCAGGTGG AGGAGAAGCG GATTTTTTAC TTGTGGCTGA ACTGGATGCG TCAGTGCTCA GTTACACGGA CAAAATACCG ACGAAAAACC TTGGGCCTTC CCTGGCAACA TGGGATTACC TGCCGCCACC GGAGAATATG ACGGGTCTTT GCCTGATGGC TAATGGTATT GCTGCCGGGT TTGCCGGTAA TGAAGTGATG TTTTCGGAAG CGTATCTGCC GTATGCATGG CCGGAAGTGA ATCGTCACAC GACGGCAGAA GATATTGTGG CTATCTGTCC GCTGGGAACG TCACTGGTGG TGGCGACAAA GGGGGAGCCT TATCTGTTCA GTGGTGTATC GCCTTCCACA ATTTCTGGCT CCAGAATTCC TTCCATGCAG GCATGCCTGA GCCGAAGAAG CATGGTGGCG ATGGAGGGAT TCGTACTGTA TGCCGGGACA AACGGTCTGG TATCTGTTGA TGTAAACGGT AATACAGCAC TGGCAACGGA AAAGATTATT TCACCAGAAC AGTGGCAGAG TCAGTTTAAC CCGATGTCCA TTGTGGCTTA TTCCTGGCGT GGTGACTATA TCGGTTGTTA CACAAAACCG GATGGTAAGC AGGATGTGTT TGTATTCAGT CCGGCGAACA TGGATATCCG TTATCTCAGC ACGCCGTTTG ACTGTGCATG GATTGATCTT GCAAAAGATA TGATGCGCGT GGTGACAGGG GACAAAATGT CAGTGCTTGC CGGGGACTCT CTGCCGTCCA TGATAAGGTG GCATTCAAAA ATTTTTTCAT TACCTGAAAG AACCTCTTTT TCCTGTATCA GAGTGAAATC TCCGGCACCT GAGCGGGTGG GGATCACTGT TATGGCTGAT GATGTTCCTG TGATTCATTT TGCGCCGGGT ACGTTTAAGG GAAGTGTGGT GAGACTTCCG GCAGCAACCG GGCAAAACTG GCAGGTGATG GTATCCGGAT TCGGGCAGGT GGAACGAATA ACCCTGAGTA CATCGATGTC GGAGATGCCG GTATGA
|
Protein sequence | MPYIDITTMR GMMPRVVTSM LPEHSAVLAE DCHFRFGVIT PERQISGVEK TFTIKPKTIF HYRDDFWFAW PDVVDVIRSP IAQDPHGRIY YTDGRFPKVT DATIATKGDG NHPTSSYRLG IPAPTTAPVC TVQQGGDVSD DNPNDDETRF YTETFVSDYG EEGPPGPASL EVTLRTPGTA VQLTLAPVPL QNANIKRRRI YRSASGGGEA DFLLVAELDA SVLSYTDKIP TKNLGPSLAT WDYLPPPENM TGLCLMANGI AAGFAGNEVM FSEAYLPYAW PEVNRHTTAE DIVAICPLGT SLVVATKGEP YLFSGVSPST ISGSRIPSMQ ACLSRRSMVA MEGFVLYAGT NGLVSVDVNG NTALATEKII SPEQWQSQFN PMSIVAYSWR GDYIGCYTKP DGKQDVFVFS PANMDIRYLS TPFDCAWIDL AKDMMRVVTG DKMSVLAGDS LPSMIRWHSK IFSLPERTSF SCIRVKSPAP ERVGITVMAD DVPVIHFAPG TFKGSVVRLP AATGQNWQVM VSGFGQVERI TLSTSMSEMP V