Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_5727 |
Symbol | |
ID | 6969786 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 5362765 |
End bp | 5363730 |
Gene Length | 966 bp |
Protein Length | 321 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 643389360 |
Product | hypothetical protein |
Protein accession | YP_002273753 |
Protein GI | 209399773 |
COG category | [E] Amino acid transport and metabolism [G] Carbohydrate transport and metabolism [R] General function prediction only |
COG ID | [COG0697] Permeases of the drug/metabolite transporter (DMT) superfamily |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.229908 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 81 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATTAGCG GCGTGCTGTA CGCCCTGTTA GCAGGGTTGA TGTGGGGGCT TATTTTTGTC GGGCCGTTGA TCGTGCCGGA ATACCCGGCG ATGTTGCAGT CGATGGGGCG TTATCTGGCG TTAGGGTTAA TTGCGCTGCC CATTGCCTGG CTGGGACGCG TGCGTCTGCG TCAGTTGGCG CGTCGCGACT GGCTTACCGC CTTGATGCTC ACTATGATGG GCAACCTCAT CTATTACTTC TGCCTTGCCA GTGCCATTCA ACGTACTGGT GCGCCTGTTT CCACGATGAT TATCGGCACC CTGCCGGTGG TGATTCCCGT CTTCGCCAAT CTGCTTTATA GCCAGCGCGA CGGCAAACTC GCGTGGGGAA AACTCGCCCC GGCACTGATT TGTATTGGCA TCGGCCTGGC GTGTGTGAAT ATTGCTGAGT TAAACCACGG ACTCCCCAAT TTTGACTGGG CACGTTATAC CTCTGGCATC GTGCTGGCGT TAGTTTCCGT GGTCTGCTGG GCGTGGTATG CCCTGCGCAA CGCCCGCTGG CTGCGGGAAA ATCCCGACAA ACATCCGATG ATGTGGGCAA CAGCGCAGGC GCTGGTCACA CTGCCGGTTT CACTTATCGG CTATCTCGTC GCTTGTTACT GGCTGAATAC GCAAACGCCG GACTTCTCCC TACCCTTTGG CCCCCGTCCG CTGGTGTTTA TTAGTCTGAT GATTGCGATA GCCGTGCTTT GCTCATGGGT TGGCGCACTC TGCTGGAACG TCGCCAGCCA GCGATTACCG ACAGTGATTC TCGGGCCGCT GATTGTTTTC GAAACGCTGG CAGGTTTGCT GTACACCTTT TTACTCCGCC AGCAAATGCC GCCGCTAATG ACGCTGAGCG GTATCGCGCT GTTAGTGATT GGCGTGGTCA TCGCGGTTAG AGCAAAACCA GAAAAACCTT TAACTGAATC TGTCTCAGAA AGTTGA
|
Protein sequence | MISGVLYALL AGLMWGLIFV GPLIVPEYPA MLQSMGRYLA LGLIALPIAW LGRVRLRQLA RRDWLTALML TMMGNLIYYF CLASAIQRTG APVSTMIIGT LPVVIPVFAN LLYSQRDGKL AWGKLAPALI CIGIGLACVN IAELNHGLPN FDWARYTSGI VLALVSVVCW AWYALRNARW LRENPDKHPM MWATAQALVT LPVSLIGYLV ACYWLNTQTP DFSLPFGPRP LVFISLMIAI AVLCSWVGAL CWNVASQRLP TVILGPLIVF ETLAGLLYTF LLRQQMPPLM TLSGIALLVI GVVIAVRAKP EKPLTESVSE S
|
| |