Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_4021 |
Symbol | |
ID | 6967326 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 3717172 |
End bp | 3718443 |
Gene Length | 1272 bp |
Protein Length | 423 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 643387788 |
Product | pyridine nucleotide-disulphide oxidoreductase family protein |
Protein accession | YP_002272231 |
Protein GI | 209397050 |
COG category | [C] Energy production and conversion |
COG ID | [COG0644] Dehydrogenases (flavoproteins) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 54 |
Fosmid unclonability p-value | 0.953433 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAAGACG ACTGCGACAT TATTATTATT GGTGCCGGTA TTGCAGGCAC CGCTTGCGCG TTACGCTGCG CGCGAGCGGG TTTATCCGTT TTGTTACTGG AACGCGCTGA AATCCCCGGC AGCAAAAATC TTTCCGGCGG GCGGTTATAT ACCCATGCAC TCGCGGAACT CCTCCCTCAA TTTCATCTGA CCGCGCCTCT TGAACGACGC ATCACTCACG AAAGCCTTTC CCTGTTAACG CCGGATTGCG CAACGACGTT TTCCAGCTTA CAGCCCGGCG GTGAATCCTG GAGTGTATTA CGTGCACGGT TCGATCCGTG GCTGGTTGCC GAAGCCGAAA AAGAAGGTGT CGAATGCATC CCTGGTGCGA CGGTGGATGC ACTGTATGAA GAAAACGGCA GGGTGTGTGG TGTCATTTGT GGTGACGATA TTCTCCGCGC CCGTTATGTG GTGCTGGCAG AAGGTGCCAA CAGCGTCCTG GCTGAACGTC ACGGGTTAGT GACTCGTCCT GCTGGCGAAG CGATGGCGTT GGGGATCAAA GAAGTGCTGT CGCTGGAAAC ATCCGCTATT GAAGAACGTT TTCATCTGGA GAATAACGAA GGCGCAGCGT TGCTGTTCAG CGGCGGAATC TGTGATGACT TACCCGGCGG CGCATTTCTT TATACTAATC AACAAACGCT CTCGTTAGGG ATTGTTTGCC CACTCTCTTC CCTTACGCAA AGTCGTGTTC CGGCAAGCGA GCTGCTTGCT CGCTTTAAAA CGCATCCGGC AGTGCGCCCG CTTATCAAAA ACACGGAATC ACTGGAGTAT GGTGCGCATC TGGTGCCAGA AGGTGGCTTG CACAGTATGC CGGTACAATA CGCCGGTAAC GGCTGGCTGC TGGTGGGCGA TGCGTTGCGC AGTTGCGTCA ATACCGGAAT TTCTGTGCGC GGCATGGATA CGGCGCTAAC TGGCGCGCAG GCGGCGGCAC AAACACTGAT AAGCGCCTGC CAGCACCGCG AGCCGCAAAA TCTGTTTCCG CTTTATCATC ACAACGTAGA GCGCAGCCTG CTGTGGGATG TTCTACAGCG TTATCAGCAT GTTCCGGTGC TTTTGCAACG CCCGGGATGG TACCGTACGT GGCCTGCGTT AATGCAGGAT ATTTCCCGCG ATTTATGGGA TCAGGGTGAT AAACCTGTTC CACCGCTGCG CCAGTTACTC TGGCGTCATT TACGTCGTCA TGGCTTGTGG AATCTGGCGG GCGATGTTAT CAGGAGTGTT CGATGTCTGT AG
|
Protein sequence | MEDDCDIIII GAGIAGTACA LRCARAGLSV LLLERAEIPG SKNLSGGRLY THALAELLPQ FHLTAPLERR ITHESLSLLT PDCATTFSSL QPGGESWSVL RARFDPWLVA EAEKEGVECI PGATVDALYE ENGRVCGVIC GDDILRARYV VLAEGANSVL AERHGLVTRP AGEAMALGIK EVLSLETSAI EERFHLENNE GAALLFSGGI CDDLPGGAFL YTNQQTLSLG IVCPLSSLTQ SRVPASELLA RFKTHPAVRP LIKNTESLEY GAHLVPEGGL HSMPVQYAGN GWLLVGDALR SCVNTGISVR GMDTALTGAQ AAAQTLISAC QHREPQNLFP LYHHNVERSL LWDVLQRYQH VPVLLQRPGW YRTWPALMQD ISRDLWDQGD KPVPPLRQLL WRHLRRHGLW NLAGDVIRSV RCL
|
| |