Gene Information |
Locus tag | ECH74115_2416 |
Symbol | |
ID | 6970586 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 2287404 |
End bp | 2288693 |
Gene Length | 1290 bp |
Protein Length | 429 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 643386287 |
Product | hypothetical protein |
Protein accession | YP_002270769 |
Protein GI | 209396032 |
COG category | [C] Energy production and conversion |
COG ID | [COG0644] Dehydrogenases (flavoproteins) |
TIGRFAM ID | |
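The coordinate and length fields above are consistent with each other, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. Both assumptions, and the function names, are illustrative and are not stated by the record itself.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length for an unspliced CDS whose last codon is a stop (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


# Values taken from the Start bp / End bp rows of the record.
length = gene_length_bp(2287404, 2288693)
print(length, protein_length_aa(length))
```

Under these assumptions the results match the Gene Length (1290 bp) and Protein Length (429 aa) rows.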
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 62 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCGGATG ACAAATTTGA TGCCATTGTG GTCGGTGCGG GCGTTGCTGG TAGCGTTGCC GCACTGGTCA TGGCGCGAGC CGGGCTGGAT ATCCTGGTGA TAGAACGCGG CGACAGTGCC GGATGTAAAA ACATGACCGG CGGGCGTCTT TATGCCCACA CACTTGAAGC AATCATTCCA GGCTTTGCAG CATCAGCGCC GGTAGAACGC AAGGTCACAC GCGAGAAAAT CTCCTTCTTA ACCGAAGAGA GCGCCGTTAC CCTCGATTTT CACCGCGAGC AACCAGATGT TCCGCAACAC GCATCTTATA CCGTATTGCG TAATCGTCTG GACCCGTGGT TGATGGAACA AGCCGAGCAG GCGGGCGCAC AGTTTATCCC GGGAGTTCGC GTCGATGCGT TGGTTCGTGA AGGAAACAAG GTCACTGGAG TCCAGGCCGG GGATGATATT CTCGAAGCCA ATGTGGTGAT TTTAGCCGAT GGCGTTAACT CGATGCTTGG CCGCTCGCTG GGAATGGTTC CCGCTTCCGA TCCGCATCAT TACGCTGTTG GTGTTAAAGA GGTTATTGGC CTCACACCAG AACAGATCAA CGATCGCTTT AATATTACGG GCGAGGAAGG TGCCGCCTGG CTGTTTGCCG GTTCCCCTTC TGACGGCCTG ATGGGCGGGG GATTTCTCTA TACCAACAAG GATTCCATAT CCTTGGGGCT GGTTTGTGGA TTGGCTGATA TCGCCCATGC GCAAAAAAGC GTGCCGCAAA TGCTGGAAGA TTTTAAACAA CACCCCGCCA TTCGCCCGCT GATTAGCGGC GGCAAACTGC TTGAATATTC CGCGCATATG GTGCCAGAAG GCGGTCTGGC GATGGTACCG CAACTGGTTA ACGATGGCGT GATGATCGTT GGTGACGCCG CAGGCTTCTG CCTGAATTTG GGTTTTACGG TCCGCGGCAT GGATTTAGCC ATTGCATCGG CTCAGGCTGC CGCCACAACA GTGATCGCCG CCAAAGAACG CGCGGATTTC TCCGCCAGCA GTCTGGCGCA ATACAAACGT GAGCTGGAAC AAAGCTGCGT CATGCGTGAT ATGCAGCATT TTCGCAAGAT CCCGGCGCTG ATGGAAAACC CGCGCCTGTT TAGCCAATAC CCACGAATGG TAGCCGACAT CATGAACGAG ATGTTCACCA TTGACGGTAA GCCTAACCAG CCGGTACGCA AAATGATCAT GGGACACGCG AAGAAAATTG GGCTGATCAA CTTGCTGAAA GATGGCATTA AGGGAGCAAC CGCGCTATGA
|
Protein sequence | MSDDKFDAIV VGAGVAGSVA ALVMARAGLD ILVIERGDSA GCKNMTGGRL YAHTLEAIIP GFAASAPVER KVTREKISFL TEESAVTLDF HREQPDVPQH ASYTVLRNRL DPWLMEQAEQ AGAQFIPGVR VDALVREGNK VTGVQAGDDI LEANVVILAD GVNSMLGRSL GMVPASDPHH YAVGVKEVIG LTPEQINDRF NITGEEGAAW LFAGSPSDGL MGGGFLYTNK DSISLGLVCG LADIAHAQKS VPQMLEDFKQ HPAIRPLISG GKLLEYSAHM VPEGGLAMVP QLVNDGVMIV GDAAGFCLNL GFTVRGMDLA IASAQAAATT VIAAKERADF SASSLAQYKR ELEQSCVMRD MQHFRKIPAL MENPRLFSQY PRMVADIMNE MFTIDGKPNQ PVRKMIMGHA KKIGLINLLK DGIKGATAL
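The sequences above are displayed in space-separated blocks of ten characters. The sketch below shows one way to strip that formatting, compute the GC fraction, and translate a nucleotide string. The helper names are hypothetical. The codon table is built from the amino acid assignments of the standard code, which translation table 11 shares for internal codons; the sketch ignores the alternative start codons that table 11 allows, which is harmless here because this gene starts with ATG.

```python
BASES = "TCAG"
# Amino acids for all 64 codons, ordered by first, second, third base in TCAG order.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(raw: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(raw.split()).upper()


def gc_fraction(raw: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean_sequence(raw)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(raw: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean_sequence(raw)
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE.get(seq[pos:pos + 3], "X")  # X for ambiguous codons
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 21 nucleotides of the gene sequence shown in the record.
print(translate("ATGTCGGATG ACAAATTTGA T"))
```

Passing the full Gene sequence value to `translate` and `gc_fraction` would allow a comparison against the Protein sequence and GC content rows.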