Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_3352 |
Symbol | apbE |
ID | 6969875 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 3084828 |
End bp | 3085883 |
Gene Length | 1056 bp |
Protein Length | 351 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 643387164 |
Product | thiamine biosynthesis lipoprotein ApbE |
Protein accession | YP_002271627 |
Protein GI | 209399737 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG1477] Membrane-associated lipoprotein involved in thiamine biosynthesis |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.0822228 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 76 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAAATAA GCTTTACCCG CGTGGCACTG CTGGCTGCCG CGCTCTTCTT TGTTGGTTGC GATCAAAAAC CACAACCCGC CAAAACCCAC GCTACTGAAG TTACCGTTCT TGAAGGCAAA ACTATGGGTA CCTTCTGGCG TGCCAGCATC CCGGGCATTG ACGCCAAACG CAGCGCCGAA CTTAAAGAAA AGATTCAGAC CCAGCTGGAC GCCGACGATC AGCTGCTTTC GACCTATAAA AAAGATTCCG CGCTGATGCG CTTTAACGAC TCGCAAAGTT TATCGCCGTG GCCGGTAAGT GAAGCGATGG CCGATATCGT CACCACCTCG CTACGCATTG GCGCGAAGAC CGATGGCGCG ATGGATATAA CCGTCGGGCC GCTGGTGAAT CTGTGGGGTT TCGGTCCGGA ACAACAGCCG GTTCAAATTC CGAGCCAGGA ACAGATCGAC GCGATGAAAG CCAAAACTGG TTTACAGCAC CTTACGGTCA TTAATCAGTC GCATCAGCAA TATCTGCAAA AAGACCTGCC GGATTTATAT GTCGATCTCT CCACCGTCGG CGAAGGTTAT GCGGCGGATC ACCTGGCACG CTTGATGGAG CAGGAAGGGA TTTCCCGCTA TCTGGTGTCG GTGGGCGGCG CGCTGAACAG CCGTGGTATG AACGGTGAAG GCCTGCCGTG GCGGGTAGCG ATTCAAAAAC CAACCGATAA AGAAAACGCG GTTCAGGCCG TGGTGGATAT CAACGGTCAT GGGATTAGCA CCTCTGGCAG CTACCGTAAC TATTACGAAC TGGACGGCAA ACGTCTTTCC CATGTTATCG ATCCGCAAAC CGGGCGTCCC ATCGAACACA ATCTGGTATC CGTGACGGTG ATTGCACCGA CGGCGCTGGA AGCCGACGCC TGGGATACAG GTTTGATGGT ACTCGGGCCG GAGAAAGCCA AAGAAGTTGT TCGCCGGGAA GGGCTGGCGG TTTATATGAT CACCAAAGAA GGCGATAGCT TTAAAACCTG GATGTCACCA CAGTTTAAAA GCTTCCTTGT CAGCGAAAAA AATTAA
|
Protein sequence | MEISFTRVAL LAAALFFVGC DQKPQPAKTH ATEVTVLEGK TMGTFWRASI PGIDAKRSAE LKEKIQTQLD ADDQLLSTYK KDSALMRFND SQSLSPWPVS EAMADIVTTS LRIGAKTDGA MDITVGPLVN LWGFGPEQQP VQIPSQEQID AMKAKTGLQH LTVINQSHQQ YLQKDLPDLY VDLSTVGEGY AADHLARLME QEGISRYLVS VGGALNSRGM NGEGLPWRVA IQKPTDKENA VQAVVDINGH GISTSGSYRN YYELDGKRLS HVIDPQTGRP IEHNLVSVTV IAPTALEADA WDTGLMVLGP EKAKEVVRRE GLAVYMITKE GDSFKTWMSP QFKSFLVSEK N
|
| |