Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_2363 |
Symbol | apbE |
ID | 6143331 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 2394199 |
End bp | 2395254 |
Gene Length | 1056 bp |
Protein Length | 351 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641617236 |
Product | thiamine biosynthesis lipoprotein ApbE |
Protein accession | YP_001744408 |
Protein GI | 170681131 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG1477] Membrane-associated lipoprotein involved in thiamine biosynthesis |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.643512 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 53 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAAATAA GCTTTACCCG CGTGGCACTG CTGGCTGCCG CGCTCTTCTT TGTTGGTTGC GATCAAAAAC CACAACCCGC CAAAACCCAC GCTACTGAAG TTACCGTTCT TGAAGGCAAA ACCATGGGTA CCTTCTGGCG TGCCAGCATC CCGGGCATTG ATGCCAAACG CAGTGCCGAA CTTAAAGAAA AGATTCAGAC CCAGCTGGAC GCCGACGATC AGCTGCTTTC GACCTATAAA AAAGATTCCG CGCTGATGCG CTTTAACGAC TCGCAAAGTT TGTCGCCGTG GCCGGTAAGT GAAGCGATGG CCGATATCGT CACCACCTCG CTGCGCATTG GCGCAAGGAC CGATGGTGCG ATGGATATAA CCGTCGGGCC GCTGGTGAAT CTGTGGGGCT TTGGCCCGGA ACAACAGCCG GTTCAAATTC CGAGTCAGGA ACAGATCGAT GCGATGAAAG CCAAAACCGG CTTACAGCAC CTGACGGTGA TTAATCAGTC CCACCAGCAA TATCTGCAAA AAGACCTGCC GGATTTATAT GTCGATCTCT CTACCGTCGG TGAAGGTTAT GCGGCGGATC ATCTGGCACG CTTGATGGAG CAGGAAGGGA TTTCCCGCTA TCTGGTGTCG GTGGGCGGCG CGCTGAACAG CCGTGGTATG AACGGTGAAG GCCAGCCGTG GCGAGTGGCG ATTCAAAAAC CAACCGATAA AGAAAACGCG GTTCAGGCGG TGGTGGATAT CAACGGCCAC GGTATCAGCA CCTCCGGCAG CTATCGCAAC TATTACGAAC TGGACGGCAA ACGTCTTTCT CATGTTATCG ATCCGCAAAC CGGGCGTCCC ATTGAACATA ATCTGGTATC CGTGACGGTG ATTGCCCCGA CGGCGCTGGA AGCCGACGCC TGGGACACAG GTTTGATGGT GCTCGGGCCG GATAAAGCCA AAGAAGTTGT TCGCCGGGAA GGGCTGGCGG TCTATATGAT CACCAAAGAG GGCGATAGCT TTAAAACCTG GATGTCGCCG CAGTTTAAAA GCTTCCTTAT CAGCGAAAAA AATTAA
|
Protein sequence | MEISFTRVAL LAAALFFVGC DQKPQPAKTH ATEVTVLEGK TMGTFWRASI PGIDAKRSAE LKEKIQTQLD ADDQLLSTYK KDSALMRFND SQSLSPWPVS EAMADIVTTS LRIGARTDGA MDITVGPLVN LWGFGPEQQP VQIPSQEQID AMKAKTGLQH LTVINQSHQQ YLQKDLPDLY VDLSTVGEGY AADHLARLME QEGISRYLVS VGGALNSRGM NGEGQPWRVA IQKPTDKENA VQAVVDINGH GISTSGSYRN YYELDGKRLS HVIDPQTGRP IEHNLVSVTV IAPTALEADA WDTGLMVLGP DKAKEVVRRE GLAVYMITKE GDSFKTWMSP QFKSFLISEK N
|
| |