Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dshi_0685 |
Symbol | apbE2 |
ID | 5711120 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dinoroseobacter shibae DFL 12 |
Kingdom | Bacteria |
Replicon accession | NC_009952 |
Strand | - |
Start bp | 689465 |
End bp | 690361 |
Gene Length | 897 bp |
Protein Length | 298 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641266594 |
Product | ApbE family lipoprotein |
Protein accession | YP_001532032 |
Protein GI | 159043238 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG1477] Membrane-associated lipoprotein involved in thiamine biosynthesis |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.0743531 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 39 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACCTGC ATCGACGTCG GTTTCTCAGC ATCAGCGCGG CGGCCATGGC CGCGGGCCCG GTTGCGGCGG CGGGCGCGGG CTTCACCCGT TGGCGCGGCG AGGCGCTGGG GGCGGAATGC GAGATCACCC TGCACGCCCC CCCGGGCAAG GCACAACCGG CGCTGGACGC GGCCCGCGCC GCCCTGCGCG CGGTGGAGAC GCAATTCAAC CTCTATGATC CCACCTCGGC GCTGGCGCGG CTGAACGCGA CCGGGCGGCT GGAGGCGCCG GAGCCGATGT TCCTCGCGCT GATGGAGTTG TCGCGGCAGA TGTACGAGGC CACGGAAGGA CGCTTCGACC CGAGCATCCA GCCGCTCTGG TCCGCATTGG CCCGGGGCCT CGATCCCGAG ACCGCACAGG CGCAGGTGGG GTGGTCCCGG GTGCGCTGGG ACGCGGGGGC GGTGCGGCTC GCCCCCGGTC AGGCACTCAG TTTCAACGGG GTCGGACAGG GCTTTGCCAC GGACCGTGTG GCCGAGACGC TGGCCGCCCA TGGCATGGGC CGGTTGCTGG TCAATATCGG GGAGTTTCGC GGCGCTGGCG GGCCCTGGCG CGTGGGGCTT TCGGACCCGG CGCATGGGCT CGTCGGCACG CGGCAGATCA CCGATGGGGC GATTGCCACC TCCAGCCCCC GGGCGCTGTC CCTCGGCGGG ACGGCACATA TTCTCGACCC GCTGGGGCGG CGCGTGCCGC GCTGGTCCAC GGTGTCGGTG GAGGCAGAGA CCGCGACGGT GGCGGATGCG CTCTCCACGG CGCTGTGCCT GATGCGGCGT GAAGAGATTG GCGCGGTGCA GGCGCGGGTG CCCGCCCTGC GGCGGGTCAC CCTGATCGAT CATGCGGGCG ATCTGGTCAC CCTGTGA
|
Protein sequence | MNLHRRRFLS ISAAAMAAGP VAAAGAGFTR WRGEALGAEC EITLHAPPGK AQPALDAARA ALRAVETQFN LYDPTSALAR LNATGRLEAP EPMFLALMEL SRQMYEATEG RFDPSIQPLW SALARGLDPE TAQAQVGWSR VRWDAGAVRL APGQALSFNG VGQGFATDRV AETLAAHGMG RLLVNIGEFR GAGGPWRVGL SDPAHGLVGT RQITDGAIAT SSPRALSLGG TAHILDPLGR RVPRWSTVSV EAETATVADA LSTALCLMRR EEIGAVQARV PALRRVTLID HAGDLVTL
|
| |