Gene Information |
Locus tag | Dshi_3168 |
Symbol | apbE1 |
ID | 5712224 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dinoroseobacter shibae DFL 12 |
Kingdom | Bacteria |
Replicon accession | NC_009952 |
Strand | + |
Start bp | 3332404 |
End bp | 3333345 |
Gene Length | 942 bp |
Protein Length | 313 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641269095 |
Product | ApbE family lipoprotein |
Protein accession | YP_001534502 |
Protein GI | 159045708 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG1477] Membrane-associated lipoprotein involved in thiamine biosynthesis |
TIGRFAM ID | |
|
|
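The coordinate and length fields above are related by simple arithmetic, which the following minimal sketch checks. It assumes that Start bp and End bp are 1-based, inclusive positions and that the gene length includes the stop codon; the function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based, inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    # Every codon encodes one residue except the final stop codon.
    return cds_length_bp // 3 - 1


# Values taken from the record: Start bp 3332404, End bp 3333345.
length = gene_length_bp(3332404, 3333345)
print(length, protein_length_aa(length))  # prints "942 313"
```

Under these assumptions the result agrees with the Gene Length (942 bp) and Protein Length (313 aa) fields of the record.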
Plasmid Coverage information |
Num covering plasmid clones | 40 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 38 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAAACG GAGTGATCTC AGGGCTGACT CGGCGCCGGT TTCTCGCGCT TTCCGCGGGC GCCTTGTGCG GCGCGAGCAG CGTTGCCGCG GCAAGCCGAC CCGTGACGCG TTGGCAGGGC ACGGCCCTTG GCGCCGATGC CTCCGCGCAA CTCGTGGGCC TGGACCCTGC AAGCGCGTCC GACATCCTGG TCGGGCTGCA GCGGGAGTTG CGGCGGCTTG AAACCCTGTT CAGCCTCTAC CGGCCCGAAT CCCAGGTTTC GCGGCTGAAC CGGGAGGGTC GACTGGATGC GCCCGCGCCC GAGCTTCTGG AGGTTCTGGC CCAATGCGAT GCGTTGCACC GGGGAACCGG GGGGGCGTTC GATCCGACCA CGCAGCCACT TTTCGCGACC TTTGCGCAGG CGGCGGCGGC CGGGCGGACC CCTTCGGGCG ACGAGGTCGC ACGGGCCCGC GCCGCGGTCG GCTGGCACCA TCTGAGGATC GGCACCGACA GCGTAGCGTT CGCAAGGCCC GGCGGGCAGC TGACGCTCAA CGGGATCGCG CAGGGCTATA TCAGCGACCG AATCGCTGCC TGGCTGCGGG ACCGGGGTTT GACCGACATC CTGCTTGAGG CGGGCGAAAT CGTCGCCCTG GGCCACGGGC CGGACGCAAC GCCCTGGCGC TGCCGTATCG CGGATGCCGC CGGCACCTCG CGCCGCGAGC TCCGCTTGCG GGATCGGGCG ATCGCCACAT CCGCCCCGGA TGCCATGACG CTTGCGGGCG CACGCCACAT CTTCGATCCG GCCAGCGGAC GCAGTGCCGA CAGGGGGCGT ATGACTTCCG TGTCGGCCCC GGCCGCGATG CTGGCAGACG GGCTGTCCAC GGCCCTGTGC GTTCTGCCGC CGGAGAGCCA TGCCACCGTC ATCGCCCGGT TTCCCGGTGC GCGTGTCGAG TTTGCCACCT GA
|
Protein sequence | MKNGVISGLT RRRFLALSAG ALCGASSVAA ASRPVTRWQG TALGADASAQ LVGLDPASAS DILVGLQREL RRLETLFSLY RPESQVSRLN REGRLDAPAP ELLEVLAQCD ALHRGTGGAF DPTTQPLFAT FAQAAAAGRT PSGDEVARAR AAVGWHHLRI GTDSVAFARP GGQLTLNGIA QGYISDRIAA WLRDRGLTDI LLEAGEIVAL GHGPDATPWR CRIADAAGTS RRELRLRDRA IATSAPDAMT LAGARHIFDP ASGRSADRGR MTSVSAPAAM LADGLSTALC VLPPESHATV IARFPGARVE FAT
|
| |
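The gene sequence above is printed in space-separated blocks of ten bases, so it has to be joined before any length or composition check. The sketch below shows one way to do that and to compute a GC fraction comparable to the GC content field. All function names are hypothetical, and the start codon set is an assumption covering only the common bacterial start codons, not every initiator listed for translation table 11.

```python
def clean_sequence(text: str) -> str:
    """Remove the whitespace between sequence blocks and upper-case the bases."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("sequence is empty")
    return sum(base in "GC" for base in seq) / len(seq)


def has_start_and_stop(
    seq: str,
    starts=("ATG", "GTG", "TTG"),  # assumption: common bacterial starts only
    stops=("TAA", "TAG", "TGA"),   # stop codons of translation table 11
) -> bool:
    """True if the sequence is a whole number of codons with a start and a stop."""
    return len(seq) % 3 == 0 and seq[:3] in starts and seq[-3:] in stops


# The first two blocks of the Gene sequence field, used only as a short demo.
first_blocks = clean_sequence("ATGAAAAACG GAGTGATCTC")
print(len(first_blocks), first_blocks[:3])
```

To check the whole record, pass the complete Gene sequence field through `clean_sequence` and compare `len(...)` with the Gene Length field and `gc_fraction(...)` with the GC content field.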