Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hhal_0936 |
Symbol | |
ID | 4711517 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhodospira halophila SL1 |
Kingdom | Bacteria |
Replicon accession | NC_008789 |
Strand | + |
Start bp | 1014493 |
End bp | 1015542 |
Gene Length | 1050 bp |
Protein Length | 349 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 639855405 |
Product | ApbE family lipoprotein |
Protein accession | YP_001002514 |
Protein GI | 121997727 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG1477] Membrane-associated lipoprotein involved in thiamine biosynthesis |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.0317624 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGGTGCC TGGCATCCAC CACCCGCGCC CTGGCCCTGG CCGCCCTGGC CACCGCCCTG GGGCTGACCG GCTGCACCGC CGAGCCGGAT TCGACCCGGC TGCAGTTCAT CTCGCTGGGC ACGGAGGTGG AGATCCACAT CCTCGATGCC GGCAGCGGCG ACGCCGAGAC GGCCGCCAAG GCGGCCCGCC AGGAGATCGA TGCCATCAGC GAGGCCTGGG AGCCGACCCG TGGCACGGAA CTCGGACCGC TCAACGAGCG GCTGGCCGCC GGCGAGGGCA TGCAGGTCAG CGAGGAACTG ATCGCTATCC TGGAGCGCGC CCGCGAGATG GAGGCGCGCA CCGGCGGGCG CTTCAGCCCG GCCATCGGCG GACTCACCGA GCTGTGGGGC TTCTCCGCCC AGGAGGGGCC GTTGGAGGAG CCGCCGCCGG CCGAAGAGAT CGAGGCGTGG GTGGAGCGGG CCCCGCGCAT CGCCGACCTG AGCTGGGATG CCGAGCGCCG CGTCACCAGC AGCAACGACG GGGTGCGCAT CGACCTCGGC GGCATCGGCA AGGGCTTTGC CGGCGAGCGC GCCGTCGCCG CCCTGCGCGA GCACGGGGTG CGCACGGCGC TGATCAGCCT CGGCGGCGAC CTGGTGGCCC TGGGGGCTCC GGACGACCGC CCCTGGCGGA TGGGCGTGCG CGACCCGCGC GCCGGCACGG TGCTGGCCGC CGTCGAGGCC CACGCCGACG AGACCGTCTT CACCTCCGGG GACTACGAGC GCACCTTCAC CCACGAGGAC CGCCGCTACC ACCATATCCT CGATCCGACC ACCGGTTACC CGGCGATGGG CAGCCGTTCG ATGACCGTCA TCCACGACGA CCCGGTCCAC GCCGACGCCG CGGCGACGGC CCTGTTCATC GCCGGCCCGG ACGACTGGCA GGCCCTGGCC GAGGAGCTGG AGATCGGCTA CGCGCTGCTC GTCGACCGCG ACGGCGCCGT CTGGATGACC GAGGCCATGG CCGAGCGGGT CGAGCTCCAG GGTGAACCGG AGGCGGTCCA CATCGAGTGA
|
Protein sequence | MRCLASTTRA LALAALATAL GLTGCTAEPD STRLQFISLG TEVEIHILDA GSGDAETAAK AARQEIDAIS EAWEPTRGTE LGPLNERLAA GEGMQVSEEL IAILERAREM EARTGGRFSP AIGGLTELWG FSAQEGPLEE PPPAEEIEAW VERAPRIADL SWDAERRVTS SNDGVRIDLG GIGKGFAGER AVAALREHGV RTALISLGGD LVALGAPDDR PWRMGVRDPR AGTVLAAVEA HADETVFTSG DYERTFTHED RRYHHILDPT TGYPAMGSRS MTVIHDDPVH ADAAATALFI AGPDDWQALA EELEIGYALL VDRDGAVWMT EAMAERVELQ GEPEAVHIE
|
| |