Gene Hhal_0936 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHhal_0936 
Symbol 
ID4711517 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhodospira halophila SL1 
KingdomBacteria 
Replicon accessionNC_008789 
Strand
Start bp1014493 
End bp1015542 
Gene Length1050 bp 
Protein Length349 aa 
Translation table11 
GC content73% 
IMG OID639855405 
ProductApbE family lipoprotein 
Protein accessionYP_001002514 
Protein GI121997727 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG1477] Membrane-associated lipoprotein involved in thiamine biosynthesis 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.0317624 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGGTGCC TGGCATCCAC CACCCGCGCC CTGGCCCTGG CCGCCCTGGC CACCGCCCTG 
GGGCTGACCG GCTGCACCGC CGAGCCGGAT TCGACCCGGC TGCAGTTCAT CTCGCTGGGC
ACGGAGGTGG AGATCCACAT CCTCGATGCC GGCAGCGGCG ACGCCGAGAC GGCCGCCAAG
GCGGCCCGCC AGGAGATCGA TGCCATCAGC GAGGCCTGGG AGCCGACCCG TGGCACGGAA
CTCGGACCGC TCAACGAGCG GCTGGCCGCC GGCGAGGGCA TGCAGGTCAG CGAGGAACTG
ATCGCTATCC TGGAGCGCGC CCGCGAGATG GAGGCGCGCA CCGGCGGGCG CTTCAGCCCG
GCCATCGGCG GACTCACCGA GCTGTGGGGC TTCTCCGCCC AGGAGGGGCC GTTGGAGGAG
CCGCCGCCGG CCGAAGAGAT CGAGGCGTGG GTGGAGCGGG CCCCGCGCAT CGCCGACCTG
AGCTGGGATG CCGAGCGCCG CGTCACCAGC AGCAACGACG GGGTGCGCAT CGACCTCGGC
GGCATCGGCA AGGGCTTTGC CGGCGAGCGC GCCGTCGCCG CCCTGCGCGA GCACGGGGTG
CGCACGGCGC TGATCAGCCT CGGCGGCGAC CTGGTGGCCC TGGGGGCTCC GGACGACCGC
CCCTGGCGGA TGGGCGTGCG CGACCCGCGC GCCGGCACGG TGCTGGCCGC CGTCGAGGCC
CACGCCGACG AGACCGTCTT CACCTCCGGG GACTACGAGC GCACCTTCAC CCACGAGGAC
CGCCGCTACC ACCATATCCT CGATCCGACC ACCGGTTACC CGGCGATGGG CAGCCGTTCG
ATGACCGTCA TCCACGACGA CCCGGTCCAC GCCGACGCCG CGGCGACGGC CCTGTTCATC
GCCGGCCCGG ACGACTGGCA GGCCCTGGCC GAGGAGCTGG AGATCGGCTA CGCGCTGCTC
GTCGACCGCG ACGGCGCCGT CTGGATGACC GAGGCCATGG CCGAGCGGGT CGAGCTCCAG
GGTGAACCGG AGGCGGTCCA CATCGAGTGA
 
Protein sequence
MRCLASTTRA LALAALATAL GLTGCTAEPD STRLQFISLG TEVEIHILDA GSGDAETAAK 
AARQEIDAIS EAWEPTRGTE LGPLNERLAA GEGMQVSEEL IAILERAREM EARTGGRFSP
AIGGLTELWG FSAQEGPLEE PPPAEEIEAW VERAPRIADL SWDAERRVTS SNDGVRIDLG
GIGKGFAGER AVAALREHGV RTALISLGGD LVALGAPDDR PWRMGVRDPR AGTVLAAVEA
HADETVFTSG DYERTFTHED RRYHHILDPT TGYPAMGSRS MTVIHDDPVH ADAAATALFI
AGPDDWQALA EELEIGYALL VDRDGAVWMT EAMAERVELQ GEPEAVHIE