Gene Hhal_1051 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHhal_1051 
Symbol 
ID4709807 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhodospira halophila SL1 
KingdomBacteria 
Replicon accessionNC_008789 
Strand
Start bp1136276 
End bp1137325 
Gene Length1050 bp 
Protein Length349 aa 
Translation table11 
GC content69% 
IMG OID639855522 
Productdelta-aminolevulinic acid dehydratase 
Protein accessionYP_001002629 
Protein GI121997842 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0113] Delta-aminolevulinic acid dehydratase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00488786 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAGGCGG TCGTGACACG GAGCTCCCAA CCGCAGTTCA ATGGCGGCTA CCCCTTACGG 
CGCCCTCGCC GGATGCGCCG CGACGCCTTC TCCCGCCGGC TCATGCGGGA GACCCGACTC
GGCCCAGAGG ACCTGATCCA GCCGGTCTTC GTCCTCGACG GTGAGGACCG CACCGAGCCG
GTGCCCTCCA TGCCCGGTGT CGAGCGCATG ACCATCGATC GGTTGGTTCA CGAGGCCCGG
GAACTGCACG CACTTGGTAT CCCGCTGATC GCCATCTTCC CGGTCACCCC GGCCGAGGTG
AAGAGCGAGG ATGCGCGGGA GGCCTACAAC CCGAGCGGCA TCGCCCAGCG CGCCGTGCGC
GCGGTCAAGG ACGCGGTCCC CGAGATGGGC GTCATGACCG ACGTCGCCCT GGACCCGTTC
ACCAGCCACG GGCAGGACGG TCTCATCGAC GAGACCGGGT ACGTCATGAA CGAGGAGACC
GTCGAGGTCT TGGTCCGCCA GGCGCTGTCC CACGCCGAGG CCGGTGCCGA CGTGGTCGGC
CCGTCGGACA TGATGGACGG CCGCATAGGC GCCATCCGCA GCGCGCTGGA GTCCCACGAC
CACCGCAACG TGCGCATCCT CTCCTACGCG GCCAAGTACG CCTCCTGCTA CTACGGTCCG
TTCCGCGATG CGGTGGGGTC GTCCGACAAC CTCGGCAGCG GCGTGGCCGG CCCCGGCAAG
GACAGTTACC AGATGGACCC GGGCAACAGC GACGAAGCCC TGCACGAGGT AGCCCTCGAC
CTGCAGGAAG GGGCCGATAT GTTCATGGTC AAGCCGGGCC TGCCCTACCT GGACGTGATC
CGGCGGATCA AGGACGAATT CGGCGTCCCG ACCTTCGCCT ACCAGGTCAG CGGCGAGTAC
TCCATGCTGA AGGCCGCCGC CCAGAACGGC TGGCTCGACG AGCGCGAATG CGTCCTCGAG
GCGCTGATGT CGCTGCGCCG TGCCGGTGCC GACGGCATCC TGACCTACCA CGCGCGGGCG
GCGGCCGAGT GGCTCCGGGA AGAGGGCTGA
 
Protein sequence
MEAVVTRSSQ PQFNGGYPLR RPRRMRRDAF SRRLMRETRL GPEDLIQPVF VLDGEDRTEP 
VPSMPGVERM TIDRLVHEAR ELHALGIPLI AIFPVTPAEV KSEDAREAYN PSGIAQRAVR
AVKDAVPEMG VMTDVALDPF TSHGQDGLID ETGYVMNEET VEVLVRQALS HAEAGADVVG
PSDMMDGRIG AIRSALESHD HRNVRILSYA AKYASCYYGP FRDAVGSSDN LGSGVAGPGK
DSYQMDPGNS DEALHEVALD LQEGADMFMV KPGLPYLDVI RRIKDEFGVP TFAYQVSGEY
SMLKAAAQNG WLDERECVLE ALMSLRRAGA DGILTYHARA AAEWLREEG