Gene Hhal_1552 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHhal_1552 
Symbol 
ID4709416 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhodospira halophila SL1 
KingdomBacteria 
Replicon accessionNC_008789 
Strand
Start bp1688140 
End bp1689447 
Gene Length1308 bp 
Protein Length435 aa 
Translation table11 
GC content63% 
IMG OID639856016 
Producthypothetical protein 
Protein accessionYP_001003118 
Protein GI121998331 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG1232] Protoporphyrinogen oxidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.113177 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCCGAGA ACACGCAACA CACCCGCCCC CGGGTGGCGA TCATCGGCGG CGGCCCGATG 
GGCCTTGCCG TCGCCTATGA GCTGCTGCTC CGCGGCCACC CTGTGGATAT CTACGAGGCC
GGCCCCGTGC TGGGCGGCAT GTCGGCTGCA TTCGAGTTCT CGGGCACGGC GATCGAACGC
TACTACCATT TTATTTGTGC CACTGATGAG TCGATGTTCG CCCTGCTGCG AGAACTCGAC
CTGGAGGAGG CCCTGCACTG GCAGCCGACC GAGATGGGGT ACTTCATTGA CGGCGAACTG
CACGACTGGG GTAATCCCGT TGCCCTGCTC AAACTGCCGC ACCTCGATCT GATCTCGAAA
TTCCGTTACG GCGTCCACGC GTACCTGAGC ACTCAACGAC GCAACTGGGC ATCCCTGGAT
CGGCTCGAGG CCACACGCTG GCTGCGTCGC TGGGTCGGGG ACCGCGCGTA CGATCTGCTC
TGGCGGAAAC TCTTCGAGTT GAAATTCTAC GAACACACCG ACCGGCTCTC CGCCGCCTGG
ATCTGGACAC GGATCAAACG GGTGGGCACC TCCCGGTACA ACTTGATGCA GGAAAAGCTC
GGTTACTTGG AGGGCGGCTC ACAGACGCTG CTCGATGCAC TGCAAGCGCA GATCGAAGCG
CGGGGCGGCC GTATCCATCT CAACAGGCCG GTCGAGGAGG TGGTGCTACG CGACGGGGCC
GTTCAGGGGC TGCGCATCGC CGGTGAGGCG AGCCCGTACT CCACCGTTTT CAGCACCGTG
CCCCTGCCGC TGCTACCAAG CATGATTCCG GATCTGCCGG CCTCCGTCCG GGAACAAATC
CAGGCGCTGG AGAACCTTGC CGTGGTGTGC GTGATCTTCA AGCTCCGCCG GCCGGTCAGC
GATAAGTTCT GGGTCAATAT CAACGACCAG CGCATGGATA TCCCCGGGAT CGTAGAGTAC
ACCAACCTAC GCCCGATGGA TGCCCATATC GTCTATGTGC CTTACTACAT GCCGGGCGAC
AACCCCAAAT ACCAGGACAA CAACCAGGCT TTCATCGACC GCGCCTGGCG CTACCTGCAG
ATGATCAACC CGACCCTGCA GGAGGCGGAT CGCCTCGACG CCTACGCCAG TCGGTACCGG
TACTCCCAGC CGGTCTGCGA ACCCGGCCAC CTGCAGCGGC TCCCTCCGGT GGACCTGCCC
ATTGCGGGGC TCTACGCGGC GGACACCTCC TACTACTACT ACCCCGAGGA CCGCGGGATC
TCCGAGAGCG TGCGGTACGG CCGCGCCATG GCTCGATTCC TGGAGTAA
 
Protein sequence
MAENTQHTRP RVAIIGGGPM GLAVAYELLL RGHPVDIYEA GPVLGGMSAA FEFSGTAIER 
YYHFICATDE SMFALLRELD LEEALHWQPT EMGYFIDGEL HDWGNPVALL KLPHLDLISK
FRYGVHAYLS TQRRNWASLD RLEATRWLRR WVGDRAYDLL WRKLFELKFY EHTDRLSAAW
IWTRIKRVGT SRYNLMQEKL GYLEGGSQTL LDALQAQIEA RGGRIHLNRP VEEVVLRDGA
VQGLRIAGEA SPYSTVFSTV PLPLLPSMIP DLPASVREQI QALENLAVVC VIFKLRRPVS
DKFWVNINDQ RMDIPGIVEY TNLRPMDAHI VYVPYYMPGD NPKYQDNNQA FIDRAWRYLQ
MINPTLQEAD RLDAYASRYR YSQPVCEPGH LQRLPPVDLP IAGLYAADTS YYYYPEDRGI
SESVRYGRAM ARFLE