Gene Hhal_0117 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHhal_0117 
Symbol 
ID4710748 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhodospira halophila SL1 
KingdomBacteria 
Replicon accessionNC_008789 
Strand
Start bp132233 
End bp133363 
Gene Length1131 bp 
Protein Length376 aa 
Translation table11 
GC content69% 
IMG OID639854575 
Producthypothetical protein 
Protein accessionYP_001001713 
Protein GI121996926 
COG category[S] Function unknown 
COG ID[COG3016] Uncharacterized iron-regulated protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCCCACC CCCTGCCGCG CACCACCGTC TTGTTCGCCG GCCTGTTACT CGCCGGGAGC 
GCCCTCGCCA GCCCCTGCGC CGATCCGGGC CAGTGGTACG AACCGGCCGC CGAACGGACC
CTCAACACCC AGGAAGTCCT CGACGCCCTC GACGGCGCTG AGTTCATCCT GCTTGGCGAG
CGCCACGACG ACGCCGCCCA CCACCGCTGG CAACTCCATA CCCTGGCAGC CCTGCAAGGG
CGCGGCGAGC TGGCGGCGAT CGGCTTCGAG ATGTTCCCAC GCAGCAAGCA GGCCCCCCTG
GAGGACTGGC GTGCAGGCAA GCTGACGCGC GAGGCGTTTT TGGAAGCGAG CGAATGGCAG
CGCGTCTGGG GCTACGATGC CGGACTCTAT ATGCCGCTGT TCGATTTCGT TCGCACCCAC
CGCGTACCCG CCCAGGCCCT CAATGTCGAC CGAGCCACCG TGCGGGCGGT CCGTGAGCAG
GGCTTCGACG CCCTGGATGA AGCCGAGCGG GAATCCGTCA GCAAGCCGGC CGAGGCCAGC
GATGGCTATC GGGATCGTCT ACAGCGGGTA TTTCGCCACC ATCCCGGGGC GGAGGACGAC
GATACGGCCG TCGATCGATT TATCGAGGCG CAGACCTTCT GGGATCGGGC CATGGCCGAG
TCGATGGCCG CCGCCTACGA ACAGCACGGT GGGGCGGTGG TCGGTATCGT CGGCCGAGGC
CACGCCGAGT ACGGCGACGG GATCGCCCAC CAGCTCCAGG ACCTCGGCTA CGAACGGGTT
CGCATCCTGC TGCCGCTCGA CCACACCGCC GAGTGCCCGG ACGCCGGGCA GGCGGACTTC
CTCTTCGCCC TCGAGCCGGA GCGCCGCGGA ACCGAGCCGC CACGCCTGGG CATCGCCATG
GGGCACGAGG ACTCGAAAGT GACCATCGTC GACGTCATGG CCGACACCCC GGCCGAGGAG
GCCGGACTGG CCGCCGGTGA CCGCATCCTC AAGGCGGCGG AAACCAAGAT CGAGCACCCG
AGCGACCTGC AACGGATCGT TGGCCGACAG GCGCCGGGCA CCTGGCTGCC GATACGGATC
GAGCGCGGTG GGGATGAACT GGAGAAGGTC GCACGCTTCC CTGCCGAGTG A
 
Protein sequence
MPHPLPRTTV LFAGLLLAGS ALASPCADPG QWYEPAAERT LNTQEVLDAL DGAEFILLGE 
RHDDAAHHRW QLHTLAALQG RGELAAIGFE MFPRSKQAPL EDWRAGKLTR EAFLEASEWQ
RVWGYDAGLY MPLFDFVRTH RVPAQALNVD RATVRAVREQ GFDALDEAER ESVSKPAEAS
DGYRDRLQRV FRHHPGAEDD DTAVDRFIEA QTFWDRAMAE SMAAAYEQHG GAVVGIVGRG
HAEYGDGIAH QLQDLGYERV RILLPLDHTA ECPDAGQADF LFALEPERRG TEPPRLGIAM
GHEDSKVTIV DVMADTPAEE AGLAAGDRIL KAAETKIEHP SDLQRIVGRQ APGTWLPIRI
ERGGDELEKV ARFPAE