Gene Hhal_1403 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHhal_1403 
Symbol 
ID4711147 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhodospira halophila SL1 
KingdomBacteria 
Replicon accessionNC_008789 
Strand
Start bp1516518 
End bp1517525 
Gene Length1008 bp 
Protein Length335 aa 
Translation table11 
GC content66% 
IMG OID639855870 
ProductLysR family transcriptional regulator 
Protein accessionYP_001002972 
Protein GI121998185 
COG category[K] Transcription 
COG ID[COG0583] Transcriptional regulator 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACTGAAG ATCGAATAGC CTTGGATCAC GAGTCCCCTC GTCATTACTA TCGACACAAC 
CGGCTCAAGC AACTGCGCGC CTTCTGCCAC GCCGCTCAGG AGCGCAGCAT CTCCCGCGCC
GCCGAGCGCC TGGAGCTGAG CCAGCCGTCC GTATCGCTGC AGATCCAGGC CCTCGAGCGC
GAGATGGGGA TCACCCTGTT CGAGCGCCGT GGCCCGCGCA TCCGGCTGAC GCCCGACGGC
GAGGGGCTCT ACGAGGTCGC GGCACCGCTG GTCGAGGGCA TCGACTCGCT GCCCGAGCAG
TTCGTCGCCC GCAACCGTCG ACAACAGCTC GGCCACCTGG ACATCGCCGC CGGCGAGTCG
ACCTCGCTCT ACGTCCTGCC CGATCTGCTG CGCTCGTTCA TGGTGCACCA CCCTAAGGTC
CACGTCCGCC TGCACAACCT CATCGGCGAG GAGCAGATCC ATGCCGTGCA GCAGGATCGG
GTGGATATCG CCGTCGGTTC GAATCTCGAC CTGCCTGACG ATATCGTCTA CCGGGCGACG
CACACCTTCG ACCTGAAGCT GATCACCCCG CTGAACCACC CGCTGGCGGA GAAGGAACAG
ATCACTCTCG AGGACCTGGC CGCCGGCGAA CTGATCCTGC CGCCCCGCCA GCTGACCACC
TGGCGCCTGG TCAACCTGAT CTTTCAGCAG CATAGCGTGC CGTATCAGGT CCGCCTCGAG
GTGGGCGGCT GGGAGATCAT GAAGCGCTAC GTCGAGCTCG GCTTCGGTGT CGGCATCGCC
AGCGCCATCT GCTTGACCGG GCAGGAACAG CTGGCCATTC GCGACCTCCC CGAGATCTTC
CCGCGCCGCA CCTACGGCGT CATGCTGCGC CGCGGGCGCT ACCTCTCCCC GCAGGCCCGC
CGTTTCCTGG AACTGATCGA CCCCCAGGGT TTCGGCCAGG CAGCGGAGTG GGACGGCATC
GCGGGCAGTC AGGAGAGCGT GCTGGTCCCC GGCCGTCAGC CGCAATGA
 
Protein sequence
MTEDRIALDH ESPRHYYRHN RLKQLRAFCH AAQERSISRA AERLELSQPS VSLQIQALER 
EMGITLFERR GPRIRLTPDG EGLYEVAAPL VEGIDSLPEQ FVARNRRQQL GHLDIAAGES
TSLYVLPDLL RSFMVHHPKV HVRLHNLIGE EQIHAVQQDR VDIAVGSNLD LPDDIVYRAT
HTFDLKLITP LNHPLAEKEQ ITLEDLAAGE LILPPRQLTT WRLVNLIFQQ HSVPYQVRLE
VGGWEIMKRY VELGFGVGIA SAICLTGQEQ LAIRDLPEIF PRRTYGVMLR RGRYLSPQAR
RFLELIDPQG FGQAAEWDGI AGSQESVLVP GRQPQ