Gene Hhal_2284 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHhal_2284 
Symbol 
ID4709118 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhodospira halophila SL1 
KingdomBacteria 
Replicon accessionNC_008789 
Strand
Start bp2507513 
End bp2508553 
Gene Length1041 bp 
Protein Length346 aa 
Translation table11 
GC content70% 
IMG OID639856760 
Productaldo/keto reductase 
Protein accessionYP_001003850 
Protein GI121999063 
COG category[C] Energy production and conversion 
COG ID[COG0667] Predicted oxidoreductases (related to aryl-alcohol dehydrogenases) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAATACC GCCCACTCGG TCACAGCGAG CTACGCGTCA GCGCCATCTG TCTGGGCACC 
ATGACCTGGG GCGAACAGAA CAGTGAGGCC GAGGCCCACG CCCAGCTCGA CCTCGCCGCC
GAGCACGGGG TGAACTTCAT CGATGCGGCG GAGATGTATC CCGTGCCGCC CCGGGCCGAG
ACCGCCGGGC GCACCGAGGC GTACCTGGGC AACTGGCTGG CCCGCCAGCC ACGCCGCGAG
GATCTGGTCA TCGCCACCAA GATCGCCGGC CCCGGGCTGG ACTCGATCCG CGACGGCCAG
CGCGCCTACA CCCCCGAGCA GCTCCGCGAG GCGGTGGACG GCTCATTGCA GCGGCTGCGG
ACCGACTACA TCGACCTCTA CCAGCTGCAC TGGCCGGAGC GCCCCGCCAA TTACTTCGGG
CGACTCGACT ACCCCTGCCC CGAGGACGAC GGGCGGGAGC ATGAGCGCAT CCGGCGCGCC
CTGGAGGGGC TGGCCGAGCT GGTCGACGCT GGCAAGATCC GCCACATCGG ATTGTCCAAC
GAGACGCCCT GGGGCGCCAT GCGCTTCATC GCCGAGGCCG AGCGCCTCGG TCTGCCGCGC
ATCGTCTCCA TCCAGAACCC GTACAACCTG CTCAACCGCA GCTACGAGGT CGGACTCGCC
GAGGTCAGCC ACCGCGAGGG CTGCGGCTTG CTGGCCTACT CGCCCCTCGG CTTCGGCGTG
CTCAGCGGCA AGTACCTGGA TGGTCAACGC CCGGCCGAGG CCCGCCTGAC CCTCTTCGAG
CGCTTCCAGC GCTACACCGG CGAACGGGGC GTGACCGCCA CCCGGGCCTA CGTCGACCTC
GCCCGGAAGC ACGGCCTCGA CCCGGCACAG ATGGCCATCG CCTTCGCCAC CCAGCGGCCC
TTCTGCACCA GCACGATCAT CGGCGCCACG ACCACCGAGC AGCTGCGCAC CAATATGGAG
GCCGGGGCGC TGGCCCTAGA TGGGGCACTC CTGCAGGAGA TCGACACCCT CCACCAGGCC
AACCCCAACC CCTGCCCATG A
 
Protein sequence
MEYRPLGHSE LRVSAICLGT MTWGEQNSEA EAHAQLDLAA EHGVNFIDAA EMYPVPPRAE 
TAGRTEAYLG NWLARQPRRE DLVIATKIAG PGLDSIRDGQ RAYTPEQLRE AVDGSLQRLR
TDYIDLYQLH WPERPANYFG RLDYPCPEDD GREHERIRRA LEGLAELVDA GKIRHIGLSN
ETPWGAMRFI AEAERLGLPR IVSIQNPYNL LNRSYEVGLA EVSHREGCGL LAYSPLGFGV
LSGKYLDGQR PAEARLTLFE RFQRYTGERG VTATRAYVDL ARKHGLDPAQ MAIAFATQRP
FCTSTIIGAT TTEQLRTNME AGALALDGAL LQEIDTLHQA NPNPCP