Gene Hhal_2194 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHhal_2194 
Symbol 
ID4709208 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhodospira halophila SL1 
KingdomBacteria 
Replicon accessionNC_008789 
Strand
Start bp2405783 
End bp2406772 
Gene Length990 bp 
Protein Length329 aa 
Translation table11 
GC content71% 
IMG OID639856669 
Productalcohol dehydrogenase 
Protein accessionYP_001003760 
Protein GI121998973 
COG category[C] Energy production and conversion
[R] General function prediction only 
COG ID[COG0604] NADPH:quinone reductase and related Zn-dependent oxidoreductases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.482564 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCCCGCG CTGTCCGCAT CCATCAAACG GGTGGCCCCG AGCAACTCCA GGTCGAGGAG 
ATCGACGTGC CCGCCCCCGG TCCGGGTGAG GTCCTGTTGC GCCAGACCGC CTGCGGGGTC
AACTTCATCG ACTGCTACCA CCGCAGCGGG TTGTACCCGC TGCCCCAGCT GCCCCACGGC
ATCGGGGTCG AGGCCTGTGG TGTGGTCGAG GCGGTCGGCG ACGGGGTCCG CGGGGTGCAG
ACCGGGGAAC GGATGGCGTA TGCCACGCCG CCACCCGGCG CCTACGCCGA GGCCCGCGTG
CTGCCGGCGG ATCGCCTGAT CCCGGTCCCC GACGAGGTGG AGGACGCGGT GGCCGGTGGC
ACGACTCTCC GCGGCCTCAC GGCGTGGCTG CTGCTGCACC GTACCTTCCC GGTGCGCCGG
GGGCATACGC TGCTGATCCA CGCCGCCGCG GGCGGGGTGG GGCAGATCCT CTGTCAGTGG
GCGGACTACC TCGGGGCCAC GGTGATCGGC ACCGTGGGCG ACGAGGACAA GGCGCAGGTG
GCCGCCGAGC ACGGCTGTCA CCACCCCATC CTCTACACCC AAGACGACTT CCGCGAGAAG
GTCGAGGCGA TCACCGGGGG ACGCGGTGTG GATGTGGTCT ACGACTCGGT GGGGGCGGCC
ACCTTCGAGG CCTCGCTGGA TTGCCTGCGC CCACTGGGGA TGATGGTCAG TTACGGCAAT
GCCTCGGGGG CTCCGCCTGC CATCGAGCCC GGGCTGCTGG CGCAGAAGGG GTCGCTGTTC
CTGACCCGGC CGGTGCTGTT CCACTACATC GCCGATCGGC ACGAGCTGCT GGCCGGAGCG
TCGGCCTACT ACGATGCCCT TACGCAGGGG GTGATCGAGC CGGCGGTGGG GCGGCGCTAC
CCGCTGGAGA AGGCGGCCCG GGCCCACACG GATCTGGAGG CGCGGGCGAC CACCGGCGCC
ACGGTGCTCG ATCTCGGCGC AGCGGATTGA
 
Protein sequence
MARAVRIHQT GGPEQLQVEE IDVPAPGPGE VLLRQTACGV NFIDCYHRSG LYPLPQLPHG 
IGVEACGVVE AVGDGVRGVQ TGERMAYATP PPGAYAEARV LPADRLIPVP DEVEDAVAGG
TTLRGLTAWL LLHRTFPVRR GHTLLIHAAA GGVGQILCQW ADYLGATVIG TVGDEDKAQV
AAEHGCHHPI LYTQDDFREK VEAITGGRGV DVVYDSVGAA TFEASLDCLR PLGMMVSYGN
ASGAPPAIEP GLLAQKGSLF LTRPVLFHYI ADRHELLAGA SAYYDALTQG VIEPAVGRRY
PLEKAARAHT DLEARATTGA TVLDLGAAD