Gene Hhal_1859 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHhal_1859 
Symbol 
ID4711234 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhodospira halophila SL1 
KingdomBacteria 
Replicon accessionNC_008789 
Strand
Start bp2030657 
End bp2031637 
Gene Length981 bp 
Protein Length326 aa 
Translation table11 
GC content68% 
IMG OID639856331 
Productoxidoreductase domain-containing protein 
Protein accessionYP_001003425 
Protein GI121998638 
COG category[R] General function prediction only 
COG ID[COG0673] Predicted dehydrogenases and related proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCCACCA CTGAAGCCGC CCGGGCCGCC ACCCCGCGAG TTGCCGTCAT TGGCGCCGGC 
GGCTGGGGCC GCAACCTGGT CCGCAACCTC CAAGAGCTCG GCGCGCTCCA CGCGGTGGTC
GAGGTCGACC CGGACAACCT GCGCGAGGTC GAGGCCATCT GCCCGGGCGT GCCCACCTAC
ACCGACACTA CCGAGGCCCT GGCCGACCCG CAGCTGGATG CCGTCGCCAT CGCCACCCCG
GTCATTACCC ACTACCAGGT CGTCCGCCAG GCCCTCGAGG CCGATAAGGA CGTCTTCGTC
GAGAAACCTC TCACCTCGGA CCGCGACCAG GCGTGGCAAC TGGTCGAACT GGCCGAGCAA
CGCGGCCGCC TGCTTATGGT CGGCCACCTC CTGCTGTTCC AGCCCGCCAT CCAGTGGCTG
CGGGACGATC TCGCCGCCGG CCGGATCGGG ACGATCCGCA GCATCCACCA GGAGCGGCTG
GGGCTGGGCC GGGCACGTGA TCACGAAAAT GCCCTGTGGT GCCTGGGCAC CCACGACGTG
GCCGTACAGC GCTTCCTGCT CTCGGGGCGC ACCCCCGATC AGATGCAGGT CAGTGGCCAA
TGCGTCCTGC AGCCGGGCAT CGAGGACGAT GTCTATCTCC ACCTGGGTTA CGGCGGCGGC
CTGCAGAGCC ACCTGCACTG CTCCTGGCTG TGGCCGGAGA AGCGGCGCAA CCTGGTCATC
ATCGGCAGCG AGGGCATGGT GGTCTACGAC GAGATTCATC AGGTGGTCAC CCACCACCGC
AAGGGCATCG ATCCCGGCCT GAACAACGTC GATGACGGCG CGGAGACGAT CTACCAGGGC
CATGGCCAGC CCCTTCGGCT GGAACTGGAG CACTTCCTCG ATTGCCTGCG CCACGGCGAA
CCGTGCCAGT CGGATGGCCG CTTCGCCGCC GGCATTGTCG ACCTGCTGGA CGAAGCCACC
AAGCGCCTGA GAAACGCCTA G
 
Protein sequence
MATTEAARAA TPRVAVIGAG GWGRNLVRNL QELGALHAVV EVDPDNLREV EAICPGVPTY 
TDTTEALADP QLDAVAIATP VITHYQVVRQ ALEADKDVFV EKPLTSDRDQ AWQLVELAEQ
RGRLLMVGHL LLFQPAIQWL RDDLAAGRIG TIRSIHQERL GLGRARDHEN ALWCLGTHDV
AVQRFLLSGR TPDQMQVSGQ CVLQPGIEDD VYLHLGYGGG LQSHLHCSWL WPEKRRNLVI
IGSEGMVVYD EIHQVVTHHR KGIDPGLNNV DDGAETIYQG HGQPLRLELE HFLDCLRHGE
PCQSDGRFAA GIVDLLDEAT KRLRNA