Gene Hhal_1762 details

Gene Information

Locus tag: Hhal_1762
Symbol:
ID: 4709076
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorhodospira halophila SL1
Kingdom: Bacteria
Replicon accession: NC_008789
Strand:
Start bp: 1936496
End bp: 1937749
Gene Length: 1254 bp
Protein Length: 417 aa
Translation table: 11
GC content: 65%
IMG OID: 639856231
Product: NADH dehydrogenase subunit D
Protein accession: YP_001003328
Protein GI: 121998541
COG category: [C] Energy production and conversion
COG ID: [COG0649] NADH:ubiquinone oxidoreductase 49 kD subunit 7
TIGRFAM ID: [TIGR01962] NADH dehydrogenase I, D subunit
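
The coordinate and length fields in this record can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1936496
end_bp = 1937749
reported_gene_length_bp = 1254
reported_protein_length_aa = 417

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and encodes no residue.
codon_count = gene_length_bp // 3
expected_protein_length_aa = codon_count - 1

print(gene_length_bp == reported_gene_length_bp)
print(expected_protein_length_aa == reported_protein_length_aa)
```

Under these assumptions both comparisons hold for the values listed here: the span covers 1254 bp, which is 418 codons, or 417 residues plus one stop codon.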


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCTGAGT TCCAGAGCTA CACCCTGAAC TTCGGCCCGC AGCACCCGGC CGCCCACGGG 
GTGTTGCGCT TGGTGCTGGA GATGGAGGGG GAGGCGGTGC GCCGTGCCGA CCCCCACATC
GGTTTGCTGC ACCGGGCCAC CGAAAAGCTG GCCGAGTCCA AGCCTTACAA CCAGTCCATC
GGCTACATGG ACCGGCTCGA CTACGTCTCG ATGATGTGCA ACGAGCACGG CTACGTGCGC
GCCATCGAGA AGCTTTTGGG GATCGAGCCG CCGCTGCGGG CGCAGTACAT CCGCACGATG
ATGGACGAGG TCACCCGCAT CCTGAATCAC TTGATGTGGT TGGGCGGGCA CGGCCTCGAC
GTCGGTGCCA TGACCGCGTT CCTGTACACC TTCCGCGAGC GGGAAGACCT CATGGATGTC
TACGAGGCGG TCTCCGGCGC GCGGATGCAC GCGACCTACT ACCGGCCCGG CGGGGTGCAC
CGGGATCTCC CGGATCAGAT GCCGAAGTAC GAGCCCTCGG CCTACCGCAG CGACAAAGAG
CTGCGCGAGA TGAACCGCGC CCGGGAGGGT TCGGTGCTCG ACTTCCTGGA CGATTTTTGC
GAGCGCTTCC CGGCCTGCGT GGATGAGTAC GAGACGCTGC TGACTGAGAA CCGGATCTGG
AAGCAGCGGC TGGTGGACAT CTGCCCGGTC TCCGCCGAGC GCGCCGTGGA GCTCGGGTTC
ACGGGTCCCC TGCTGCGCGG TTCCGGGGTG GCTTGGGACC TGCGCAAAAA GCAACCCTAC
GCTGCCTACG ACCGGGTCGA TTTCGACATC CCGGTCGGCG TCAACGGCGA CTCCTACGAC
CGCTACCTGG TGCGCGTCGA GGAGATGCGC CAGTCGGTGC GCATCATCAA GCAGTGCGTG
GATTGGCTGC GAGCCAACCC GGGTCCGGTG CGTATCGACG ATCCCAAGGT CACGCCGCCG
ACCCGGGAAG AGATGAAGGA CGACATGGAG TCCCTCATCC ATCACTTCAA GCTCTTTACC
GAGGGGTACT GCACGCCCCC CGGCGAGGTG TACGCGGCGG TCGAGGCGCC GAAGGGCGAG
TTCGGGGTCT ACTTGATCTC GGACGGTGCC AACAAGCCGT ACCGGCTCAA GGTTCGGCCG
CCGTGCTATT ACCACTTGGC GGCTACCGAC GAGATGATTC GCGGCTACAT GTTGGCCGAT
GTGGTGACCT TGATCGGCTC GCTGGATGTG GTCTTCGGGG AGGTGGACCG GTGA
 
Protein sequence
MAEFQSYTLN FGPQHPAAHG VLRLVLEMEG EAVRRADPHI GLLHRATEKL AESKPYNQSI 
GYMDRLDYVS MMCNEHGYVR AIEKLLGIEP PLRAQYIRTM MDEVTRILNH LMWLGGHGLD
VGAMTAFLYT FREREDLMDV YEAVSGARMH ATYYRPGGVH RDLPDQMPKY EPSAYRSDKE
LREMNRAREG SVLDFLDDFC ERFPACVDEY ETLLTENRIW KQRLVDICPV SAERAVELGF
TGPLLRGSGV AWDLRKKQPY AAYDRVDFDI PVGVNGDSYD RYLVRVEEMR QSVRIIKQCV
DWLRANPGPV RIDDPKVTPP TREEMKDDME SLIHHFKLFT EGYCTPPGEV YAAVEAPKGE
FGVYLISDGA NKPYRLKVRP PCYYHLAATD EMIRGYMLAD VVTLIGSLDV VFGEVDR
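
The relationship between the gene sequence and the protein sequence can be reproduced with a short script. The sketch below is an illustration under stated assumptions, not the method used to generate this record: it strips the spacing between the 10-character groups, computes GC content, and translates codon by codon until the first stop codon. The helper names (`clean`, `gc_content`, `translate`) are hypothetical. Translation table 11 assigns the same amino acids to codons as the standard code and differs in which codons may act as start codons; since this gene begins with ATG, no special start-codon handling is included.

```python
# Codon table built from the conventional TCAG ordering.
# Table 11 uses the same codon-to-amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove spaces and line breaks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq):
    """Return the fraction of G and C bases in the sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq):
    """Translate in frame 0, stopping at the first stop codon.

    Caveat: an alternative start codon (for example GTG) would need to be
    read as M at position 0; that case is not handled here.
    """
    s = clean(seq)
    protein = []
    for pos in range(0, len(s) - len(s) % 3, 3):
        amino_acid = CODON_TABLE[s[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three 10-base groups of the gene sequence listed in this record.
first_groups = "ATGGCTGAGT TCCAGAGCTA CACCCTGAAC"
print(translate(first_groups))  # MAEFQSYTLN
```

The printed residues match the first group of the protein sequence above. Passing the full gene sequence to `translate` and `gc_content` would allow the same comparison against the complete protein sequence and the reported GC content.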