Gene Hlac_0937 details


Gene Information

Locus tag: Hlac_0937
Symbol:
ID: 7401309
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorubrum lacusprofundi ATCC 49239
Kingdom: Archaea
Replicon accession: NC_012029
Strand:
Start bp: 936056
End bp: 937237
Gene Length: 1182 bp
Protein Length: 393 aa
Translation table: 11
GC content: 67%
IMG OID: 643708003
Product: FAD-dependent pyridine nucleotide-disulphide oxidoreductase
Protein accession: YP_002565605
Protein GI: 222479368
COG category: [C] Energy production and conversion
COG ID: [COG1252] NADH dehydrogenase, FAD-containing subunit
TIGRFAM ID:
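
The length fields above can be cross-checked against the coordinates. The sketch below assumes 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length; neither convention is stated by the record itself, and the function names are illustrative only.

```python
# Coordinates copied from the record above.
START_BP = 936056
END_BP = 937237


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Length in aa, assuming one terminal stop codon that is not translated."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    n_bp = gene_length(START_BP, END_BP)
    print(n_bp, "bp")
    print(protein_length(n_bp), "aa")
```

Under these assumptions the coordinates give 1182 bp and 393 aa, matching the Gene Length and Protein Length fields.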


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.521106
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value: 0.558162
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTACAC AGGTCGTCGT CGTCGGTTCC GGGTACGCCG GTGCAGGGGC AGTGAAGGCG 
TTCGAAGACG AGGTTGGCGA GGGTGAGGCC GATCTCACGT GGATCTCGGA GCACGACTAC
CACCTCGTTC TCCACGAGGT CCACCGCGCG ATCCGCAACC CCGCGGTCGC CGAGAAGATC
ACCATCCCGG TCGACGAGAT CAAATCCCCC GAGTCCAACT TCGTACGGGG CCGCGTCGTC
GACGTCGACA CCAACGAGCA GGTCGTCGAG ACCGACGACG GGACGACCGT CGACTACGAC
TACCTCCTGT TAGGCGTCGG CTCCACGACC GCTTTCTTTG GGATCGAGGG ACTCGAAGAG
AACGCCCACC AACTCAAGAG CCTCGCCGAC GCGAAGGCGA TCCACGAGGA CGTCCGGTCG
GCTGCCGCCG AGGCGACCCG CTCGGACCCC GTCGAGGTCA TCGTCGGCGG CGCCGGGCTC
TCCGGCATCC AGACCGCGGG CGAAATCGCG GAGTACCGCG ATAAACACCG CGCGCCCGTC
GATATCAAGC TCGTCGAGGG ACTCGACGAG GTCTTCCCGG GCAACGATCC CCAGCTTCAG
GGCGCGCTCC GCCAGCGGCT CGAAGACGCC GACATCGAAA TACTCACCGG CGACTTCATC
TCGAAGGCCG ATGAGGACGC GGTGTACTTC GGCGGCGGCG AAGACGAGGA GCCCGAGGAA
CTCTCGTACG ACGTGCTGAT CTGGACCGGC GGCATCACCG GCCAGCCGGA GCTTGAGAAC
GTCGAGGTCG AGAAGGACGA GCGTTCGAAC CGCGTCCACG CCGGCTCCGA CTTCACCACC
AGCGACGACC GCGTGTTCGC TATCGGCGAC ACCGCGCTCG TCGAGCAGGG CGACGACGAC
GTGGCGCCGC CGACGGCACA GGCCGCCTGG CAGGCCGCCG AGGTCGCCGG CGAGAACCTC
GCGCGCGCCG CCCGCGGCGC GCCCCTCCAG TCGTGGGAAC ACACGGACAA GGGGACCGTC
GTCTCGGTCG GCGAGGACGC GGTCGCCCAC GACGTGATGG GGATGCCGAT CAAGACGTTC
GGCGGCACCC CCGCGAAGCT GCTGAAGAAG TCCATCGCGG TGCGATGGAT CGCGAAGATC
TCCTCGACCG GGCGCGGCGT GAGCGCGTTC GGCGATATGT AG
 
Protein sequence
MSTQVVVVGS GYAGAGAVKA FEDEVGEGEA DLTWISEHDY HLVLHEVHRA IRNPAVAEKI 
TIPVDEIKSP ESNFVRGRVV DVDTNEQVVE TDDGTTVDYD YLLLGVGSTT AFFGIEGLEE
NAHQLKSLAD AKAIHEDVRS AAAEATRSDP VEVIVGGAGL SGIQTAGEIA EYRDKHRAPV
DIKLVEGLDE VFPGNDPQLQ GALRQRLEDA DIEILTGDFI SKADEDAVYF GGGEDEEPEE
LSYDVLIWTG GITGQPELEN VEVEKDERSN RVHAGSDFTT SDDRVFAIGD TALVEQGDDD
VAPPTAQAAW QAAEVAGENL ARAARGAPLQ SWEHTDKGTV VSVGEDAVAH DVMGMPIKTF
GGTPAKLLKK SIAVRWIAKI SSTGRGVSAF GDM
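
The two sequences above can be related with a short script. This is a minimal sketch, not the database's own method: it strips the 10-base block spacing, computes the G+C fraction, and translates with the amino-acid assignments of NCBI translation table 11 (which are the same as the standard code; table 11 differs only in its permitted start codons, which this sketch does not model). The helper names `clean`, `gc_fraction` and `translate` are hypothetical.

```python
# Codon table built in NCBI order (bases T, C, A, G at each position).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display, and upper-case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":  # stop codon: end of the protein
            break
        protein.append(residue)
    return "".join(protein)


if __name__ == "__main__":
    # First and last display blocks of the gene sequence above.
    print(translate("ATGAGTACAC AGGTCGTCGT CGTCGGTTCC"))
    print(translate("GGCGATATGT AG"))
```

Passing the full gene sequence to `translate` and `gc_fraction` is one way to compare against the Protein sequence and GC content fields; the two fragments used here correspond to the first ten and last three residues of the listed protein.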