Gene Hlac_1947 details

Gene Information

Locus tag: Hlac_1947
Symbol:
ID: 7399899
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorubrum lacusprofundi ATCC 49239
Kingdom: Archaea
Replicon accession: NC_012029
Strand:
Start bp: 1947265
End bp: 1948506
Gene Length: 1242 bp
Protein Length: 413 aa
Translation table: 11
GC content: 65%
IMG OID: 643709018
Product: FAD-dependent pyridine nucleotide-disulphide oxidoreductase
Protein accession: YP_002566595
Protein GI: 222480358
COG category: [C] Energy production and conversion
COG ID: [COG1251] NAD(P)H-nitrite reductase
TIGRFAM ID:
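
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is only an illustration: it assumes the start and end coordinates are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 1947265
end_bp = 1948506
protein_length_aa = 413

# Assumption: coordinates are 1-based and inclusive.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codons = gene_length_bp // 3
expected_protein_length = codons - 1

print(gene_length_bp)           # 1242
print(gene_length_bp % 3)       # 0, so the CDS is a whole number of codons
print(expected_protein_length)  # 413
```

Under these assumptions the reported gene length (1242 bp) and protein length (413 aa) agree with the coordinates.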


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value: 0.422697
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCGATT CGTACGTGAT CATCGGTGAT GGTATCGCGG GTGCGTCCGC GGCCGAGACG 
CTCCGCGAGG AAGCGCCCGA CGCCGAGATC ACGGTTCTCA CGGACGAGGG TGAGTCCCTC
TACAACCGGA TCCTGATCAA AGAGTACGCG AAGGGGAAGC TCCCCGAGGC CCCGATCTCG
ATCCATCAGG AATCGTGGTA CGACGATCAC GATGTCGACC TCCGGCTCAA CACGGTCGTC
GTCGACATCG ACATCGAGAA CGACGCGATC CACACCCACG AGGGAGACAC GTTCGAGTAC
GACACCCTCC TGCTCGCGGT CGGCGGCACC CCCCAGCAGC TCCCGGTCGG CAACGCCGAC
GCCGACGGAA TCCACCACTT CTGGACGTTC CAGGACGCCC GCAAGATCAA ACAGAGCGTC
GAGGACGCGG ACCGAGCGGT CATCGTCGGC GCGGGACTCT TAGGCATCGA CTTTGCGGCC
ATCTGCGGCG CGCAGGACGT CGAGGCGAAG TACCTGATGC GCGGCGACTC CTGGTGGCGC
TACGCGCTCT CCGAGGAAGG CGCGGAGATC ATGCACGACG CGATGCGCGA ACGCGGCGTC
GAGCCCGTCT TCGGCTCCGG CGTCGACCAC TTCGAGGTCG ACGAAGACGG TCACGTCGAG
GCCGCGGTCG ACCCGAACGG CGATCGCTAC GAGTGCGACT TCGCCGGCGT CGCCATCGGC
CTGAACTTCA ACACCGAACT GGTCGAGGAC ACGTCGCTGG AGACGGAGAA CGGCATCGTC
GTCGACGAGT TCATGCGCAC CAACGTCGAC AACGTGTTCG CGGCCGGCGA CATCACCACG
TTCAACGACC TGGTCCTCGG CGAGCAGGCG AAGAACGGCT CGTGGGGGTC GGCCAAGCAA
CAGGGAACGA TCGCCGCTCG CAACATGCTC GAGTACGGCA GCGAGGAGTT CGAGTGGGTC
TCCTCGTACT CGATCACTCA CTTCGACTTC CCGTTCCTCT CCTTCGGCCA TCCGACGCTC
GGCGACGACT CTATCGAGGC GACCACCGCC GAGGGCGAGT GGCGCCGCGT GGCTCTCAAA
GACGGGAAAG TCGTCGGCGG CGTGCTCATC GGCGACCTCT CGCCGCAGTC GGCGTTCAAA
CAGCTCATGC GCGAGGGCCG CGACGTGAGC GACCAGCGGG ACCTCCTGAT GGAGCCCGGC
TTCTCCGTCG ACGACCTCGC GGCCGCGACC GAACAGCAGT AG
 
Protein sequence
MSDSYVIIGD GIAGASAAET LREEAPDAEI TVLTDEGESL YNRILIKEYA KGKLPEAPIS 
IHQESWYDDH DVDLRLNTVV VDIDIENDAI HTHEGDTFEY DTLLLAVGGT PQQLPVGNAD
ADGIHHFWTF QDARKIKQSV EDADRAVIVG AGLLGIDFAA ICGAQDVEAK YLMRGDSWWR
YALSEEGAEI MHDAMRERGV EPVFGSGVDH FEVDEDGHVE AAVDPNGDRY ECDFAGVAIG
LNFNTELVED TSLETENGIV VDEFMRTNVD NVFAAGDITT FNDLVLGEQA KNGSWGSAKQ
QGTIAARNML EYGSEEFEWV SSYSITHFDF PFLSFGHPTL GDDSIEATTA EGEWRRVALK
DGKVVGGVLI GDLSPQSAFK QLMREGRDVS DQRDLLMEPG FSVDDLAAAT EQQ
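
As a rough illustration of how the gene and protein sequences above relate, the sketch below translates the first 60 nucleotides of the gene sequence and computes the GC percentage of that fragment. The helper names (`clean`, `gc_content`, `translate`) are hypothetical. The codon table is the standard one, which is assumed to be sufficient here because translation table 11 uses the same amino-acid assignments as the standard code and differs only in permitted start codons.

```python
from itertools import product

# Standard codon table in TCAG order; table 11 assigns the same
# amino acids to codons and differs only in allowed start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the whitespace used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def gc_content(seq):
    """Return the percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence listed above.
fragment = "ATGAGCGATT CGTACGTGAT CATCGGTGAT GGTATCGCGG GTGCGTCCGC GGCCGAGACG"
print(translate(fragment))  # MSDSYVIIGDGIAGASAAET
print(round(gc_content(fragment), 1))
```

The translated fragment matches the first 20 residues of the protein sequence listed above. Passing the full gene sequence to the same functions should give the whole-gene values for comparison with the reported fields.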