Gene Hlac_1165 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_1165 
Symbol 
ID7400974 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012029 
Strand
Start bp1171472 
End bp1173571 
Gene Length2100 bp 
Protein Length699 aa 
Translation table11 
GC content67% 
IMG OID643708230 
Productnitrous-oxide reductase 
Protein accessionYP_002565829 
Protein GI222479592 
COG category[C] Energy production and conversion 
COG ID[COG4263] Nitrous oxide reductase 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.804263 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.124168 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTACAG ATCCGAACGC GAGCGACGAG CGGACGGCAC GGACCGACGA ACAACGCGAC 
GAGCAGTCGG CGAACGCGAA TGCGGGCGAC GCGAAGACAG CGGGATCGGC CGCGGACGGC
ACCGAGACGG CGGACGACGG TCCGGAGATC GTCGACGGCC GGACGATCGA ACACTTGTTG
ACCGATCACG AACGCGACAT CGCCGCCACC GCGGAGGGGA CCGAGGCCAC CGCGGCGAAG
CGGCCGAGCC TCGATCTGCC GCGGTTGGAG CTCGGACGCC GCGACTTCAT GAAGGCCGGC
GCCGCCGTCG GGGCCATGAG CGGTCTCGCC GGTTGTACGA GCATCCTCGC CGACGAGGAG
AGCAACGACT CGGGCGACGC GGCCGCCAGC GGCGAGCACT CGGTGGCTCC GGGCGAGCAC
GACGAGTACT ACGGGTTCCT CTCCGGCGGC CACACCGGCG AGATCCGCGT CGTCGGGCTC
CCGTCGATGC GGGAGCTGAT GCGGATCCCG GTGTTCCAAG CGGAGAGCGC CCGCGGCTTC
GGCCACGACG CGAAGTCGAG CCAGATGCTG GAAGAGGAGG GCGGCGGCTA CACGTGGGGC
GACACCCACC ACCCGCGCGT CTCCCAGACC GACAACGACT ACGACGGCCG CTGGCTGTTC
GTCAACGATA AGGCGAACGG GCGGATGGCC CGGGTCGACT TGGAGTACTT CGAGACCGAC
GCGATCACGA ACGTCCCGAA CTGTCAGGGC GTCCACGGGG CATGCATGCA GCTACCTGAC
ACGCAGCTGG TCTTCGGCGT CGGCGAGTTC CGCGTCCCGA TGCCGAACGA CGGCCGCGAC
GTGCAGAGCC CCGACGAGTA CGGCTCCGTC CTCAGCGCCA TCAACCCCGA GACGATGGAC
GTCGAGTGGC AAGTCGAGGT CGACGGCAAC ATGGACAACG GGGACGGCGG GAAGGAGGGG
CGCTGGTTCT TCGCGACCGG CTACAACAGC GAGGAGGGCG TCACGGAGTC CGAGATGGCC
CGGTCGGACC GCGACTACGT GAAGGCGTTC GACATCCCCG CGATCTGGGA CGCCGTCGAG
GCCGGGAACT ACGAGGAGAT CAGCGGCGTG CCCGTCGTCG ACGGGACGAT GGACAGCTCG
CTCAACGAGG GCGACCGTCC GATCGTCCGG TACGTCGCCA CACCGAAGAG CCCGCACGGG
ATCAGCGTGA CGCCCGACGG GAACTACGCC ATCGCCTCCG GCAAGCTCGA TCCGAGCTGT
ACGGTCATCG ACATCGACGC GATCGCCGAG GTCGACGACC CGCAGGAGTC CATCGTCGGG
CAGCCCCGGA TCGGGATGGG ACCGCTCCAC ACCGCCTACG ACGGGCGGGG GCACGCGTAC
ACGACCCTGT TCATCGACTC GCAGGTTGCC AAGTGGGACT ACGAGGCCGC CGTCGAGGCC
GAGGCCGGCT CCGAGGAGCC GGTGATCGAG AAGCACGACG TACACTACAA TCCCGGCCAC
CTCATCGCCT CGGAGTCGTA CACGGCGTCG CCGCAGGGCG ACTACCTCGT CTCGCTGAAC
AAGCTCTCGA AAGACCGGTT CCTCCCCGTC GGACCGATCC ATCCGGAGAA CGATCAGCTG
TTCCACATCG GCGACGACGA CGCCGGGATG GAGCTGATCA AGGACAAGCC GTCGTACCCC
GAGCCGCACG ACGCCAGTAT CGTCCACAAG GACAAGATCG AGCCGGCGAA GACGTGGGAC
ACCGACGACT ACGAGATCGA CTACGTCGCC GAGGGCGAGG AGTCCTTCGA ACGGGTCGAC
GACAGCACCG TCGAGATCGA GATGTGGGCG CGCCGCAACG AGTACGCCTT CCCGGAGGTC
ACGCTGCAGG AGGGCGACGA GGTGACGCTC CGGATCTCCA ACGTCGAGAC CACGAGCGAC
GTGATCCACT CGATCGCGAT CCCGGAGTAC GACATCAACC TCGCGTTAGC GCCCCAGGAC
ACCCGTGAGG TGACGTTCAC CGCGGACCAG CCCGGCGTGT TCTGGGCGTA CTGCGCGTAC
TTCTGTAGCG CGCTCCACCT AGAGATGCGC TCGCGGATCC TCGTCGAACC GGACGAATAG
 
Protein sequence
MTTDPNASDE RTARTDEQRD EQSANANAGD AKTAGSAADG TETADDGPEI VDGRTIEHLL 
TDHERDIAAT AEGTEATAAK RPSLDLPRLE LGRRDFMKAG AAVGAMSGLA GCTSILADEE
SNDSGDAAAS GEHSVAPGEH DEYYGFLSGG HTGEIRVVGL PSMRELMRIP VFQAESARGF
GHDAKSSQML EEEGGGYTWG DTHHPRVSQT DNDYDGRWLF VNDKANGRMA RVDLEYFETD
AITNVPNCQG VHGACMQLPD TQLVFGVGEF RVPMPNDGRD VQSPDEYGSV LSAINPETMD
VEWQVEVDGN MDNGDGGKEG RWFFATGYNS EEGVTESEMA RSDRDYVKAF DIPAIWDAVE
AGNYEEISGV PVVDGTMDSS LNEGDRPIVR YVATPKSPHG ISVTPDGNYA IASGKLDPSC
TVIDIDAIAE VDDPQESIVG QPRIGMGPLH TAYDGRGHAY TTLFIDSQVA KWDYEAAVEA
EAGSEEPVIE KHDVHYNPGH LIASESYTAS PQGDYLVSLN KLSKDRFLPV GPIHPENDQL
FHIGDDDAGM ELIKDKPSYP EPHDASIVHK DKIEPAKTWD TDDYEIDYVA EGEESFERVD
DSTVEIEMWA RRNEYAFPEV TLQEGDEVTL RISNVETTSD VIHSIAIPEY DINLALAPQD
TREVTFTADQ PGVFWAYCAY FCSALHLEMR SRILVEPDE