Gene Hlac_2018 details


Gene Information

Locus tag: Hlac_2018
Symbol:
ID: 7402037
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorubrum lacusprofundi ATCC 49239
Kingdom: Archaea
Replicon accession: NC_012029
Strand:
Start bp: 2012226
End bp: 2013674
Gene Length: 1449 bp
Protein Length: 482 aa
Translation table: 11
GC content: 68%
IMG OID: 643709089
Product: FO synthase subunit 2
Protein accession: YP_002566666
Protein GI: 222480429
COG category: [H] Coenzyme transport and metabolism; [R] General function prediction only
COG ID: [COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes
TIGRFAM ID: [TIGR00423] radical SAM domain protein, CofH subfamily
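
The coordinates and lengths above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon; the function names are hypothetical and chosen only for illustration.

```python
def feature_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming the final codon is a stop."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


gene_len = feature_length(2012226, 2013674)
print(gene_len)                  # 1449
print(protein_length(gene_len))  # 482
```

Under these assumptions both values agree with the Gene Length and Protein Length fields of this record.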


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value: 0.595846
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAATCCGG CGCGGGACCC CGACGCGACA GACCGCGACG CCGCCGACCT CGACGACGAC 
TTCGGCTTCG CCGAGCCGGC GACCGACCAG TCGTTCGAGA ACGCGCTCGC GAAGGCCCGC
GACGGCGACC GCCTCTCAAT CGACGACGCG ACCGAACTGC TCGCGACCGG CACGGACACC
GAGGGGGTCG ACCCCGTCCG CAAAGAGCGG GTCCTCGAAC TCGCCGACCG CCGGCGCCAT
GAGGAGGTCG GCGACGAGAT CACGTTCGTC GCCAACCTCA ACAACAACGT CACGACCGCC
TGCAACACGG GCTGTCTGTT CTGCAACTTC AAGGACTCGG CGCACGCCTT CGAGGCCGAC
AGCGACGCCG ACCACGTTGG CTTCACGAAG ACGCCCGCCG AGTCCCGCGC GATCGTCGAA
GACGCCCTCG ACATGGGCGT CTACGAGGTG TGCTCGGTGT CCGGGCTCCA CCCCGCGCTC
GCGCTGAACG AGGAGCACCA CGAGATCCTT CAATCCTACG ACGACCCGGA AAGCGAGGTG
AACTACAAAC CGCCCGAGGA GTACGCCACG GATCCGGGCA CCTACGTCGA GCAGATCGAG
GCGATGTCGG TCGGCGGGAT CCACCTGCAC TCGATGACGC CCGAGGAGGC GTACCACGCC
GCCCGCGGCA CCGACTGGGA CTACGAGACG GTCTACCGCG AACTCGCTGC GGCCGGACTC
GACTCCGCTC CCGGCACCGC CGCGGAGATC CTCGTCGACG AGGTCCGCGA CGTGATCTGC
CCCGGGAAGA TCCGCACCGA CGACTGGGTC GCGGCCATGG AGGGCGCGAT GGCGGCCGGC
CTCGATGTCA CCTCAACGAT GATGTACGGT CACGTCGAGA CGGTCGAACA CCGCGCGAAA
CACCTCGGAG TGATCCGCGA TCTACAGGAC CGCACCGGCC GGATTACGGA GTTCGTCCCC
CTCTCCTTTA TCCACCAGAA CACGCCCCTC TACCGCCACG GCGTTGTCGA CTCCGGCCCC
TCTCACGACG AGGACGAACT CGTGGTGGCG GTCGCGCGCC TCTTCTTGGA CAACGTCGAT
CACGTGCAGG CCTCGTGGGT GAAGTCGGGC GACGCGCACG GACTGAAACT GCTCAACTGC
GGCGCCGACG ACTTCATGGG CACCATCCTC TCCGAGGAGA TCACCAAACG CGCCGGCGGC
GAGTACGGGG AGTTCCGCTC GTTCGACGAC TACGTCGACA TGATCACGGC GATCGGCCGC
ACGCCGGTCG AGCGCTCGAC CGACTACCGC ACCCGCCGGC GGATCGATCC CGACGACAAT
CCTCACGGAC CGCGGCTCGG TCCTCGCGCC GACGGCACGC CGATGCTCTC GGACTCGTCG
TCCGGCGGGT CCGGTAAGTC TGGCGCGTCT GGCGCGTCCG ACGGGGAGTC GTCGGGCGCC
GACGACTGA
 
Protein sequence
MNPARDPDAT DRDAADLDDD FGFAEPATDQ SFENALAKAR DGDRLSIDDA TELLATGTDT 
EGVDPVRKER VLELADRRRH EEVGDEITFV ANLNNNVTTA CNTGCLFCNF KDSAHAFEAD
SDADHVGFTK TPAESRAIVE DALDMGVYEV CSVSGLHPAL ALNEEHHEIL QSYDDPESEV
NYKPPEEYAT DPGTYVEQIE AMSVGGIHLH SMTPEEAYHA ARGTDWDYET VYRELAAAGL
DSAPGTAAEI LVDEVRDVIC PGKIRTDDWV AAMEGAMAAG LDVTSTMMYG HVETVEHRAK
HLGVIRDLQD RTGRITEFVP LSFIHQNTPL YRHGVVDSGP SHDEDELVVA VARLFLDNVD
HVQASWVKSG DAHGLKLLNC GADDFMGTIL SEEITKRAGG EYGEFRSFDD YVDMITAIGR
TPVERSTDYR TRRRIDPDDN PHGPRLGPRA DGTPMLSDSS SGGSGKSGAS GASDGESSGA
DD
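
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation routine. This is a sketch only, not the tool that produced this record: the helper names are hypothetical, the codon table is built from the standard NCBI amino-acid ordering (translation table 11 uses the same codon assignments and differs only in the start codons it permits), and the input is assumed to be the space-grouped text shown above.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
# Table 11 shares these assignments; it differs only in allowed start codons.
_AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product("TCAG", repeat=3), _AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks."""
    return "".join(text.split()).upper()


def translate(text: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean_sequence(text)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE.get(dna[i:i + 3], "X")  # X marks an unrecognised codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_content(text: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean_sequence(text)
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases of the gene sequence listed above.
print(translate("ATGAATCCGG CGCGGGACCC CGACGCGACA"))  # MNPARDPDAT
```

Passing the full gene sequence to `translate` and `gc_content` gives values that can be compared against the Protein sequence and GC content fields of the record.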