Gene Hlac_0624 details

Gene Information

Locus tag: Hlac_0624
Symbol:
ID: 7401759
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorubrum lacusprofundi ATCC 49239
Kingdom: Archaea
Replicon accession: NC_012029
Strand:
Start bp: 641922
End bp: 643574
Gene Length: 1653 bp
Protein Length: 550 aa
Translation table: 11
GC content: 70%
IMG OID: 643707690
Product: hypothetical protein
Protein accession: YP_002565296
Protein GI: 222479059
COG category: [S] Function unknown
COG ID: [COG3390] Uncharacterized protein conserved in archaea
TIGRFAM ID:
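
The coordinate and length fields above agree with each other if the start and end positions are read as 1-based and inclusive, and if the last codon of the CDS is a stop codon. Both points are assumptions about the record's conventions, not something the page states. A minimal Python sketch of that check follows; the function names are hypothetical and chosen only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the record above.
print(gene_length(641922, 643574))  # 1653
print(protein_length(1653))         # 550
```

Under these assumptions, 1653 bp corresponds to 551 codons, that is 550 residues plus one stop codon, which matches the listed protein length.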


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.802185
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 30
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCGACG ACGGACCGGG CGCCCGCGAG GTCGCTCACC GCGTGTTCGC CGCCGAGTTC 
GACGACGCCT CCCTCTCGTA CTCCGAGAGC GACGAGGAGC GCGCCCCGAA CTACGTCGTC
ACCCCGACCG GCGCGCGCGT GAACCGGCTG TTCGTCGCGG GCGTGCTCAC CGAGGTCGAG
CGCGTGAACG ACGAGACGCG TCGCGGGCGG GTCGTCGACC CATCCGGCGC GTTCGTCACC
TACGCCGGCC AGTACCAGCC CGAGGCGCAG ACGTTTCTCG AACGCGCCGA GCCGCCGGCG
TTCGTCGCGC TCACGGGCAA GGCACGCACC TTCGAGCCGG AGGACTCAGA TCGGGTGTTC
ACTTCGGTTC GCCCCGAGAG CCTCAACGCG GTCGACGCCG ACACCCGCGA TCGGTGGGTC
GTCTCCGCCG CGGAGGCCAC TCTCGACCGA CTCGCCGTCT TCGCGAAGGC CCTCGACTCG
GAGCTCCGCG GCGAGGAACT CCGCGTAGCC CTCGAAACGG GCGGCGCACC GGCGGCGCTG
GCGGCCGGGA TCCCGAAAGC CATCGCGCAC TACGACACCT CGACCGCCTA CGTCGAGGCG
CTCCGGCGGC TCGCCGTCGA CGCCCTGAAG CTGATCGCCG GCGACCGCGA CGAGGTGCGC
TCACCCGATA TCGCCCCCGA TGCGGGCGGT GAGGCCGCCA TCGGCGCGCT CCCGGAAACG
GACGTGACGA TCGAGGCGCC GGCGGAACCG GTCGCCGACG ATGGGGCGGT GGAATCGGAA
TCCGAGTCTG TCGCCGCCGA ATCGGACGAG TCTGCCTCTG AGCCGGAACC CAACGCTGAG
TCGGAACCTG CGGTGACAGG TGACGAGACG GCCGACTCAA CCGAGGCCGA CTCAACCGAG
ACCGACTCAA CCGAGACCGA CTCAACCGAG ACCGACTCAA TCGAGACCGA TACTGAGGAC
ACCGCCGTGT CCGTCGAAAC CGAACCGTCG GATTCCGGTT CCGATGCTGC TGGATCCGTC
GACGCCGGCC CCGGCTCCGC CGACTCCGAG TCTGACGACG AGAGCGGTGG GCTCGGGGAC
TTCGACGCCG GGACCGACGA TACCGAGACC GACGAATCCG GGACGGACGA CGCCGAGGCC
GACCTCGACG AGCCGTCGTC CGAGCCCGGC TCCGACGAGA TGTACCAGCT CGACGACGAG
GAACGCGAGG AGATCGAGTC GGAGTTCGGC ACCGACTTCT CGACCGGAAC CGAGGTCGAC
GAGCCCGGCG AGGCCGACAT CGACGTACCC GATCCCGAGG AGATAGAGGA GTCGCCGACC
TCGGTGGGCG CTTCGGAGTC CGCCGGGGCC GACGCACCCG CCGGAAGCGC TGACGAGGCC
GCAGCGGAGC CCACCACTGA TTCGGAACCA GAGAGCGAAG AGCCGGCGGG CGACGAGACG
AGCGATGCTG ACGCCGACAT CGACCTCGAA GCCGCCGTTA TCGACGCGAT GGGTGCGCTC
GACGACGGCG CCGGCGCCGA TCGCGAGGCG GTCGTCGAGG CGGTCGTCGA CGACCACGGC
GTCGCAGCCG ACGCGGTCGA GGACGCGATC CAGGACGCGC TGATGAGCGG GAAGTGCTAC
GAGCCCGGCG ACGGCACGCT GAAGCCGATC TGA
 
Protein sequence
MSDDGPGARE VAHRVFAAEF DDASLSYSES DEERAPNYVV TPTGARVNRL FVAGVLTEVE 
RVNDETRRGR VVDPSGAFVT YAGQYQPEAQ TFLERAEPPA FVALTGKART FEPEDSDRVF
TSVRPESLNA VDADTRDRWV VSAAEATLDR LAVFAKALDS ELRGEELRVA LETGGAPAAL
AAGIPKAIAH YDTSTAYVEA LRRLAVDALK LIAGDRDEVR SPDIAPDAGG EAAIGALPET
DVTIEAPAEP VADDGAVESE SESVAAESDE SASEPEPNAE SEPAVTGDET ADSTEADSTE
TDSTETDSTE TDSIETDTED TAVSVETEPS DSGSDAAGSV DAGPGSADSE SDDESGGLGD
FDAGTDDTET DESGTDDAEA DLDEPSSEPG SDEMYQLDDE EREEIESEFG TDFSTGTEVD
EPGEADIDVP DPEEIEESPT SVGASESAGA DAPAGSADEA AAEPTTDSEP ESEEPAGDET
SDADADIDLE AAVIDAMGAL DDGAGADREA VVEAVVDDHG VAADAVEDAI QDALMSGKCY
EPGDGTLKPI
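
The sequences above are printed in blocks of 10 characters, so they need to be joined before any computation. The following Python sketch shows one way to strip that spacing, compute a GC fraction, and translate a CDS. The helper names are hypothetical. The codon table is the standard one, whose amino-acid assignments are shared by translation table 11 (the two tables differ only in which codons may act as start codons). The example input is the first line of the gene sequence only, not the full record.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard assignments,
# which translation table 11 also uses for elongation).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First line of the gene sequence listed above.
first_line = "ATGAGCGACG ACGGACCGGG CGCCCGCGAG GTCGCTCACC GCGTGTTCGC CGCCGAGTTC"
print(translate(first_line))  # MSDDGPGAREVAHRVFAAEF
print(round(gc_fraction(first_line), 2))
```

The translated prefix matches the first 20 residues of the protein sequence listed above. Running `gc_fraction` on the full gene sequence is one way to compare against the GC content field of the record.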