Gene Hlac_1583 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_1583 
Symbol 
ID7401516 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012029 
Strand
Start bp1604291 
End bp1605649 
Gene Length1359 bp 
Protein Length452 aa 
Translation table11 
GC content66% 
IMG OID643708649 
Producthypothetical protein 
Protein accessionYP_002566239 
Protein GI222480002 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.0370379 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCACAGAA GGCGGTTCCT TCGGAGCAGT ATCGCCGCGG TCGGCCTCGC AGGGTTCGGA 
TCCACGGTGA GCGCACAGTC GGGCGGCGAC TACCGTCCCG GCGGGCCGTG GGCGCCGAAC
GAGCGCGCGG TCAACTACGA GGCGTATCTC GACAACCAAC AGCTCGGCGA GCGCCTCAAG
CAGATCGACC GGCGGAGCGA CCGGGTCGAA CTTGAACAGA TCGGCGCCTC GGCCGGCCGT
GAGGACCCGA TCTGGGAGGT CACGATCGGC GACGGCAACG AGAGCCTCCA CCTCATCAAC
CAGATCCACG GCGACGAGCC CTCCGGCGCC GAGGCCGTCG TGAAGATCCT CAACCGGCTG
GCGACCGGCG GCTCCCGGCG GGTCGAGACG ATCTTGGACA ACCTCACGAT CACGATCGTC
CCGCGGGTCA ACCCGGACGG GGCGAACTTC GTCGGTGACG ACGGGCTCGA AACGGATGGG
GAGCTCCGGC AGCGCCGGTA CAACACCAAC ACGTGGGAGG AGGGCGACTC CCGGTACATC
AACCGCAACT CGTACTTCGC CGGCGACGTG CCGGGATACG ACATGAACCG GGGGTTCAGT
ATCCTCCCCG ACTTCGAGCC GGGCGACGAG GACGAAGACT GGTGGGATGT CGTCGAGGAA
GCCCCGCAGT TCGGATACCT CAATATCCCG GTCGAGGACG TCCCGGTCGA TGTCCGCGAC
CCGACGGTCG CCGCCGGCGA GAATCCGTAC GACGAGCTGT GGTCGATGGG ACTCAATCTC
AACCCGGAGA ACCGGGCAGT GACGGAGTCG TTCCTCGACG CCGACCCCGA CTGGGCGATC
ACTCATCACC ACCAAGGCGC GGTCGTGGAT CCGGACTCCC CCGATCGGGG GAACGGACCC
AAACAGCAGT CGATCATGAG CGTGATGGCG CCGTTCGGTC CCAGATACAT CGACCACGAT
AGGTTCGACT ACGCCAGCTA CGTCGGCAAC GGGAACCCGT ACCTCTCTGA GGACGCCCAG
ACGCGCTCGC TGCAGCTCAA TCAGCTGGTC AACGAGCAGG CCCAGCAGTT CGGCAAGGGG
AAGTTCAACA CGCTCACCCG GTACGGATAC GGTCCGCTCT GGGGGTCGTA CCTCGACGCG
CTGTGTCCGC GGACGGACGC CGCCGGGATG CTCTACGAGG TGTCCCACCA GAGCGACGAG
CGCGGCCACA AGGCGATCGG CACGACGGTC AAGATCACCG TCGAGGGGTT CATGGCGACG
TTCGAGCGGA TCGCCGACGG CTCGATCAGC GAGGTCGACG AACTGGACTA CTTCGACATG
CCGCTGGCCG AGGGCATCGA GAGTCCGTTC GGCCGATAG
 
Protein sequence
MHRRRFLRSS IAAVGLAGFG STVSAQSGGD YRPGGPWAPN ERAVNYEAYL DNQQLGERLK 
QIDRRSDRVE LEQIGASAGR EDPIWEVTIG DGNESLHLIN QIHGDEPSGA EAVVKILNRL
ATGGSRRVET ILDNLTITIV PRVNPDGANF VGDDGLETDG ELRQRRYNTN TWEEGDSRYI
NRNSYFAGDV PGYDMNRGFS ILPDFEPGDE DEDWWDVVEE APQFGYLNIP VEDVPVDVRD
PTVAAGENPY DELWSMGLNL NPENRAVTES FLDADPDWAI THHHQGAVVD PDSPDRGNGP
KQQSIMSVMA PFGPRYIDHD RFDYASYVGN GNPYLSEDAQ TRSLQLNQLV NEQAQQFGKG
KFNTLTRYGY GPLWGSYLDA LCPRTDAAGM LYEVSHQSDE RGHKAIGTTV KITVEGFMAT
FERIADGSIS EVDELDYFDM PLAEGIESPF GR