Gene Hlac_2233 details


Gene Information

Locus tag: Hlac_2233
Symbol:
ID: 7399942
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorubrum lacusprofundi ATCC 49239
Kingdom: Archaea
Replicon accession: NC_012029
Strand:
Start bp: 2218307
End bp: 2219407
Gene Length: 1101 bp
Protein Length: 366 aa
Translation table: 11
GC content: 71%
IMG OID: 643709306
Product: hypothetical protein
Protein accession: YP_002566880
Protein GI: 222480643
COG category: [S] Function unknown
COG ID: [COG5282] Uncharacterized conserved protein
TIGRFAM ID: [TIGR03624] putative hydrolase
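
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (which is consistent with the reported 1101 bp) and that the CDS ends in a stop codon that is not counted in the protein length. The function names are illustrative only and are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the record above.
START_BP, END_BP = 2218307, 2219407

length = gene_length(START_BP, END_BP)
print(length, protein_length(length))  # prints "1101 366"
```

Both results agree with the Gene Length and Protein Length fields of the record.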


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value: 0.442538
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGACATCC TCCGAAGCCT CCGGACCGTC TCCGAGGCCA GCGGGCCGGG CGTCGTCGAC 
TGGGACCGGG CCGCGGCGGC CGCGAAGGCG AGTACCGACT CCGGATCGAT CGCCCTCACC
GAGGCGGAGC GAGCCGGGTA CGCGGCCGAC GTGCGCGACG CGCGCTCCCG TCTCCGCGAG
GTCGCCGGTA TCGAGTTCGA CGTGCCCGAC CGCATCGAGG TGCAGAACCG GCACCACTGG
ATCGACGCCA GCGTCGACAC GTTCCGGAAC GTGATGGCGC CGATCGAGGC GGCGACGACC
GATTCCGACA ACGAGGGGGC GGTGATCGGG GGCGGCGAGG AGCCGATCGG CGGGATCGTC
GAGCCGACCG GCGGGCCCGT GGGCTTCCCG ACCGGCGACC TGACGCGAGG GTTCGCGCAG
GACCTCTCGC GGATCGCTAA CACTGGCTCG ATGGCGTTCA CGCTCGGGTT CTTAGCGCGC
AACGTACTCG GCCAGTATGA CCCGCTCCTG TTGGCCGACG AGCCCGACGC CGACCACGGG
CTCTACTTCG TCCACCCGAA CATCGTCGCG GTCGCGGCGT CGCTCGACGT CGAGTACCCT
CGGTTCAGGC GCTGGATCGC TTTCCACGAG GTGACGCACG CGGCGGAGTT CGGCGCGGCG
CCGTGGCTCC CCGAGTACCT CGAATCGCGG GTTGAGCGCG GGATCAAGGG GCTCACCGGC
GGCGACAGAC TGACCGCGGG CGGGCTGCCG GTCGACGCGC TCGATACCGA GCCGTTTGCG
GAGCTGCAGG CGGCGATGAC GGCGGTCGAG GGGTACGCCG AGGTGCTGAT GGACCGTGCT
TTCGACGGCG AGTACGCCGA CCTCCGCCGG AAGCTTGACG AGCGTCGGGG CGGAGGCGGC
CCGGTCCAGC GGCTCGCGCG CCGGCTGCTC GGGCTCGGAC TGAAGCGCCG GCAGTACGAG
CGCGGCGCCA CCTTCTTCCG ACACGTCGCC GACGCCCGGG GGATCGAGGC GGCCGGCGCC
GTCTGGGAAC GTCCCGAGAA CCTTCCGACG AGCGCCGAGC TTGAGGATCC CGACATGTGG
CTGGTTCGAG TCGACCCCTG A
 
Protein sequence
MDILRSLRTV SEASGPGVVD WDRAAAAAKA STDSGSIALT EAERAGYAAD VRDARSRLRE 
VAGIEFDVPD RIEVQNRHHW IDASVDTFRN VMAPIEAATT DSDNEGAVIG GGEEPIGGIV
EPTGGPVGFP TGDLTRGFAQ DLSRIANTGS MAFTLGFLAR NVLGQYDPLL LADEPDADHG
LYFVHPNIVA VAASLDVEYP RFRRWIAFHE VTHAAEFGAA PWLPEYLESR VERGIKGLTG
GDRLTAGGLP VDALDTEPFA ELQAAMTAVE GYAEVLMDRA FDGEYADLRR KLDERRGGGG
PVQRLARRLL GLGLKRRQYE RGATFFRHVA DARGIEAAGA VWERPENLPT SAELEDPDMW
LVRVDP
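
The relationship between the gene sequence, the GC content field, and the protein sequence can be sketched in a few lines. This is a minimal illustration, not the database's own pipeline: `clean`, `gc_content`, and `translate` are hypothetical helper names, and the codon table is the standard one, whose codon assignments translation table 11 shares (table 11 differs in the start codons it permits). Sequences are expected in the layout shown above, with blocks separated by spaces and line breaks.

```python
from itertools import product

# Codon table in the conventional TCAG ordering; translation table 11
# uses the same codon-to-amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove spaces and line breaks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First and last blocks of the gene sequence above.
print(translate("ATGGACATCC TCCGAAGC"))  # prints "MDILRS"
print(translate("GTCGACCCCTG A"))        # prints "VDP"
```

Passing the complete gene sequence to `translate` should reproduce the protein sequence listed above, and `gc_content` on the same input can be compared against the GC content field of the record.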