Gene Hlac_0310 details

Gene Information

Locus tag: Hlac_0310
Symbol:
ID: 7399700
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorubrum lacusprofundi ATCC 49239
Kingdom: Archaea
Replicon accession: NC_012029
Strand:
Start bp: 331616
End bp: 332908
Gene Length: 1293 bp
Protein Length: 430 aa
Translation table: 11
GC content: 69%
IMG OID: 643707372
Product: Protein of unknown function DUF650
Protein accession: YP_002564984
Protein GI: 222478747
COG category: [S] Function unknown
COG ID: [COG1602] Uncharacterized conserved protein
TIGRFAM ID:
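
The gene and protein lengths above follow from the start and end coordinates. A minimal sketch of that arithmetic, assuming 1-based inclusive coordinates and a single terminal stop codon (the function name `cds_lengths` is illustrative and not part of the database):

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and that the last codon is a
    stop codon, which is not counted in the protein length.
    """
    gene_length = end_bp - start_bp + 1
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_length = gene_length // 3 - 1  # drop the stop codon
    return gene_length, protein_length


print(cds_lengths(331616, 332908))  # (1293, 430)
```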


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value: 0.156521
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGGCTCG ACGAGTTCAT TGAATTCGAA GCGGACGAGC GCGCCGAGCG GCGGCGCCTC 
GCCGCCGAGA AGGACTACGG CATCCTCGAT CACCTCGACT CCTTCGAGCG GCGCTTCGAG
GAGCACGTCT CCGACGACGC CGTGGTCGGC AGCGTCTCCC CCTCCATCTT CGTCGGGCGC
TCGAACTACC CGAACGTCTC GACCGGGCTG CTCTCGCCGG TCGGCCGCGA GGAGCGCGCA
GCGCGCTTCG AAACCTCCGC GGCGTGGTAC GACGAGGGCG TCTCCATCGC CGACGTGTTC
GACCGGCGGA CGAGCCTGCT CAACTCGACC CGCGGGGCCG ACGTGGCGGA CGCCGGACGC
GACGGGGGCG GCGTCCACGA CGCGTGGAAC GGCTGGCTCG GGGTCCAGCG CGAGGTCGCG
ATCGCGGACC GGCCGGTCGA CGTCGAGATA GGGTTGGACG GTACCCCCGA CCTCGACTTC
GATATCGGCA CGGAGGACAT CAAGACGCCG ACCGGCCCGC GCGCCGCGGC TCGGACCGCC
GACCTCGGCG AGAACCCGCA CGTCCCGCGC CCGGTGAAAA AGACGCTCGA AGACGACGAC
TGGCGCGCCG AGGGCGCGAT GACGTATCTC TACCGCCGCG GATTCGACGT GTACGACATC
AACACGATCC TCTCCGCGGG CGCACTCGGG CGCGGGAAGA ACCGCCGGCT CGTCCCGACG
CGCTGGTCGA TCACGGCCGT CGACGACACG GTCGGACAGT TTCTCCGCGG ATCGATCCGC
GACAATCCGA CGGTTAACCG GATCGAAGTC CACCGAAACG AGTACCTCGG CAACGCCTTC
TGGGTGATCT TGGTTCCGGG ACAGTGGGAG TACGAGCTTG TCGAGATGAA GTCGCCCGGC
TCGATCTGGA ACCCCGACCC TGGGGCGGGC GTGTATCTCT CGGCCGCGAG CGAGGGCCGC
GAAGGTCGGA CCGGCTACGT CGAGGAGACC GCGGGCGCCT ACTACGCGGC CCGGATGGGG
GTTCTGGAAC ACCTCGACGA CCGCGGACGG CAGGCGAAGG CGCTCGTGTT GCGGCACGTC
TCCGACGACT ACTGGGGTCC GGTCGGCGTC TGGCAGGTGC GCGAGGCGGT CCGGAACGCC
TTCGAGGGGG AACACGGGAC CGCCGAGACG TTCGGCGAGG CAGTTCGCGG CGTCGCTGAC
CACCTCCCGG TCCCGCTCGG TCGCCTCCGA CGAAAATCGA CGATGGCGGC CGGACTACAG
GCGAACCTCG GCGACTTCGT CGACGCCGGG TGA
 
Protein sequence
MRLDEFIEFE ADERAERRRL AAEKDYGILD HLDSFERRFE EHVSDDAVVG SVSPSIFVGR 
SNYPNVSTGL LSPVGREERA ARFETSAAWY DEGVSIADVF DRRTSLLNST RGADVADAGR
DGGGVHDAWN GWLGVQREVA IADRPVDVEI GLDGTPDLDF DIGTEDIKTP TGPRAAARTA
DLGENPHVPR PVKKTLEDDD WRAEGAMTYL YRRGFDVYDI NTILSAGALG RGKNRRLVPT
RWSITAVDDT VGQFLRGSIR DNPTVNRIEV HRNEYLGNAF WVILVPGQWE YELVEMKSPG
SIWNPDPGAG VYLSAASEGR EGRTGYVEET AGAYYAARMG VLEHLDDRGR QAKALVLRHV
SDDYWGPVGV WQVREAVRNA FEGEHGTAET FGEAVRGVAD HLPVPLGRLR RKSTMAAGLQ
ANLGDFVDAG
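
The sequences above are printed in blocks of 10 residues separated by spaces. The following is a hedged sketch of how such a listing could be cleaned, checked for GC content, and translated. The helper names (`clean_sequence`, `gc_fraction`, `translate`) are hypothetical. The codon table is the standard one, whose codon-to-amino-acid assignments are shared by translation table 11; table 11 differs only in the set of permitted start codons, which this sketch does not model.

```python
from itertools import product

BASES = "TCAG"
# Standard codon assignments in TCAG order (same as NCBI translation table 11).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a cleaned DNA sequence."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate a cleaned DNA sequence, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 18 bases of the gene sequence listed above.
start = clean_sequence("ATGCGGCTCG ACGAGTTC")
print(translate(start))  # MRLDEF
```

Applied to the full gene sequence, `translate` would be expected to reproduce the listed protein sequence, and `gc_fraction` the reported GC content, although that has not been verified here.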