Gene Hlac_0040 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_0040 
Symbol 
ID7401393 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012029 
Strand
Start bp42396 
End bp43853 
Gene Length1458 bp 
Protein Length485 aa 
Translation table11 
GC content73% 
IMG OID643707099 
Productprotein of unknown function DUF402 
Protein accessionYP_002564716 
Protein GI222478479 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG1530] Ribonucleases G and E 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.708501 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.0289003 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGGCCC GCGTTCGCGG CATCTACGCG ACGGCGCTCA CCGAGGCCCT GCTCGACGCG 
GGCCACGAGG TCGTCGGCGC GTCGACCCCG ATCCGACGGC GCTTCGACGC CGAGTTCGAA
AGCGCACCGC CCGACGCACG GATCGCGACG ACAGAGGATC GGCAGGGTGT CGGCGCGCAC
GGGGATCCGG ACGCGATAGG GACCCTCCGG GGCCTCCTGA CCGACACGGG ACTCGACGCG
CTGGCGTGGA CCGATCCGAC CCCGCCCGGG ACCGTCTGCG ACGGGACGGT GACCGAGACG
CTCGGCGGTG GGGCGGTCGT ACGGCTTCGC GTTGGCGGGG GCGAGAGCGA GGGCGACGCC
ACCACCGAGG GGTACCTCCC GTACGGGAGC GTCGACGACC GAATCGAGAC CGGCGATCCG
GTCCGGGTGC AGGTCCGGGA GTCCGCGGCG CCATGGACGG ATCGCCGCCC CGAGTTGGAC
GGGTCGCTGC GAGCGGGCGG CGGGCTCGTC ACGCTCGAAC CCGGCTCGGG CACCCGCGTC
GACGCGCGGA ACGACAAGGA CGCGCGAGAG CTGTCGGGAA TGCTCGACCT GCTCGGACTG
AAGCCGCCGG AGGGGTGGCG CGCCGTCTGG AAGCCGCCCG CGGTCGACGC CGACACCGAG
GAGCTGCAGG CCGGACTCGA CCGGGCGGTC GCGGCCGTCG AGGGGCTGGA CGACGCCGTC
GACGCGGCGG GAGGCGCCGG CGTTCTCGAC GGTTCGGACA GCGTTCGCGA GGAGCCGTTG
ACGCGCCCGA ACGCCGGCGT CTGGGTGTGG TTCGGCCGCG AGAGCCGGTT CGCGCTCGAC
GACCGCCGAC GCGAGGCGAC CGCGACGATG CCGGGTCACC ACCGGGTGAA GGCGGGGTCG
GCGGACGCAT CTTCGGGCGT TGACCTCGCA GAGGCGCTGT GCGAGCCCGA CGCGGACGCC
TCATTCCCGT TCGGGGTCGT GACGGACGCG TTCGGGCCGG CCGAGGGCGA CGCGCTCCGG
CTCGAACACG GCAAGCCCGA CGGGCGACTG ATCACGCTGG GCGAGGCGAC GGTGACCACA
GTCGACGCCG ACGGCTCGGT CGCGGTCGAG CGCGAGATGA CCGGCGGCGG CTCTTACGAC
GGGTTGGACG TGCCCCGCGA GGCCGGCGAC ATCGCTGAGA CCAGCCTGAA GGAGGGCCGA
TGGTGGTACC CGACGACGTA CCGCGGGCGC GATGGGACGG TGCGCGGGAC GTATGTCAAC
GTCTGCACGC CGGTCGAGGT GTTCCCGGAC GCCGCCCGCT ACGTCGACCT TCACGTCGAC
GTGATGAAAC ACCCCGACGG GACCGTCGAG CGCGTCGACG ACGACGAACT GCGGGACGCA
GAGGCGGCCG GAGACGTGCC GGAGCCGCTG GCGGAGAAGG CTCGGAGCGT GGCGTCGGCG
CTGGAGAACG CGCTGTGA
 
Protein sequence
MKARVRGIYA TALTEALLDA GHEVVGASTP IRRRFDAEFE SAPPDARIAT TEDRQGVGAH 
GDPDAIGTLR GLLTDTGLDA LAWTDPTPPG TVCDGTVTET LGGGAVVRLR VGGGESEGDA
TTEGYLPYGS VDDRIETGDP VRVQVRESAA PWTDRRPELD GSLRAGGGLV TLEPGSGTRV
DARNDKDARE LSGMLDLLGL KPPEGWRAVW KPPAVDADTE ELQAGLDRAV AAVEGLDDAV
DAAGGAGVLD GSDSVREEPL TRPNAGVWVW FGRESRFALD DRRREATATM PGHHRVKAGS
ADASSGVDLA EALCEPDADA SFPFGVVTDA FGPAEGDALR LEHGKPDGRL ITLGEATVTT
VDADGSVAVE REMTGGGSYD GLDVPREAGD IAETSLKEGR WWYPTTYRGR DGTVRGTYVN
VCTPVEVFPD AARYVDLHVD VMKHPDGTVE RVDDDELRDA EAAGDVPEPL AEKARSVASA
LENAL