Gene Hlac_2070 details

Gene Information

Locus tag: Hlac_2070
Symbol:
ID: 7400590
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorubrum lacusprofundi ATCC 49239
Kingdom: Archaea
Replicon accession: NC_012029
Strand:
Start bp: 2059214
End bp: 2060416
Gene Length: 1203 bp
Protein Length: 400 aa
Translation table: 11
GC content: 73%
IMG OID: 643709141
Product: tRNA splicing endonuclease
Protein accession: YP_002566718
Protein GI: 222480481
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG1676] tRNA splicing endonuclease
TIGRFAM ID: [TIGR00324] tRNA intron endonuclease
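
The start, end, gene length, and protein length fields above can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the CDS ends in a single untranslated stop codon; neither convention is stated in the record, and the variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 2059214
end_bp = 2060416
reported_gene_length = 1203      # bp
reported_protein_length = 400    # aa

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length == reported_gene_length)        # True
print(remainder == 0)                             # True
print(protein_length == reported_protein_length)  # True
```

Under those assumptions the reported lengths agree with the coordinates.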


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value: 0.0810012
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGATATGC AACCGACAGG GCACCTGCGC GGCGACGCGG TCCACGTCGC CGGCGACGCC 
CGCCAGCGGT TCCACGACTC CAACGGCTAC GGCCGTCCGC TCGGCGGCAA CGAGATCGCG
CTCTCGCGGG TCGAGACCGC TCACCTGCTG TTCCGGGGCG ACCTCTCGGG AGTCTCCCTC
ACCCCCGACG CCGACCCCGT CGGCTTCGAG CGCTTCTTCG TCGAGTCCGC GGCCGCCGCC
GACCGCTTCG CGGTACGCTA CCTCGTTTAC GCCGACCTGC GCGACCGCGG CTTCTACCTC
TCGCCCGCCC GCGAGCCGTG GCCCGGCGGC GACGTCGACG CGCCCGACGC GGTCGACTTC
GTCGCCTACG AGCGCGGTTC GACGCCCGAC ACGGGGGACG TGAAGTACCC CGTACAGGTC
GTCGGCGAGC GCGAGTCGCT GCCGGCGGCC GGGCTCGCGG GACGCACCCT CGCCGTCGTC
GACGAGGAGT CCGACATCAC GTACTTCGCG GCGACGCGCG GGGAGATCGC GGGTGCGACC
GACTACGAGC CCCCGGACCA GCTCGACGGC GTCCTGCTCT CCGACCGCGT CGTCGTCTGG
GACGCCCCCG AAGGGCTCTA CGAGCGCGGC TTCTACGGCC AGCCGCTCAC CGGCCGCGCC
GCGGCGGTCG AGGGCGCGGT CCAGCTATCC TTAGTCGAGG CCGCGTCGCT CGCCGCCGAC
GGCGTCCTCT CGCTGTCGAC GTCGGTGGGG ACGGCGGGCG AATCCCCCAG CGAGGGCGCC
GCAGAGACGG ACGGTCCCGT CGCCGCAGTC GTCGCCCGCG GCCGCGACGT GGAGGGCGAG
CGGTTCGACC GCCGGCTCGC CGTTTATAAG CGCCTCCGCG CGGCCGACGC CGTGCCGAAG
ACCGGCTTCA AGTTCGGCGC AGACTTCCGG ACGTACCTCG ACGTGGAGAC GGTCGAGGAC
CTGCCGCACT CCGAGCACCT CGTGCGCGTC GTCGAGGGCG ACCACCGATT CTCCCCGCGC
GAGCTCTCGC TCGACGTGCG GCTCGCGGGC GGCGTCCGCA AGGAGATGGT GTTCGCGCTG
ACGATGGTGG AGGGTGACGG TAGCGGCGCC GGTGGCGACG GCGGCGACGA CGAGACCGCC
GAGAACGGCG CGGTCCGCGA CGCCGACGTG GAGTGGCTCT CGATCGGGCG ACTGACGCCC
TGA
 
Protein sequence
MDMQPTGHLR GDAVHVAGDA RQRFHDSNGY GRPLGGNEIA LSRVETAHLL FRGDLSGVSL 
TPDADPVGFE RFFVESAAAA DRFAVRYLVY ADLRDRGFYL SPAREPWPGG DVDAPDAVDF
VAYERGSTPD TGDVKYPVQV VGERESLPAA GLAGRTLAVV DEESDITYFA ATRGEIAGAT
DYEPPDQLDG VLLSDRVVVW DAPEGLYERG FYGQPLTGRA AAVEGAVQLS LVEAASLAAD
GVLSLSTSVG TAGESPSEGA AETDGPVAAV VARGRDVEGE RFDRRLAVYK RLRAADAVPK
TGFKFGADFR TYLDVETVED LPHSEHLVRV VEGDHRFSPR ELSLDVRLAG GVRKEMVFAL
TMVEGDGSGA GGDGGDDETA ENGAVRDADV EWLSIGRLTP
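
Both sequences above are printed in blocks of ten characters separated by spaces and line breaks, so they need to be joined before any downstream use. The following is a minimal sketch of that cleanup, plus a GC-fraction helper of the kind that could be used to check the GC content field; the function names `clean_sequence` and `gc_fraction` are hypothetical and not part of the source database.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a sequence printed in blocks."""
    return "".join(blocked.split()).upper()


def gc_fraction(dna: str) -> float:
    """Return the fraction of G and C bases in a cleaned DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence from the record, used as a short example.
first_line = "ATGGATATGC AACCGACAGG GCACCTGCGC GGCGACGCGG TCCACGTCGC CGGCGACGCC"
dna = clean_sequence(first_line)

print(len(dna))                    # six blocks of ten bases: 60
print(round(gc_fraction(dna), 2))  # GC fraction of this one line only
```

Passing the full gene sequence through `clean_sequence` in the same way should give a string whose length matches the Gene Length field, assuming the sequence was copied without loss.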