Gene Hlac_3051 details

Gene Information

Locus tag: Hlac_3051
Symbol:
ID: 7399025
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorubrum lacusprofundi ATCC 49239
Kingdom: Archaea
Replicon accession: NC_012028
Strand:
Start bp: 309938
End bp: 311236
Gene length: 1299 bp
Protein length: 432 aa
Translation table: 11
GC content: 54%
IMG OID: 643706858
Product: transposase, IS605 OrfB family
Protein accession: YP_002564480
Protein GI: 222475959
COG category: [L] Replication, recombination and repair
COG ID: [COG0675] Transposase and inactivated derivatives
TIGRFAM ID: [TIGR01766] transposase, IS605 OrfB family, central region
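
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the record does not state this) and that the CDS ends in a single stop codon. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 309938
end_bp = 311236
gene_length_bp = 1299
protein_length_aa = 432

# Assumption: coordinates are inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# A CDS has one codon per residue plus one terminal stop codon.
codons, remainder = divmod(span, 3)

print("span matches gene length:", span == gene_length_bp)
print("codons minus stop match protein length:",
      remainder == 0 and codons - 1 == protein_length_aa)
```

Under those assumptions both checks agree with the record: 1299 bp is 433 codons, that is 432 residues plus the stop codon.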


Plasmid Coverage information

Number of covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Number of covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCACTACA CCTACAGATT TCGGCTTGAT CCCACGCCTG AACAGCGTGA ACTGTTGGAT 
CATCACCGAG ATACCTGTAG GCAACTCTAC AACCACGCAC TCAACGAATT CAAGCAAATT
CCCAAATCGG CGGGTACACT TAACCAACGA GTGCGACAAG TACGCGATCA GCTCACCAGC
CTCAAAGACT GGTGGGATGA GCTGAACGAT GTCTATTCAA CGGTCGCACA AGCTGCTGTC
ATGCGTATCG AAGACAGCAT CAAAGCCCTC TCTCAGTTGA AGCAGAACGG CTACAACGTG
GGCAGTCTCA ATTGGAAGGC CCCCAAGGAT TTCCGTAGTT TCACCTACAT ACAGTCTGGC
TTCGAGTTCG ATAGTAAGAA CGGCCAACCC GTACTGTCGC TGTCGAAACT TGCGGATATT
CCCCTCATCA AACACCGCGC AATTCCTGAC GCCGAGACTG TCAAAGAAAT CACGATTAAG
AAGGAGTCAA CCGGTGAATG GTTCGCTTCA TTCACCGTCG GCGATAAAGA GACTCCTGAG
AAACCGACCG ACCCAGATCG ATGTGTCGGG ATTGACGTTG GCATCTTGAA GTACGCCCAT
GACACAGACG GCACCGCCGT CGAATCGCTC GACTTATCTG ACGAACGCGA GCGGTTGGAA
CGCGCACAGC ATGATCTTTC GCAGAAGGAA CGCGGTTCCG CGAATTGGGA GAGACAACGG
CAAGTTGTGG CCGAGCGCCA CGCCGATCTC AAGCGAAAGC GTCGTGACTT CCTTCACAAA
CTCTCGAACT ACTACGCCAC CGAATACGAC CTCGTAGCGG TCGAAGGCCT CGACGCGAAG
GAGTTGGTCG AACTCCCCGG AAACTCACGG AATCGGGCGG GAGCGGCGTG GGGAACGTTC
CTTCGAATGC TTGAGTACAA GTGCGAACGC GAAGGAACAC ACTTTGCCGA AGTCGATCCA
AGGAACACGA CGAAAGCGTG CGCGTCTTGC GGCGTCAAGA CGGACAAGCC GTTGTGGGTT
CGTGAACACT CGTGTCCCTC GTGTGGGTTT GAGGCGGACA GGGACGCGAA CGCAGCGTGG
AACATTCTTT CTCGCGGTCT TAAAAATATA GGAGTGGTTC ACTCCGAATC AACGCCTGTG
GAGACTGCGC TCCCTACGGA CACCGTTGTG TCTGCAAAGC GCGTCATCGA AACAGGAAGC
CCCATCACCA GAAGTCAAAG ACTTCGGGTT AGCAGTCAGA ACTCGGAGAG TTCTGACGAC
ACCCTCAAGG AGCGAACGGC GTCAGCCGTG AGCGAGTAG
 
Protein sequence
MHYTYRFRLD PTPEQRELLD HHRDTCRQLY NHALNEFKQI PKSAGTLNQR VRQVRDQLTS 
LKDWWDELND VYSTVAQAAV MRIEDSIKAL SQLKQNGYNV GSLNWKAPKD FRSFTYIQSG
FEFDSKNGQP VLSLSKLADI PLIKHRAIPD AETVKEITIK KESTGEWFAS FTVGDKETPE
KPTDPDRCVG IDVGILKYAH DTDGTAVESL DLSDERERLE RAQHDLSQKE RGSANWERQR
QVVAERHADL KRKRRDFLHK LSNYYATEYD LVAVEGLDAK ELVELPGNSR NRAGAAWGTF
LRMLEYKCER EGTHFAEVDP RNTTKACASC GVKTDKPLWV REHSCPSCGF EADRDANAAW
NILSRGLKNI GVVHSESTPV ETALPTDTVV SAKRVIETGS PITRSQRLRV SSQNSESSDD
TLKERTASAV SE
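
Both sequences above are printed in space-separated blocks of 10, so the spaces and line breaks have to be removed before any analysis. The following is a minimal sketch of that cleanup, followed by a translation of the first line of the gene sequence to compare against the start of the protein sequence. The helper names (clean_blocked_sequence, translate) are hypothetical and not part of any database tool. The codon table is the standard one, which gives the same amino acids as translation table 11 (table 11 differs only in which start codons are permitted).

```python
# Standard codon assignments in TCAG order. Translation table 11 uses the
# same amino-acid assignments and differs only in allowed start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_blocked_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for 10-character blocking."""
    return "".join(text.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1 and stop at the first stop codon.

    Only A, C, G and T are handled; any other symbol raises KeyError.
    """
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First line of the gene sequence and the first two protein blocks,
# copied from this record.
first_gene_line = "ATGCACTACA CCTACAGATT TCGGCTTGAT CCCACGCCTG AACAGCGTGA ACTGTTGGAT"
first_protein_blocks = "MHYTYRFRLD PTPEQRELLD"

dna = clean_blocked_sequence(first_gene_line)
print(translate(dna) == clean_blocked_sequence(first_protein_blocks))
```

The same two helpers can be applied to the full gene sequence by pasting all of its lines into one string.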