Gene Hlac_3363 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_3363 
Symbol 
ID7402218 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012030 
Strand
Start bp118051 
End bp119304 
Gene Length1254 bp 
Protein Length417 aa 
Translation table11 
GC content56% 
IMG OID643709914 
Producttransposase, IS605 OrfB family 
Protein accessionYP_002567480 
Protein GI222481244 
COG category[L] Replication, recombination and repair 
COG ID[COG0675] Transposase and inactivated derivatives 
TIGRFAM ID[TIGR01766] transposase, IS605 OrfB family, central region 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCGAATC TCACCGTCAC GCGCACCTAC GTTGGTTCTA TCCAGAACCA CCAACAGATC 
TGTGATGGTC TGGATTCGCT CGGGGATTCC GCCTCGAAAA TCTGGAACGT CGCACGATGG
ACAGCCGACC GTATCTGGGA CGCAACCGGT GAGATCCCGA ACGGTAGTGT TCTGAAATCG
TGTATGAAGA ATCAGTCGTG CTGGAAAGAT TTGAACGCGC AATCCAGTCA GAAAGTCATT
GAAGAACTTT CTGACGCTTT CCAGTCGTGG TTCGACTTAC GGCACAAGTC CAGTGAGGCG
AATCCGCCCG GCTACCGCAA ACACGGTGAC ACCCGACCGC GTTCCACGGT GACATTCAAA
GAAGACGGAT TCAAACACGA CCCCGAGAAC AACCGCGTCC GGCTCTCGAA AGGCTCGAAC
CTGAAAGAAT ACTGGTCGGA CTTCCTGCTC TGCGAGTACC AGACGCGCCC TGACATTGAC
CTCTCTGAAG TCAACCGAGT GCAGAACGTT CGCGCCGTCT GGAACGGCGA CGAGTGGGAA
CTGCACTTCG TCTGCAAAGT CGAACTCGAA ACGAACGACT CCGCAGGCGA CGAAGTGGCG
GGGATTGACC TTGGCATCAA GAACATCGCC ACGGTCGCGT TCCCGGACGA ATACGTTCTC
TACCCCGGTA ACTCGCTCAA AGAAGACAAG CACTACTTCA AACGAGCCGA GTACGACACC
GAAGGTGAGA ACGGCCCCTC GGAGAAGTCG ATGTGGGCGC GTCGGAAACT CGCTGACCGC
GAAACACACT TCTACCACGT CCTCTCAGAC ACCATCATCA CAGAGTGTGT CGAACGCGGT
GTTGGTACAC TCGCGGTGAG TTGGCCTGAA GAGGTGCGAG AGTCCGACTG GGGCAAAACT
GGGAACAAGA AGTTACACTC GTGGGCGTTC GACCGCATCT ACCAGTACCT CGCGTACAAA
GGCGAGATTC ACGGTGTCGA GGTGTTGAAG GAGAACGAGT GGAACACCTC AAAGACCTGC
TCGAACTGTG GTGACGACAC GAAGGAGAAC CGTGTCGAGC GTGGGTTGTA CGTCTGCTCG
TCGTGCGGTT TGGTTGGGAA CGCGGATTGC AACGGGGCGG AGAACATGCG GCAGAAGATA
ACTCCGAGTC CTCACGGTGA GGATAGGAGT AACGGCTGTG TGGCACAGCC ATCGACATAC
TTGTTCGACC GCGAGAGCGG GACGTTTCAC ACGAGAGAAC AAGCCGTGTC GTAG
 
Protein sequence
MANLTVTRTY VGSIQNHQQI CDGLDSLGDS ASKIWNVARW TADRIWDATG EIPNGSVLKS 
CMKNQSCWKD LNAQSSQKVI EELSDAFQSW FDLRHKSSEA NPPGYRKHGD TRPRSTVTFK
EDGFKHDPEN NRVRLSKGSN LKEYWSDFLL CEYQTRPDID LSEVNRVQNV RAVWNGDEWE
LHFVCKVELE TNDSAGDEVA GIDLGIKNIA TVAFPDEYVL YPGNSLKEDK HYFKRAEYDT
EGENGPSEKS MWARRKLADR ETHFYHVLSD TIITECVERG VGTLAVSWPE EVRESDWGKT
GNKKLHSWAF DRIYQYLAYK GEIHGVEVLK ENEWNTSKTC SNCGDDTKEN RVERGLYVCS
SCGLVGNADC NGAENMRQKI TPSPHGEDRS NGCVAQPSTY LFDRESGTFH TREQAVS