Gene Hlac_2942 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_2942 
Symbol 
ID7398925 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012028 
Strand
Start bp197890 
End bp199566 
Gene Length1677 bp 
Protein Length558 aa 
Translation table11 
GC content57% 
IMG OID643706758 
Producttransposase IS4 family protein 
Protein accessionYP_002564380 
Protein GI222475859 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCATCAA CAGAGCAATC GCGACGAGAT GTCTTTCGAA CTATCGCACA GTTACAGCAC 
GTCGAATGGC CTACATACGG GTCGACACCG CTGTACGACC GCAGTTCGGT ATCTGCTCTC
CAATCGGACA TTCGAACCGT CGCCAGAGTC TGGTTCGAAC ACGACGTCCA CGACTCGATT
GCGGAGTTCG TCTCGCAGTA CCCACTGAAA TACGTCGATT TCGGGCCGCA TGACGAGTAT
TCTGGCTCCA CGCGGTATCA GATGCCGCAG TTGGTCCGGT TGTTCTTGCT CAAAGAGATC
CACGGTTGGG ACCACGAAAC GGCACTCTTG ACGTTCCTTC GACAGCGACC GGAACTCCGT
TGCGATCTCG GTTTCGAGCG TATGCCGGAT CAATCGACTC TATGGCGCAG TTGGCACCAG
CGATTCACGA CCGAACTCCG CGAGACAATC GAAACGGCGG CTCGAACGAT CCTGATCAAG
GCCCAGGATG CGGGTCTCAT AGTTCCACGA GAACCAGACC GAAAGCTCAG CTATCGAAAC
GATGATCAAG ATGACTCAGC ACCGGACGAT CAGACCGTTC TGAAGCGGGC CAAGAAAATC
ACCAACCACT TCAGTCGTAT CGTCTTCCCT GCATTCTCAC TGAATCGCGG TGAGGGATGT
GAGATCCACG AGAACGCCTA CTGGGGCTTG CAGACCTATC TCGGGCTCCG TGAGAACTTG
GCTGCCAACG AGGGTGCTCG GAGCTTCATT TACGAGTCAA CGCGAGAGCG GACACCACTC
GGACACGCTC ACCGTGACCA CATTCGCGGT CTTTCGATAC CAGCGGTTCG GGAGATGTAC
CGAGAGGCCG TCGATCGGCT GCTGGAAGAG GTTGCAGGGA CTGAGGAGTT CTTTCGAGCT
GGAATCGTCG CGATCGACAT CACCGAAGCC GATCCGTTCA CGGGCGACCG AACGGGCCAC
GAAGACGAGA TCATCGGGAC GAAAGAGCAG ACCGACGAGT ACGCCTATCA GTGGGCGACG
GTCCAGTTGG TCGGGAACGC CGTCCCAATC GTGCTGGACG CGCGCCCGGT ACGGAAAGGA
GAGTCACGAA AGGAGATCGT CGAGGACCTG CTGAATTCGG CTGAGGACCT CGTTCATGTC
GATAACGTCC TGATGGACCG GGAGTTCGAT AGCCAACACG TTCTGGAGAT GCTCAGCCAG
CGCGGGCTTT CCTACGTCGT TCCGAAACGG ATGCAGACCA GCGAGAAAGC TCAGGCCAAG
CGGTTGCTCC GGCGTGACCG AGACCGATAC GAGACCGACC GGAAGCTGCA TCTCGGAAAG
AACGAGTGGC ACCAGACGAC GCTAATCTAC CACCGCAAAG AGAACGCTGA GCACGACGAC
CATCGACAGT ATTCAGTGTT CATGACGAAT TGCGGGAGTG GTCACCTCAC GGAGTACGGC
TATCGCTGGG AGATCGAGAG CGGATACCGG TCGATTAAGC GGTTCATGGC TGCGACGACG
TCGAAGGATT TCGGGCTTCG CTTCTTCTAC TTCGCGTTTG CGTGTCTGCT CTACTCGATC
TGGCGGGCTG TCGATCTACT CGTCCAAGTC GAGTTGACCG GTGAATACGA GCACTCGCCC
ATTGTGACGG CCGACAATAC GCTCACGCTG TTGAAGAAGG AGACCGGAAT CGGATAG
 
Protein sequence
MPSTEQSRRD VFRTIAQLQH VEWPTYGSTP LYDRSSVSAL QSDIRTVARV WFEHDVHDSI 
AEFVSQYPLK YVDFGPHDEY SGSTRYQMPQ LVRLFLLKEI HGWDHETALL TFLRQRPELR
CDLGFERMPD QSTLWRSWHQ RFTTELRETI ETAARTILIK AQDAGLIVPR EPDRKLSYRN
DDQDDSAPDD QTVLKRAKKI TNHFSRIVFP AFSLNRGEGC EIHENAYWGL QTYLGLRENL
AANEGARSFI YESTRERTPL GHAHRDHIRG LSIPAVREMY REAVDRLLEE VAGTEEFFRA
GIVAIDITEA DPFTGDRTGH EDEIIGTKEQ TDEYAYQWAT VQLVGNAVPI VLDARPVRKG
ESRKEIVEDL LNSAEDLVHV DNVLMDREFD SQHVLEMLSQ RGLSYVVPKR MQTSEKAQAK
RLLRRDRDRY ETDRKLHLGK NEWHQTTLIY HRKENAEHDD HRQYSVFMTN CGSGHLTEYG
YRWEIESGYR SIKRFMAATT SKDFGLRFFY FAFACLLYSI WRAVDLLVQV ELTGEYEHSP
IVTADNTLTL LKKETGIG