Gene Information |
Locus tag | Hlac_2942 |
Symbol | |
ID | 7398925 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorubrum lacusprofundi ATCC 49239 |
Kingdom | Archaea |
Replicon accession | NC_012028 |
Strand | - |
Start bp | 197890 |
End bp | 199566 |
Gene Length | 1677 bp |
Protein Length | 558 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 643706758 |
Product | transposase IS4 family protein |
Protein accession | YP_002564380 |
Protein GI | 222475859 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
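The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is not part of the record. It assumes the start and end positions are 1-based and inclusive, and that the gene length includes a single terminal stop codon. The function names are hypothetical.

```python
# Values copied from the Gene Information table.
START_BP = 197890
END_BP = 199566
GENE_LENGTH_BP = 1677
PROTEIN_LENGTH_AA = 558


def span_length(start, end):
    """Length of a closed interval [start, end].

    Assumption: coordinates are 1-based and inclusive.
    """
    return end - start + 1


def protein_length(gene_length_bp):
    """Amino acids encoded by a CDS of the given length.

    Assumption: the CDS ends in exactly one stop codon, which is
    counted in the gene length but not in the protein length.
    """
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))   # 1677
print(protein_length(GENE_LENGTH_BP))  # 558
```

Under these assumptions both derived values agree with the Gene Length and Protein Length rows.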
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCATCAA CAGAGCAATC GCGACGAGAT GTCTTTCGAA CTATCGCACA GTTACAGCAC GTCGAATGGC CTACATACGG GTCGACACCG CTGTACGACC GCAGTTCGGT ATCTGCTCTC CAATCGGACA TTCGAACCGT CGCCAGAGTC TGGTTCGAAC ACGACGTCCA CGACTCGATT GCGGAGTTCG TCTCGCAGTA CCCACTGAAA TACGTCGATT TCGGGCCGCA TGACGAGTAT TCTGGCTCCA CGCGGTATCA GATGCCGCAG TTGGTCCGGT TGTTCTTGCT CAAAGAGATC CACGGTTGGG ACCACGAAAC GGCACTCTTG ACGTTCCTTC GACAGCGACC GGAACTCCGT TGCGATCTCG GTTTCGAGCG TATGCCGGAT CAATCGACTC TATGGCGCAG TTGGCACCAG CGATTCACGA CCGAACTCCG CGAGACAATC GAAACGGCGG CTCGAACGAT CCTGATCAAG GCCCAGGATG CGGGTCTCAT AGTTCCACGA GAACCAGACC GAAAGCTCAG CTATCGAAAC GATGATCAAG ATGACTCAGC ACCGGACGAT CAGACCGTTC TGAAGCGGGC CAAGAAAATC ACCAACCACT TCAGTCGTAT CGTCTTCCCT GCATTCTCAC TGAATCGCGG TGAGGGATGT GAGATCCACG AGAACGCCTA CTGGGGCTTG CAGACCTATC TCGGGCTCCG TGAGAACTTG GCTGCCAACG AGGGTGCTCG GAGCTTCATT TACGAGTCAA CGCGAGAGCG GACACCACTC GGACACGCTC ACCGTGACCA CATTCGCGGT CTTTCGATAC CAGCGGTTCG GGAGATGTAC CGAGAGGCCG TCGATCGGCT GCTGGAAGAG GTTGCAGGGA CTGAGGAGTT CTTTCGAGCT GGAATCGTCG CGATCGACAT CACCGAAGCC GATCCGTTCA CGGGCGACCG AACGGGCCAC GAAGACGAGA TCATCGGGAC GAAAGAGCAG ACCGACGAGT ACGCCTATCA GTGGGCGACG GTCCAGTTGG TCGGGAACGC CGTCCCAATC GTGCTGGACG CGCGCCCGGT ACGGAAAGGA GAGTCACGAA AGGAGATCGT CGAGGACCTG CTGAATTCGG CTGAGGACCT CGTTCATGTC GATAACGTCC TGATGGACCG GGAGTTCGAT AGCCAACACG TTCTGGAGAT GCTCAGCCAG CGCGGGCTTT CCTACGTCGT TCCGAAACGG ATGCAGACCA GCGAGAAAGC TCAGGCCAAG CGGTTGCTCC GGCGTGACCG AGACCGATAC GAGACCGACC GGAAGCTGCA TCTCGGAAAG AACGAGTGGC ACCAGACGAC GCTAATCTAC CACCGCAAAG AGAACGCTGA GCACGACGAC CATCGACAGT ATTCAGTGTT CATGACGAAT TGCGGGAGTG GTCACCTCAC GGAGTACGGC TATCGCTGGG AGATCGAGAG CGGATACCGG TCGATTAAGC GGTTCATGGC TGCGACGACG TCGAAGGATT TCGGGCTTCG CTTCTTCTAC TTCGCGTTTG CGTGTCTGCT CTACTCGATC TGGCGGGCTG TCGATCTACT CGTCCAAGTC GAGTTGACCG GTGAATACGA GCACTCGCCC ATTGTGACGG CCGACAATAC GCTCACGCTG TTGAAGAAGG AGACCGGAAT CGGATAG
|
Protein sequence | MPSTEQSRRD VFRTIAQLQH VEWPTYGSTP LYDRSSVSAL QSDIRTVARV WFEHDVHDSI AEFVSQYPLK YVDFGPHDEY SGSTRYQMPQ LVRLFLLKEI HGWDHETALL TFLRQRPELR CDLGFERMPD QSTLWRSWHQ RFTTELRETI ETAARTILIK AQDAGLIVPR EPDRKLSYRN DDQDDSAPDD QTVLKRAKKI TNHFSRIVFP AFSLNRGEGC EIHENAYWGL QTYLGLRENL AANEGARSFI YESTRERTPL GHAHRDHIRG LSIPAVREMY REAVDRLLEE VAGTEEFFRA GIVAIDITEA DPFTGDRTGH EDEIIGTKEQ TDEYAYQWAT VQLVGNAVPI VLDARPVRKG ESRKEIVEDL LNSAEDLVHV DNVLMDREFD SQHVLEMLSQ RGLSYVVPKR MQTSEKAQAK RLLRRDRDRY ETDRKLHLGK NEWHQTTLIY HRKENAEHDD HRQYSVFMTN CGSGHLTEYG YRWEIESGYR SIKRFMAATT SKDFGLRFFY FAFACLLYSI WRAVDLLVQV ELTGEYEHSP IVTADNTLTL LKKETGIG
|
| |
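The sequences above are printed in space-separated blocks of ten. As a minimal sketch, not part of the record, the following shows one way to strip that grouping, compute GC content, and translate the coding sequence. The helper names are hypothetical. The codon table is the standard amino acid assignment, which translation table 11 shares; table 11 differs in its permitted start codons, which does not matter here because the gene begins with ATG. Only the first 30 bases of the gene sequence are used as sample input.

```python
BASES = "TCAG"
# Amino acids for codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
)
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def clean_sequence(text):
    """Remove the whitespace grouping and upper-case the sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq):
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence in the record.
sample = clean_sequence("ATGCCATCAA CAGAGCAATC GCGACGAGAT")
print(translate(sample))   # MPSTEQSRRD
print(gc_percent(sample))  # 50.0
```

The translated sample matches the first block of the protein sequence. Running the same functions on the full gene sequence would allow a comparison with the reported GC content and protein length; those results are not asserted here.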