Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hlac_0414 |
Symbol | |
ID | 7401031 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Halorubrum lacusprofundi ATCC 49239 |
Kingdom | Archaea |
Replicon accession | NC_012029 |
Strand | - |
Start bp | 434253 |
End bp | 435528 |
Gene Length | 1276 bp |
Protein Length | 424 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 643707478 |
Product | Protein of unknown function DUF1225 |
Protein accession | YP_002565087 |
Protein GI | 222478850 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0675] Transposase and inactivated derivatives |
TIGRFAM ID | [TIGR01766] transposase, IS605 OrfB family, central region |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.859835 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 0.270687 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGCGTG CCAACACTTT TGAGGTCGTG CCACAGACCG AGAACGACAA AGAGTGCCTC CTACGGCTAC TCGATGCATC CGCTTCTCTG TGGAACGAAC TGACCTACGA ACGTCGTCAG AACTACTTCG GTGACGGCGA CGTGTGGGAC ACTCCCGAGT ACCGAGGACG CTACAACGGC GTCGTCGGAA GCGCGACTGT TCAACAGGTC ACGCGCAAGA ACAGCGAAGC GTGGCGGTCG TTCTTCGCCC TCAAGGAGAA AGGCGAGTAC GCCAACCCAC CGTCGTACTG GGGCAACGAG GAGGACGGAC GCGAACTCCG TACCTACATC CGATGCAACC AGTACACGAT TGAGTGGGGG AAACGTAGCC GTCTCGAAAT CCCTGTCGGG CAAGAACTGA AAGACGAATA CGGACTCGGC TACCACGAAC GACTCCGCCT CGAAGTCCGA GGCAACCCGA AGTGGGACGG CAAACAGGGT CGTCTGGAAC TTGAGTACGA CGAGGTTAGC GACACGTTCA GGGCTTTTCA ACCAGTCACC GTACCTGATT CTCGACTGGA TTCACCACTG GCTTCGGAAG AAGCCGCCCT CGACGTTGGA GCGAACAATC TCGTCGCGTG TTCCACGACT ACTGGGAACC AGTACCTCTA CGACGGTCGT GAGTTGTTCG GACGGTTCCG CGAGACGACA GACGAAATCG CCCCGCCTAC AGTCGAAACT CCGAGAGGGT CGCTACTCCT CGAATCGGAT TCGACGGCTG TACCGACAGC GGACGAAGCG TCGTGACCAT GCACAGAACG CGCTGGTGCG CGACCTCGTT GAACGGCTGT ACGATGAGGG CGTGGCGACG GTGTACGTGG GCGACCTGAC AGACGTGCTG GAAGCGCATT GGTCGGTCAG GGTGAACGAG AAGACGCACA ACTTCTGGGC GTTCAAGAAG TTCATCCACC GTCTCGCGTG CGTCTGTGAG GAGTACGGCA TCAGCCTCGA AACCGAGTCG GAAGCGTGGA CGAGTCAGAC GTGTCCCGAG TGTGGCGACC ACGAGAAGAC GGTTCGCCAC GAGGATACGC TGACGTGTCC ATGTGGCTTC GAGGGGCACG CCGACCTCAC GGCGTCAGAG ACGTTCCTTC GGGAAAACAG CAATTGCGAA ATCAGGCCGA TGGCACGGCC CGTGCGATTC GAGTGGGACG ACCACGACTG GTCGGGGAAA CTATACCCTC ACGAAAGTCC CAAAGAAGTG CGCACGAACC CGCAAGTTGC CTCCGTGGGT CGGTAG
|
Protein sequence | MKRANTFEVV PQTENDKECL LRLLDASASL WNELTYERRQ NYFGDGDVWD TPEYRGRYNG VVGSATVQQV TRKNSEAWRS FFALKEKGEY ANPPSYWGNE EDGRELRTYI RCNQYTIEWG KRSRLEIPVG QELKDEYGLG YHERLRLEVR GNPKWDGKQG RLELEYDEVS DTFRAFQPVT VPDSRLDSPL ASEEAALDVG ANNLVACSTT TGNQYLYDGR ELFGRFRETT DEIARLQSKL REGRYSSNRI RRLYRQRTKR RDHAQNALVR DLVERLYDEG VATVYVGDLT DVLEAHWSVR VNEKTHNFWA FKKFIHRLAC VCEEYGISLE TESEAWTSQT CPECGDHEKT VRHEDTLTCP CGFEGHADLT ASETFLRENS NCEIRPMARP VRFEWDDHDW SGKLYPHESP KEVRTNPQVA SVGR
|
| |