Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hlac_2789 |
Symbol | |
ID | 7399196 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorubrum lacusprofundi ATCC 49239 |
Kingdom | Archaea |
Replicon accession | NC_012028 |
Strand | - |
Start bp | 47932 |
End bp | 50712 |
Gene Length | 2781 bp |
Protein Length | 926 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 643706618 |
Product | SMC domain protein |
Protein accession | YP_002564244 |
Protein GI | 222475723 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0419] ATPase involved in DNA repair |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAATTCA AGACCCTCAT TTTAGAGAAT ATCCGCAGCT ACGAGAACGG TCATATCGAC TTCGAGGACG GCGAAAATCT CCTCTTCGGG CTGAACGGCG CGGGGAAGTC CACCATCCTG CAAGGGGTGT TCGGTGGGCT GTTCCAGACG AAAATGAAGT ACCAGGTCGG CAATGACTTT GATTTACCAG ACCTTGTTCG CACGCAGGCT GACGAAGGGC GTATCGAACT TGTCTTCGAG GCAGGCGGAG CCGAGTACAC CGTTGAGTGG GTCATCCAGA AAAGCTACGA CGACGACGAC GAGGTTAACG GGGCCAAGAC CAAGCAAGGC TACCCGAAAC TCTCCTCGGA CGCCCTCTCC GAAGACGTGT CCAGCCTTGG CGACGTGCAG ACCGAAATTC AGCGCGTTGT CGGGATGGAC GCAGAGTCGT TCGTCAACAG CGTCTACGTC CAGCAAGGCG ACATCACGCG CCTCATCCAT GCCAGCACTG AAGACCGTCG GAAGATTCTT GACGGCCTCC TTGGTCTGAA CCGTCTCGAT GAGTACGTCG ACCGCATGGA AGACGCTCGC CGTGAGTACA AGAAGGCAAA GCGTGATTCG AACAGTCGCC TTGACGAAAC CAAGAAGCGA CTTCAAGATC TGCCTGCGGA GGATGAGATC CAATCCCAAA TAAACAAGAC TGACAAGAAG ATCTCGGACA TCAAAGGCGA CATCGACGAT CTCGAATCGA AAATTGATGG TCTTGAGGAC GAGCGTGAGA CGAAGACGGA GACCCTTGAC CGCATCGACG ACCTCCAAGT GGAACTCGAT GAGACCCGTG ACAAGTACGA GGACGCCGAA AGCGACCACG AGACGTACAA GGAAGAACTC CAAGAGGAAA AGGAAGCACA GCGCAAAGCT GAAGATGCGC GTGACGAGGC ACAGGAGGAT CTTGAAGCCT TAGCCGAGCG GGATGAACTC GCTGACTACG ACGTACTGGA TGCTGACACC GCCGAGCAGG CGCACCTAAC GGCGCAAAAA GAAGCCGAAA TGGCTCGCTC AGAACGTCAA TCAATCGAGG AAGGGCGACT CAACTCCCTT CAGTCGGACC TTGACCGAGT CGAATCCGAC ATCGAAGAAA CAGTAGACGA AATCGAGTCA AAAGAGAGTG AACTCGACGA CGTCCGTGCT GAGGTTGAGT CCGCTGAGTC GCGCAAGTCC GATGCCGAGA AGCGTGTCGA GAGATTCGAA AATGCCCTCG AAAATGACCG AACGACGGTC GCTGATCTTG CGGTGGAACT CGATATCCCC CAAGATGCGT CGGTAGACGA CATTGATGAG TCACACATCC CCGAGACAAG GCAAACTATT TCGTCCGAAC GAGAGCAAAT CGTCGAAAAA AAGAGCCACC TCGGAACTCT ACAAGAGCAG GTCGACAATC TCGAATCCGA GGGTGAATGC CCGGTTTGTG GTGCGACAGA CGACGCTCAT GACATCGACT CGGAGGCGGT TGCTGCAGAG CACAAGGCCA ACCTCGAAGC TGCAGAGGAG CGTCTCGCGG AACTCGACGA AGCACAAGAG ACGCTTGATG ATCTTCGTGA AGCAGTCCGC GACGCGAAAT CTACCCGGAA CGACCTCGAA GATGCCCGCG GAGAGGTTGA ACAGGCCGAC TCCGATATTG AAGATGCTGA AGCGCGCGTT GATGAGGTGG AGTCGACGCT GGAGGGACTC CGTGGTGAAC TCGAAGAGTA CCGTTCAAAG AAGGAGCGAC TCAACGACGA AATCGACGAC GCGAACGAGG ATCTCGAAGA CGCTGAAGTC CGAGTCAACA GAGTGGAGAC CGTCGAAGAA CTGCTCTCAG AAGCAGTCGA ACTACACGGA CGTATCTCCG AACTGGAATC CGATATCGAC CGCCATAAGG AGAACCGCGA GCGCATCGGC GAGCTTCGAC GGACGGCGTA CGACCGAATG TCCGACCTGA AAGAAGACGT GGAGGAGCTT GAGGCGGAAC TTGGTGATCT TGACGCCGAG TCCATCAGAG ACGATATCAA CGAGATTGAC GATTACCTTG ACGAATTCCG CGAGACCTTG GATGGAAAAG AGTCCGAAGA GGAGGAGCTT GTCAATAAAC TCGCTTCTCT TAACTCGAAG AAGGAGCAGA TACAGGAAGA GACCAAGCGC AAGAAAATGC TCGCCAGTCA GCGCGAGTGG GCTAATGAAC GCATTGACGA GGCGACAGAG GTCATCAAGA AGTATAAGGA GGTTCGTGGC AAGCTCCGAC AGCAGAATAT CTACAAGCTG AACGAGTACA CTAACGAGGT GTTTTCGGAC CTCTACAGGG ATCAGTCCTA TCGCGGCGTC CACATCGACA AAAAGTACAA CATCTACCTC ATCGCACAGG ACGGTGAGAA ACTTAAGCCG CAACTGTCTA GTGGTGGAGA GTCCGGCATA CTGAACCTCG CACTTCGGGC GGGCGTCTAC AAAATTATCA CCGAACGCGA CGGCGTCGCC GGTGCGGCGC TCCCGCCATT CATCCTCGAC GAGCCGACGA CATTCCTTGA CTCGGGGCAC GTCGGCGAGT TACAGACGAT GATACAGACC ATCGGCGAGT GGAATGTCCC ACAAATTCTG CTCGTCAGCC ACGACGAAAC CCTCATCGAG AACAGCGATC ACGCAATCCT CGTCGAAAAG GATCCTCAGA CCGAGACCAG CCGCGTCCGT TCCGGCCATG CGGCGGTCGA GGAAGCAACG AATGACACCG AGACTGAAGC CACCGTGGGC GACACGGCCA CCGACGACTG A
|
Protein sequence | MKFKTLILEN IRSYENGHID FEDGENLLFG LNGAGKSTIL QGVFGGLFQT KMKYQVGNDF DLPDLVRTQA DEGRIELVFE AGGAEYTVEW VIQKSYDDDD EVNGAKTKQG YPKLSSDALS EDVSSLGDVQ TEIQRVVGMD AESFVNSVYV QQGDITRLIH ASTEDRRKIL DGLLGLNRLD EYVDRMEDAR REYKKAKRDS NSRLDETKKR LQDLPAEDEI QSQINKTDKK ISDIKGDIDD LESKIDGLED ERETKTETLD RIDDLQVELD ETRDKYEDAE SDHETYKEEL QEEKEAQRKA EDARDEAQED LEALAERDEL ADYDVLDADT AEQAHLTAQK EAEMARSERQ SIEEGRLNSL QSDLDRVESD IEETVDEIES KESELDDVRA EVESAESRKS DAEKRVERFE NALENDRTTV ADLAVELDIP QDASVDDIDE SHIPETRQTI SSEREQIVEK KSHLGTLQEQ VDNLESEGEC PVCGATDDAH DIDSEAVAAE HKANLEAAEE RLAELDEAQE TLDDLREAVR DAKSTRNDLE DARGEVEQAD SDIEDAEARV DEVESTLEGL RGELEEYRSK KERLNDEIDD ANEDLEDAEV RVNRVETVEE LLSEAVELHG RISELESDID RHKENRERIG ELRRTAYDRM SDLKEDVEEL EAELGDLDAE SIRDDINEID DYLDEFRETL DGKESEEEEL VNKLASLNSK KEQIQEETKR KKMLASQREW ANERIDEATE VIKKYKEVRG KLRQQNIYKL NEYTNEVFSD LYRDQSYRGV HIDKKYNIYL IAQDGEKLKP QLSSGGESGI LNLALRAGVY KIITERDGVA GAALPPFILD EPTTFLDSGH VGELQTMIQT IGEWNVPQIL LVSHDETLIE NSDHAILVEK DPQTETSRVR SGHAAVEEAT NDTETEATVG DTATDD
|
| |