Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Htur_3091 |
Symbol | |
ID | 8743711 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haloterrigena turkmenica DSM 5511 |
Kingdom | Archaea |
Replicon accession | NC_013743 |
Strand | + |
Start bp | 3172581 |
End bp | 3175544 |
Gene Length | 2964 bp |
Protein Length | 987 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 646513675 |
Product | excinuclease ABC, A subunit |
Protein accession | YP_003404629 |
Protein GI | 284166350 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0178] Excinuclease ATPase subunit |
TIGRFAM ID | [TIGR00630] excinuclease ABC, A subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCAAGG ACTACATCGA GGTGCGGGGC GCGGAGGAAC ACAACCTCAA GGACCTCGAC GTCACCATTC CGCGCGAGGA GTTCACCGTC GTCACCGGCC TGTCGGGGTC GGGCAAGTCC TCGCTGGCGT TCGAGACGAT CTACGCCGAG GGCCAGCGGC GGTACATCGA GAGCCTCTCA GCGTACGCCC GGAACTTCCT CGGGCAGATG GACAAGCCGC AGGTCGAGAC CGTCGAAGGG CTCTCCCCGG CGATCTCGAT CGACCAGAAG AACGCCGCGA ACAACCCCCG ATCGACGGTG GGGACCGTCA CGGAACTCCA CGACTATCTC CGTCTCCTCT ACGCCCGCGT CGGCACCCCC CACTGTCCCG AGTGCGGCCG CGAAGTCGGC GAACAGTCGG CCCAGAACAT GGTCGAACGC ATCCTCGAGC TCCCCGAGGG CACGAAGGTC AAGCTGGCGG CGCCGGTCGT CCGCGACCAG AAGGGGGCCT TCGAGGACCT CTTCGAGGAA TTAGTGTCGG AGGGATACGC CCGCGTCGAG ATCGACGGCG AGGAACACGA CCTCACGCTG GACGATCCCG ATCTGGACGA GAACTTCGAT CACACCGTCG ACGTCATCGT CGACCGCGTG AAGGTCTCCG CGGAGGACCG CCCGCGCATC ATCGACAGCG TCGAAACCGC GCTCGACGAG GCCGAGGGCG TCCTGAAGGT CATCCTGCCG GACGCGCCCA AAGACGTCGC GAGCGACCTC GGCGAGGCGG CCCGTCGGAC GGGCGCGCTG GGCGACGAGA CCGAGGAGGA CGACCGCTTC GTCGTCGAGT TCTCGAAGGA CCTCGCCTGT ACCCACTGCG GGATCGACGT CCCCGAGATC GAGACCCGCT CCTTTTCGTT CAACTCGCCC CACGGCGCCT GTCCCGAGTG CGAGGGGCTG GGCGAGACCA AGGAGGTCGA CGAGGATCTG GTCGTCCAGG ACGAGTCCAA GCCGCTCAAG CACGTCTTCG AGGCCTGGAG CTACAACCGG TCGTACTACC GGACCCGCCT CGACGCCGTC GCCGAGCACT TCGGCGTCTC GCTGTCGACG CCGTTCGAAG AGTTAGACGA GGACGTCCAG CGGGCGTTCC TCTACGGCAC CGACGACGAG GTCGTGTTCA AGCGAAGCAC GAAGAACGGT ACCCGCCGGA AGCGAAAGCG CTTCGAGGGC GTCATTCCGA ACCTCGAGCG CCGGTATATC GAGACCGACT CCGACTCGAC CAGAGAGCAC ATCGAGGACT ACATGTCCGC GACGGAGTGT CCGGCCTGTG ACGGCACGCG GCTGAAGGCC GCGAGTCGGG CCGTGCTCGT CGACGGGACG GCGATCACTG AGATCAACGC GATGAGCATC GGCGACGCCC TCGAGCACTT CGAATCGATG GAGGCGAACT TCACCGAACG CGAGAAGGTG ATCGCCGAGG AGATCTTAAA GGAGATCCGC GCGCGTCTGG GCTTCATGTG CGAGGTCGGC CTCGAGTACC TTACGCTCGA TCGGGAGGCC GCGACGCTGT CGGGCGGCGA GAGCCAGCGC ATCCGCCTCG CCACGCAGAT CGGTTCCGGC CTCGTCGGCG TCCTCTACGT GTTAGACGAG CCCTCGATCG GGCTCCACCA GCGGGACAAC GACCGCCTGC TGGACACCTT GGAGGAACTG CGGGACCTCG GAAACACCCT CATCGTCGTC GAACACGACG AGGAGACGAT GCGCCGAGCG GACCAGGTCA TCGACATGGG GCCCGGTCCG GGCAAGCGCG GCGGCGAGGT CGTCGCCAAC GGCCCCGTCG AAGAGGTCAA GGCGACCGAG GGCTCCGTGA CGGGCGAGTA CCTCTCCGGC CGCCGGCAGA TTCCGGTCCC CGACGAACGC CGCGACGCCG ACGGGGCACT CACGATCCGC GGCGCCCGCC AGCACAACTT GGACGACGTC GACGTCGACA TCCCGCTCGG CAACTTCACG GCGATCACGG GCGTCTCCGG CTCCGGCAAA TCGACGCTCA TGCACGAGGT GCTCTACAAG GGACTGGCCC GCGAGATGAA CGACAACACG TCGGTCATTC CTGGCGACCA CGACGCCCTC GAGGGCCTCG AGGACATCGA GACCGTGCGC CTGATCGACC AGTCGCCGAT CGGCCGCACA CCCCGCTCGA ACCCGGCGAC GTACACCAAC GTCTTCGACT ACATCCGCGA GCTGTTCGCT CAGACGAAGC TGGCGAAACA GCGCGGCTAC GAGAAGGGAC GGTTCTCCTT CAACGTCAAG GGCGGCCGCT GCGAGGAATG CGGCGGACAG GGCACCGTCA AGATCGAGAT GAACTTCCTG AGCGACGTCT ACGTCCCCTG TGAGGAGTGT GACGGCGCCC GTTACAACGA CGCCACGCTC GACGTCACCT ACAAGGGCAA GACCATCGCC GACGTCCTCG AGATGGAAGT CGAGGAGGCC TACGAGTTCT TCGAGTCCTC GAGCCAGATC CGACGGCGCC TGAAGCTGCT GAAGGACGTC GGCCTCGACT ACATGAAGCT CGGCCAGCCC TCCACGACGC TGTCGGGCGG CGAGGCCCAG CGGATCAAGC TCGCCGAGGA GTTGGGGAAG AAGGACACGG GGGAGACGCT CTACCTGCTC GACGAGCCCA CCACCGGGCT CCACAGCGAG GACGAGCGCA AGCTCATCGA CGTCCTCCAC CGGCTGACCG ACAACGGCAA CACCGTCGTC GTCATCGAGC ACGAGCTCGA CCTCGTGAAG AACGCCGACC ACATCATCGA TCTCGGCCCC GAGGGCGGCG AGAACGGCGG CGAGATCGTC GCGACCGGTA CGCCCGAGCA GGTCGCGCAA CTCGAAGATT CCCACACCGG ACGCTACCTG CGTGATCTGC TGCCGAAAGT GGATCTCGAG GGGCCGCGCG GCGAGCGCGT CGAGCCCGTG ACGGCGCCGA TGGACGACGA CTGA
|
Protein sequence | MSKDYIEVRG AEEHNLKDLD VTIPREEFTV VTGLSGSGKS SLAFETIYAE GQRRYIESLS AYARNFLGQM DKPQVETVEG LSPAISIDQK NAANNPRSTV GTVTELHDYL RLLYARVGTP HCPECGREVG EQSAQNMVER ILELPEGTKV KLAAPVVRDQ KGAFEDLFEE LVSEGYARVE IDGEEHDLTL DDPDLDENFD HTVDVIVDRV KVSAEDRPRI IDSVETALDE AEGVLKVILP DAPKDVASDL GEAARRTGAL GDETEEDDRF VVEFSKDLAC THCGIDVPEI ETRSFSFNSP HGACPECEGL GETKEVDEDL VVQDESKPLK HVFEAWSYNR SYYRTRLDAV AEHFGVSLST PFEELDEDVQ RAFLYGTDDE VVFKRSTKNG TRRKRKRFEG VIPNLERRYI ETDSDSTREH IEDYMSATEC PACDGTRLKA ASRAVLVDGT AITEINAMSI GDALEHFESM EANFTEREKV IAEEILKEIR ARLGFMCEVG LEYLTLDREA ATLSGGESQR IRLATQIGSG LVGVLYVLDE PSIGLHQRDN DRLLDTLEEL RDLGNTLIVV EHDEETMRRA DQVIDMGPGP GKRGGEVVAN GPVEEVKATE GSVTGEYLSG RRQIPVPDER RDADGALTIR GARQHNLDDV DVDIPLGNFT AITGVSGSGK STLMHEVLYK GLAREMNDNT SVIPGDHDAL EGLEDIETVR LIDQSPIGRT PRSNPATYTN VFDYIRELFA QTKLAKQRGY EKGRFSFNVK GGRCEECGGQ GTVKIEMNFL SDVYVPCEEC DGARYNDATL DVTYKGKTIA DVLEMEVEEA YEFFESSSQI RRRLKLLKDV GLDYMKLGQP STTLSGGEAQ RIKLAEELGK KDTGETLYLL DEPTTGLHSE DERKLIDVLH RLTDNGNTVV VIEHELDLVK NADHIIDLGP EGGENGGEIV ATGTPEQVAQ LEDSHTGRYL RDLLPKVDLE GPRGERVEPV TAPMDDD
|
| |