Gene Htur_3091 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHtur_3091 
Symbol 
ID8743711 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaloterrigena turkmenica DSM 5511 
KingdomArchaea 
Replicon accessionNC_013743 
Strand
Start bp3172581 
End bp3175544 
Gene Length2964 bp 
Protein Length987 aa 
Translation table11 
GC content66% 
IMG OID646513675 
Productexcinuclease ABC, A subunit 
Protein accessionYP_003404629 
Protein GI284166350 
COG category[L] Replication, recombination and repair 
COG ID[COG0178] Excinuclease ATPase subunit 
TIGRFAM ID[TIGR00630] excinuclease ABC, A subunit 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCAAGG ACTACATCGA GGTGCGGGGC GCGGAGGAAC ACAACCTCAA GGACCTCGAC 
GTCACCATTC CGCGCGAGGA GTTCACCGTC GTCACCGGCC TGTCGGGGTC GGGCAAGTCC
TCGCTGGCGT TCGAGACGAT CTACGCCGAG GGCCAGCGGC GGTACATCGA GAGCCTCTCA
GCGTACGCCC GGAACTTCCT CGGGCAGATG GACAAGCCGC AGGTCGAGAC CGTCGAAGGG
CTCTCCCCGG CGATCTCGAT CGACCAGAAG AACGCCGCGA ACAACCCCCG ATCGACGGTG
GGGACCGTCA CGGAACTCCA CGACTATCTC CGTCTCCTCT ACGCCCGCGT CGGCACCCCC
CACTGTCCCG AGTGCGGCCG CGAAGTCGGC GAACAGTCGG CCCAGAACAT GGTCGAACGC
ATCCTCGAGC TCCCCGAGGG CACGAAGGTC AAGCTGGCGG CGCCGGTCGT CCGCGACCAG
AAGGGGGCCT TCGAGGACCT CTTCGAGGAA TTAGTGTCGG AGGGATACGC CCGCGTCGAG
ATCGACGGCG AGGAACACGA CCTCACGCTG GACGATCCCG ATCTGGACGA GAACTTCGAT
CACACCGTCG ACGTCATCGT CGACCGCGTG AAGGTCTCCG CGGAGGACCG CCCGCGCATC
ATCGACAGCG TCGAAACCGC GCTCGACGAG GCCGAGGGCG TCCTGAAGGT CATCCTGCCG
GACGCGCCCA AAGACGTCGC GAGCGACCTC GGCGAGGCGG CCCGTCGGAC GGGCGCGCTG
GGCGACGAGA CCGAGGAGGA CGACCGCTTC GTCGTCGAGT TCTCGAAGGA CCTCGCCTGT
ACCCACTGCG GGATCGACGT CCCCGAGATC GAGACCCGCT CCTTTTCGTT CAACTCGCCC
CACGGCGCCT GTCCCGAGTG CGAGGGGCTG GGCGAGACCA AGGAGGTCGA CGAGGATCTG
GTCGTCCAGG ACGAGTCCAA GCCGCTCAAG CACGTCTTCG AGGCCTGGAG CTACAACCGG
TCGTACTACC GGACCCGCCT CGACGCCGTC GCCGAGCACT TCGGCGTCTC GCTGTCGACG
CCGTTCGAAG AGTTAGACGA GGACGTCCAG CGGGCGTTCC TCTACGGCAC CGACGACGAG
GTCGTGTTCA AGCGAAGCAC GAAGAACGGT ACCCGCCGGA AGCGAAAGCG CTTCGAGGGC
GTCATTCCGA ACCTCGAGCG CCGGTATATC GAGACCGACT CCGACTCGAC CAGAGAGCAC
ATCGAGGACT ACATGTCCGC GACGGAGTGT CCGGCCTGTG ACGGCACGCG GCTGAAGGCC
GCGAGTCGGG CCGTGCTCGT CGACGGGACG GCGATCACTG AGATCAACGC GATGAGCATC
GGCGACGCCC TCGAGCACTT CGAATCGATG GAGGCGAACT TCACCGAACG CGAGAAGGTG
ATCGCCGAGG AGATCTTAAA GGAGATCCGC GCGCGTCTGG GCTTCATGTG CGAGGTCGGC
CTCGAGTACC TTACGCTCGA TCGGGAGGCC GCGACGCTGT CGGGCGGCGA GAGCCAGCGC
ATCCGCCTCG CCACGCAGAT CGGTTCCGGC CTCGTCGGCG TCCTCTACGT GTTAGACGAG
CCCTCGATCG GGCTCCACCA GCGGGACAAC GACCGCCTGC TGGACACCTT GGAGGAACTG
CGGGACCTCG GAAACACCCT CATCGTCGTC GAACACGACG AGGAGACGAT GCGCCGAGCG
GACCAGGTCA TCGACATGGG GCCCGGTCCG GGCAAGCGCG GCGGCGAGGT CGTCGCCAAC
GGCCCCGTCG AAGAGGTCAA GGCGACCGAG GGCTCCGTGA CGGGCGAGTA CCTCTCCGGC
CGCCGGCAGA TTCCGGTCCC CGACGAACGC CGCGACGCCG ACGGGGCACT CACGATCCGC
GGCGCCCGCC AGCACAACTT GGACGACGTC GACGTCGACA TCCCGCTCGG CAACTTCACG
GCGATCACGG GCGTCTCCGG CTCCGGCAAA TCGACGCTCA TGCACGAGGT GCTCTACAAG
GGACTGGCCC GCGAGATGAA CGACAACACG TCGGTCATTC CTGGCGACCA CGACGCCCTC
GAGGGCCTCG AGGACATCGA GACCGTGCGC CTGATCGACC AGTCGCCGAT CGGCCGCACA
CCCCGCTCGA ACCCGGCGAC GTACACCAAC GTCTTCGACT ACATCCGCGA GCTGTTCGCT
CAGACGAAGC TGGCGAAACA GCGCGGCTAC GAGAAGGGAC GGTTCTCCTT CAACGTCAAG
GGCGGCCGCT GCGAGGAATG CGGCGGACAG GGCACCGTCA AGATCGAGAT GAACTTCCTG
AGCGACGTCT ACGTCCCCTG TGAGGAGTGT GACGGCGCCC GTTACAACGA CGCCACGCTC
GACGTCACCT ACAAGGGCAA GACCATCGCC GACGTCCTCG AGATGGAAGT CGAGGAGGCC
TACGAGTTCT TCGAGTCCTC GAGCCAGATC CGACGGCGCC TGAAGCTGCT GAAGGACGTC
GGCCTCGACT ACATGAAGCT CGGCCAGCCC TCCACGACGC TGTCGGGCGG CGAGGCCCAG
CGGATCAAGC TCGCCGAGGA GTTGGGGAAG AAGGACACGG GGGAGACGCT CTACCTGCTC
GACGAGCCCA CCACCGGGCT CCACAGCGAG GACGAGCGCA AGCTCATCGA CGTCCTCCAC
CGGCTGACCG ACAACGGCAA CACCGTCGTC GTCATCGAGC ACGAGCTCGA CCTCGTGAAG
AACGCCGACC ACATCATCGA TCTCGGCCCC GAGGGCGGCG AGAACGGCGG CGAGATCGTC
GCGACCGGTA CGCCCGAGCA GGTCGCGCAA CTCGAAGATT CCCACACCGG ACGCTACCTG
CGTGATCTGC TGCCGAAAGT GGATCTCGAG GGGCCGCGCG GCGAGCGCGT CGAGCCCGTG
ACGGCGCCGA TGGACGACGA CTGA
 
Protein sequence
MSKDYIEVRG AEEHNLKDLD VTIPREEFTV VTGLSGSGKS SLAFETIYAE GQRRYIESLS 
AYARNFLGQM DKPQVETVEG LSPAISIDQK NAANNPRSTV GTVTELHDYL RLLYARVGTP
HCPECGREVG EQSAQNMVER ILELPEGTKV KLAAPVVRDQ KGAFEDLFEE LVSEGYARVE
IDGEEHDLTL DDPDLDENFD HTVDVIVDRV KVSAEDRPRI IDSVETALDE AEGVLKVILP
DAPKDVASDL GEAARRTGAL GDETEEDDRF VVEFSKDLAC THCGIDVPEI ETRSFSFNSP
HGACPECEGL GETKEVDEDL VVQDESKPLK HVFEAWSYNR SYYRTRLDAV AEHFGVSLST
PFEELDEDVQ RAFLYGTDDE VVFKRSTKNG TRRKRKRFEG VIPNLERRYI ETDSDSTREH
IEDYMSATEC PACDGTRLKA ASRAVLVDGT AITEINAMSI GDALEHFESM EANFTEREKV
IAEEILKEIR ARLGFMCEVG LEYLTLDREA ATLSGGESQR IRLATQIGSG LVGVLYVLDE
PSIGLHQRDN DRLLDTLEEL RDLGNTLIVV EHDEETMRRA DQVIDMGPGP GKRGGEVVAN
GPVEEVKATE GSVTGEYLSG RRQIPVPDER RDADGALTIR GARQHNLDDV DVDIPLGNFT
AITGVSGSGK STLMHEVLYK GLAREMNDNT SVIPGDHDAL EGLEDIETVR LIDQSPIGRT
PRSNPATYTN VFDYIRELFA QTKLAKQRGY EKGRFSFNVK GGRCEECGGQ GTVKIEMNFL
SDVYVPCEEC DGARYNDATL DVTYKGKTIA DVLEMEVEEA YEFFESSSQI RRRLKLLKDV
GLDYMKLGQP STTLSGGEAQ RIKLAEELGK KDTGETLYLL DEPTTGLHSE DERKLIDVLH
RLTDNGNTVV VIEHELDLVK NADHIIDLGP EGGENGGEIV ATGTPEQVAQ LEDSHTGRYL
RDLLPKVDLE GPRGERVEPV TAPMDDD