Gene Hore_16220 details


Gene Information

Locus tag: Hore_16220
Symbol:
ID: 7312658
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halothermothrix orenii H 168
Kingdom: Bacteria
Replicon accession: NC_011899
Strand:
Start bp: 1740648
End bp: 1743473
Gene Length: 2826 bp
Protein Length: 941 aa
Translation table: 11
GC content: 45%
IMG OID: 643612069
Product: excinuclease ABC, A subunit
Protein accession: YP_002509366
Protein GI: 220932458
COG category: [L] Replication, recombination and repair
COG ID: [COG0178] Excinuclease ATPase subunit
TIGRFAM ID: [TIGR00630] excinuclease ABC, A subunit
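
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a stop codon that is not counted in the protein length. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 1740648
end_bp = 1743473
gene_length_bp = 2826
protein_length_aa = 941

# Assumption: coordinates are inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# An unspliced CDS should be a whole number of codons; the final codon is
# assumed to be the stop codon, which adds no amino acid.
codons, remainder = divmod(span, 3)

coords_consistent = span == gene_length_bp
protein_consistent = remainder == 0 and codons - 1 == protein_length_aa

print(span, codons - 1, coords_consistent, protein_consistent)
```

Under these assumptions both checks pass for this record: the span is 2826 bp, which is 942 codons, or 941 amino acids plus the stop codon.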


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value: 0.266346
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGTCAAGG AAATGATTTA TGTTAAGGGA GCCCAGGAGC ATAATTTAAA GAATATTGAT 
GTCAGGATTC CAAGGGATAA ACTTATAGTT ATAACCGGTT TGAGTGGTTC TGGTAAATCT
TCACTGGCTT TTGATACCAT TTATGCTGAG GGTCAAAGGC GTTATGTCGA GTCTCTATCA
GCCTATGCCC GGCAGTTTTT AGGACAGATG GAAAAACCCC GGGTTGAATA TATAGAAGGT
TTATCTCCTG CTATTTCAAT TGATCAGAAA ACCACCAGTA AAAACCCCCG TTCCACAGTC
GGTACTGTTA CTGAAATCTA TGACTATCTG CGCCTTCTGT ATGCCAGAAT AGGAAGGCCC
CACTGCCCCG AATGCGGGCG GGCTATCTCA TCCCAGAGTG TTGATGAGAT AGTGGATCAG
GTTCTGGATC TACCGGGAAG GACCAAAATA CAGATTCTGT CACCGGTGGT CAGGGGAAGA
AAGGGTGAGC ATCAGAAGGT ATTTACAAGG GCCCTGAGGG ATGGTTTTGT CAGGGCCCGG
GTAGATGGAA GGGTCATCAT TCTGGGGGAA GAAGAGGTTA GCCTTGAGAA AAATTTAAAA
CATGATATTG AAGTTGTCGT CGATAGACTG GTTATTAAAG AAGGTATCCG GGAGCGATTG
ACCGATTCCA TTGAGACTGC CCTGGAATAC AGTGAAGGGC TGGTCATTGT AGATGTTATC
GATGGAGATG AAATGGTATT TAGTGAGAAA TTTGCCTGTC CCGAATGTGG TATCAGTCTG
GAAGAAATGT CTCCCCGGAT GTTTTCTTTC AACAGCCCCT ATGGGGCCTG TCCCAATTGT
GATGGACTGG GTATAAAAAA GGAATTTGAT CCTGATCTTA TCCTTGATAA GGAAAAATCC
ATCAGTAACG GGGCCATTAT TCCCTGGCGT AATTCAACAA GTCGTTATTA TCCTCAGCTC
CTGTCAGCCC TGGCTGAAGA GTATAACTTC AGCCTGGATA CCCCCCTGGA GCAACTGGAT
GATGACATCA TAGATATTAT TCTGTACGGT TCGGACAGGC AACTTGTTTT TCCCTATACC
AACCGCTATG GGAGAACCAG GCAGCATAAA ACCTATTTTA AGGGGATAGT TGGTTATTTA
AAAAGGCGGG TTAATGAATC TGACTCTCCG ACAGCCCATC GCAGGCTGGA AAGATATATG
AGTGAACGGC CCTGTTCTAC CTGTCAGGGG GGCCGGTTGC GGCCTGAGGT TCTGGCTGTG
ACGGTCGGGG GTAAATCAAT TGCCGAATTT ACCCGTTATT CAATTAAGGA AGCCTATGAT
TTCATAGAAG AATTAAATTT AACAGAAAGG GAACAGTATA TAAGTCAGGA AATTATTAAA
GAGATTAAAA ACAGGTTACG GTTTTTAATA GATGTTGGTC TGGACTATCT TACCCTGGAC
CGGCCGGCCG GGAGTTTGTC CGGGGGTGAG GCCCAGCGAA TCAGGCTGGC TACCCAGATT
GGTTCAGGAC TGGTAGGGGT TTTATATATT CTAGATGAAC CGAGTATAGG GCTCCACCAG
AGGGATAATA ACCGTTTAAT AAGAACCCTT GAACATATCA GGGATCTGGG GAATACGGTT
ATTGTTGTTG AACATGATGA AGATACCATC AGGGCAGCTG ACCACATTAT TGACATCGGT
CCCAGGGCCG GTAAACACGG TGGCCGGGTG GTAGCCCAGG GTTCTCTGGA AGACATTGTT
TCCAGTAAAG AATCCCTGAC CGGTCAGTAT TTATCTGGAG AAAAGAAAAT ACCGGTACCT
GAAAAGAGGG TTAGACCCAA TGGTAAATAC CTGGAGATTA AGGGGGCCAG GCAGCATAAC
CTTAAAAATA TAGATGTCAA AATCCCCCTG GGGACTTTTA CCTGTGTTAC CGGTGTTTCC
GGTTCAGGGA AGAGTACCCT GATTAATTTG ACTTTGAAAC GAAAGTTAAT GCAGCATTTT
TATGATTCCA CCCTGAGGCC GGGTGAACAT GATGAAATAA AGGGGCTGGA GTATGTCGAT
AAGATTATTA ATATAGACCA GTCACCCATA GGCAGGACCC CTAGGTCAAA TCCAGCAACC
TACACCAAGG TTTTTGATTA TATCCGGGAT GTGTTTGCCA AAACCCCCGA GGCCCGGAGG
ATGGGGTATA AGAAGGGGAG GTTCAGTTTT AACGTAAAAG GGGGCCGTTG TGAGGCCTGT
AAAGGTGATG GCATAATCAA GATAGAAATG CATTTTCTGC CTGATGTTTA TGTCCCCTGT
GAGGTATGCG GGGGTAAAAG ATATAACCGG GAGACCCTGG AGATAAAATA TAAGGGAAAA
ACAATAGCAG ATATACTCGA GATGACGGTT GAGGAAGCCC TGGAGTTTTT TGCCAATGTC
AATCCGATAA AGAGGCGGTT ACAGACCCTT TATGATGTTG GTCTCGGTTA TATTCGACTG
GGTCAACCGG CAACAACTTT ATCCGGGGGT GAGGCCCAGA GGATAAAGAT TGCCTCTGAA
CTGGGAAAAA GGAGTACCGG AAAGACCATC TATATTCTGG ATGAACCAAC TACCGGTCTG
CATTTTGAGG ATGTAAAGAA ACTTCTTGAA GTACTCTTCA GACTCCGGGA AGGTGGTAAT
ACCGTCATTG TAATAGAGCA TAACCTTGAT GTTATAAAAG CTGCCGATTA TATCATAGAC
CTTGGCCCTG AAGGAGGGGA CCGGGGAGGG CAGGTTATTG CTACCGGAAC CCCGGAAGAG
GTTGCAGCCA ACCCTGAATC TTATACGGGA CAGTTTTTAA GTAAATATCT CAATAATAAA
TGGTGA
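
The gene sequence is printed in space-separated blocks of ten bases, so the whitespace has to be stripped before any computation. The sketch below shows one way to do that and to compute GC content. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the sample is only the first line of the sequence, so its GC fraction is not expected to match the 45% reported for the whole gene.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in an already cleaned sequence."""
    if not seq:
        return 0.0
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence, copied from the record.
first_line = "ATGGTCAAGG AAATGATTTA TGTTAAGGGA GCCCAGGAGC ATAATTTAAA GAATATTGAT"

seq = clean_sequence(first_line)
print(len(seq), f"{gc_fraction(seq):.1%}")
```

Passing the full block of sequence lines to `clean_sequence` in the same way would give the value to compare against the GC content field.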
 
Protein sequence
MVKEMIYVKG AQEHNLKNID VRIPRDKLIV ITGLSGSGKS SLAFDTIYAE GQRRYVESLS 
AYARQFLGQM EKPRVEYIEG LSPAISIDQK TTSKNPRSTV GTVTEIYDYL RLLYARIGRP
HCPECGRAIS SQSVDEIVDQ VLDLPGRTKI QILSPVVRGR KGEHQKVFTR ALRDGFVRAR
VDGRVIILGE EEVSLEKNLK HDIEVVVDRL VIKEGIRERL TDSIETALEY SEGLVIVDVI
DGDEMVFSEK FACPECGISL EEMSPRMFSF NSPYGACPNC DGLGIKKEFD PDLILDKEKS
ISNGAIIPWR NSTSRYYPQL LSALAEEYNF SLDTPLEQLD DDIIDIILYG SDRQLVFPYT
NRYGRTRQHK TYFKGIVGYL KRRVNESDSP TAHRRLERYM SERPCSTCQG GRLRPEVLAV
TVGGKSIAEF TRYSIKEAYD FIEELNLTER EQYISQEIIK EIKNRLRFLI DVGLDYLTLD
RPAGSLSGGE AQRIRLATQI GSGLVGVLYI LDEPSIGLHQ RDNNRLIRTL EHIRDLGNTV
IVVEHDEDTI RAADHIIDIG PRAGKHGGRV VAQGSLEDIV SSKESLTGQY LSGEKKIPVP
EKRVRPNGKY LEIKGARQHN LKNIDVKIPL GTFTCVTGVS GSGKSTLINL TLKRKLMQHF
YDSTLRPGEH DEIKGLEYVD KIINIDQSPI GRTPRSNPAT YTKVFDYIRD VFAKTPEARR
MGYKKGRFSF NVKGGRCEAC KGDGIIKIEM HFLPDVYVPC EVCGGKRYNR ETLEIKYKGK
TIADILEMTV EEALEFFANV NPIKRRLQTL YDVGLGYIRL GQPATTLSGG EAQRIKIASE
LGKRSTGKTI YILDEPTTGL HFEDVKKLLE VLFRLREGGN TVIVIEHNLD VIKAADYIID
LGPEGGDRGG QVIATGTPEE VAANPESYTG QFLSKYLNNK W
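
The protein sequence can be checked against the gene sequence by translating the latter. The sketch below builds the codon table by hand rather than relying on a library. It assumes that translation table 11 assigns internal codons the same amino acids as the standard code (the two differ in which start codons are allowed), which is sufficient here because the gene starts with ATG. The `translate` helper is illustrative, and only the first line of the gene is translated.

```python
from itertools import product

# Codon order follows the conventional T, C, A, G layout: first base varies
# slowest, third base fastest. "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    # Ignore a trailing partial codon, if any.
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence and first two blocks of the protein sequence.
gene_prefix = "ATGGTCAAGG AAATGATTTA TGTTAAGGGA GCCCAGGAGC ATAATTTAAA GAATATTGAT"
protein_prefix = "MVKEMIYVKG AQEHNLKNID"

translated = translate(gene_prefix)
matches = translated == protein_prefix.replace(" ", "")
print(translated, matches)
```

The same function applied to the final six bases of the gene, TGGTGA, yields a single W followed by the stop, in line with the last residue of the protein sequence.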