Gene Hore_20420 details


Gene Information

Locus tag: Hore_20420
Symbol:
ID: 7314366
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halothermothrix orenii H 168
Kingdom: Bacteria
Replicon accession: NC_011899
Strand:
Start bp: 2200095
End bp: 2203841
Gene Length: 3747 bp
Protein Length: 1248 aa
Translation table: 11
GC content: 39%
IMG OID: 643612486
Product: conserved repeat domain protein
Protein accession: YP_002509782
Protein GI: 220932874
COG category:
COG ID:
TIGRFAM ID: [TIGR01451] conserved repeat domain
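
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the protein length excludes the terminal stop codon; neither convention is stated in the record itself. The variable names are hypothetical.

```python
# Values copied from the Gene Information fields above.
start_bp = 2200095
end_bp = 2203841
gene_length_bp = 3747
protein_length_aa = 1248

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with a stop codon that is not counted as an amino acid.
codons = span_bp // 3
expected_protein_aa = codons - 1

coords_consistent = (span_bp == gene_length_bp) and (span_bp % 3 == 0)
protein_consistent = expected_protein_aa == protein_length_aa

print(span_bp, expected_protein_aa, coords_consistent, protein_consistent)
```

Under these assumptions the listed gene length and protein length agree with the start and end coordinates.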


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGGGGGGTA AAAAACCTGG ATGGGATAAA TTTTTCCTCA TACTTATTTT CCTGTTATTT 
ATAACCTGGG CCTGCCAATT TTCCAGGGTA TTTGCCAGTG ATACCGATGA TTATATAGAT
GTTGATTTTT CCAGTTCTAC TGTTAAGTTT GTTAACATTA GTCCTGATGG CAGTGGATAT
ATTATCCCAT ATGCTACCAA TCTCAGTATA GTTTCCCCCC TTCGTAAATG GGAGTTATAT
ATAGAAGCTA CCGGTGATCT TGTCAATAAT AATGACGGGA CCAGGATACC GATCAGTAAC
CTTTCATGGG CTTTACATAG CGACAATGGT AAAACCAACT GGAGGCCCCT TAAAACAAAA
CCAACCCTTG TAAGTAAAGG ACTTCCCCGT AAGTCACCGG GTTATGTTGC ATCATTTGAT
TATAAACTGG CTATAAATCC GGGAGTTAAA GCTTCTTCTT CCCCGAAAGC AATTTATGAG
ACGGCAATAA GGTTTAATTT AACAGCAGGG TCTTTATCTT CAAGCCTTGC TTATCCCAAT
CCTTTTTCAC CAAATGGTGA TGGAATTAAG GATACAACTG AAATATCGTT TAATATTACA
GAAGAGTTTG TGAATATCCC GGTTTTTGGT TATATAGTTG ATGTTGATGG AAATATGGTA
AGTATTTTTG ATGACCTGTG GTTTAAGTAC TTTGATAAAA CGGGGATATA TAAAATATCC
TGGGACGGCC AGGGTTTTGA TAATGAAGTC TTGCCTGATG GACAGTATTA TTATATTATT
GAGGCTTTTT CCTTGTCCGA TACCTGGCCT TTGCTTGACT GGTTTAAAGT TATCGGGGTA
GGAACGCTCG AGATTAGTAA TAATACGGCC CCCGGTGATT CTTCCATTAA AGGTCGGGTT
TTTAGTGAAG AAGGTGGTTT TTTACCCGGA GTTAGTATTG ACCTTTACAG TGAAAGTGGT
ATCAAGGTAA CAGAGACAAT AACCGATTCT AAGGGCAAGT TTAGAATAAA TGACCTGGCC
GGTGGTTATT ATTACCTGGA ACTTTCCCGG GAAAATTATC TGCCGTACCG TACCCCTGTG
TTTTACCTTG AGGAAAAATC TAATCTCAAA AAGGATATTA TTCTGAGCCA TAATTCATCC
CTTTATATAG ATGTTAAGGC TGATAAGACC CAGGCCAGAC CAGGTGATGT TATTACATAC
AGTGTTTACG TTGAAAATAA AGGGACATCC AGAGTCAATA ACACCTGTCT TCAGGAGAGG
TTACCGGAAG GGTTGGCCTA TCTGGAAGGA AGTTTTGAAT CAACCCGGGA TCTTTCCGTT
AGTGATTACA AGGATAGTCT CAGGTGGGAA TTTGGTAGCC TGGCTCCTGG TGACTTTTTT
GAAGTAAAAT ATTCTGTGGT GGTGACTCCG GCTGCCCCGG AAGGTGAATT GAGAAACAGG
GTTACTGTCA GTGGCACGGC TGAAGGTGAT TATGGGGGTT ATGTAGCTAG AGAGGGAAGT
GCTCCTGTGA TTATAAGCAG AGGAGTATTT GCCAGTTGTG GCGATATTCT TGGTAGAATT
ACTGGCTACC GGGAAGACCT GGATGATTCA CCATTAGATA AAAATATTTA CGTAAAATTA
AGTAATGGTA GTCTGGCCTC TGTACATGAT GGTAAATACT ACCTTGAGGG TATCCCTGAG
GGGAATTACC TGATCTGGAT AGTGCAAAAA CGAAACGGTA GATTTGTTAA TATTACAAAG
CCTGAAACTA TCAGGGTAGC AGGTGGTCTG GCTGTAAGGA AAGATTTTTA TATAAGCAAA
GGTGATTTTG TAAAAGCTGA TAACAAAGAG CAGTCCTATT ATGGTGTAGC TCAACTTGAC
CTTAATTTTA ACAATGGACT CAAAATGAGT GGTGAGCTTG ATTTAGAATA TAAAACAGGG
GTGAAAGACT GGGATTTATC GATAATATCA TCAACCAGGA GTCGTAAAAA CAGGGAGCAG
GATGGTAATG GTATAGATAT ATTACAGGGT TCCCCGTTAA AAATAACAGC CAGTAGGGAA
AAAGATATTG TTGAAATGGG AGAAGTTGAT TCAGGTGTCC TCGGCATGGC CTGGACAAGG
GAGCTTGATA ATAAAAAATT AAATGTTTAT GCCGGGGCCG GTGAAGTAGG AGAAGCCCTG
GAGGAAATTC CTCTTGATAA TTTCAGTGGT CCCTATCAAC TCGACCATTT CCCGGTGGTT
TCCGGAAGTG AAGAATTATA TATATGTGAG TTTGACCAGG ATGGTAATCT GATCAATGCC
TTTAAACCTG AATATTACCT TGATTATTAT GAAGGCAAGG TATATTTTAA TAAACATTAT
ACTTATATTG ACTACCGTAA TTATAGAAAG GTAATGCGGA TTAAATATCG TTATAAAACT
GGTTTTCTGG ATTGCACCAG GTATTATGGA TTTAATATAA CTACTGAAGA TGATAAAGCC
AGACTTACGG GAGAGGGTTC GTTAAATGAT TCGGGCCATG CAGTTAATAT AGAAGGTGAA
TATCGTAATA ATATAGAAAA AGGTGTCCTG ACCCTTACAG GTGAGGTAGA TTACCGGGCC
AATAAGGAAA CAGATGGAAA TGCCATGAAT TTGAAAGGGA ATGGCGCCTA CAATTTCCAG
GTTAAGGGGC ATGATTTAAA TCTAAAATTG TGGCATAAAC ATAGCAATAT TGAAGATATG
GTTATTGAAC CCCCGGGGGT TTCAGGAAAC GAGACCGGGA TTGGTTTACA GGATTCATGG
AATATTTTAC CTCAACTAAA AAATAAAAAC GATATCAGTC TGGTCAGGAA TTCAGAAGGT
ATAATTGAGT CATTATTAAA TACAGAGTTT GTTTACACCA GGGATATATA TTCGGCTTCT
CTAGCCTGGG AAGGAAGGTC CCATGGAGAG GACAATCTCA TTCTGACCGG TTTGATTGAG
ATTGATAGGA TTAAAGACTG GAAGCAGCAT GTTTCCCTAA CTCTTTCTCA GGAAAAAGAC
TGGATGCCTT TACTGATGTA TCATGGTGAA GCAGATCTGA AACAGGGCTA TCAGATTGAA
AATGAGGTTC AGCTGGGGGC CAAGAATACT GTGGATGTTG GCCTGGACTA TAAGGGTGAT
CACGGCATAA ATTATGAGTC TACCATAGCC TATGACTGGG ACCGGAATCA GCTGGGTTTA
ACTGCCAGGA TGACTTCCGG ACAGGATAAA TTAAAGACTG ACGGTCAGGT TAACATCAAT
TATGATATGG TAGCACATGT CCTGGATCTC TATCTGGAAC AGAAAGGGCT ATATCTACCT
GAACAGAATT ATTATTTAAA AGAAAAAATC AGTGCCGATA GAACTTTTGG GAAGCAAAAA
CAGGGTGACT TAATCCTTTC AATGTGGGCT GGTAAAGTAT TTGATAAAGG AAATATATTC
AGCAAGAATA ATTTAGAGGT GAATGGTGGT CTTATCTGGA GAAATAATAC CGAGAAGGGA
TATAATATAA ATACCGCGGC AACTTCTCTG GGATTACAAT ATGACATTAA ACCGGTAATT
ATTAGTTTTA AAACCCAGCA CTGGTTACAG GATGTGGTTT CCTTTAATGA ATTGGTTGCT
GACCTGGGAG TAATAATACC TGTAAAAGAC TGGTTAGAAC TAAAGGTGGG GTATCGTAAT
AATTCCCTCC TTAATGACAA TGCTCTGGTT CCTGTTCTGG AAAAAGGGTT TTACATTAAT
ATTACCGGTA AAGATTATAA CTTTTAG
 
Protein sequence
MGGKKPGWDK FFLILIFLLF ITWACQFSRV FASDTDDYID VDFSSSTVKF VNISPDGSGY 
IIPYATNLSI VSPLRKWELY IEATGDLVNN NDGTRIPISN LSWALHSDNG KTNWRPLKTK
PTLVSKGLPR KSPGYVASFD YKLAINPGVK ASSSPKAIYE TAIRFNLTAG SLSSSLAYPN
PFSPNGDGIK DTTEISFNIT EEFVNIPVFG YIVDVDGNMV SIFDDLWFKY FDKTGIYKIS
WDGQGFDNEV LPDGQYYYII EAFSLSDTWP LLDWFKVIGV GTLEISNNTA PGDSSIKGRV
FSEEGGFLPG VSIDLYSESG IKVTETITDS KGKFRINDLA GGYYYLELSR ENYLPYRTPV
FYLEEKSNLK KDIILSHNSS LYIDVKADKT QARPGDVITY SVYVENKGTS RVNNTCLQER
LPEGLAYLEG SFESTRDLSV SDYKDSLRWE FGSLAPGDFF EVKYSVVVTP AAPEGELRNR
VTVSGTAEGD YGGYVAREGS APVIISRGVF ASCGDILGRI TGYREDLDDS PLDKNIYVKL
SNGSLASVHD GKYYLEGIPE GNYLIWIVQK RNGRFVNITK PETIRVAGGL AVRKDFYISK
GDFVKADNKE QSYYGVAQLD LNFNNGLKMS GELDLEYKTG VKDWDLSIIS STRSRKNREQ
DGNGIDILQG SPLKITASRE KDIVEMGEVD SGVLGMAWTR ELDNKKLNVY AGAGEVGEAL
EEIPLDNFSG PYQLDHFPVV SGSEELYICE FDQDGNLINA FKPEYYLDYY EGKVYFNKHY
TYIDYRNYRK VMRIKYRYKT GFLDCTRYYG FNITTEDDKA RLTGEGSLND SGHAVNIEGE
YRNNIEKGVL TLTGEVDYRA NKETDGNAMN LKGNGAYNFQ VKGHDLNLKL WHKHSNIEDM
VIEPPGVSGN ETGIGLQDSW NILPQLKNKN DISLVRNSEG IIESLLNTEF VYTRDIYSAS
LAWEGRSHGE DNLILTGLIE IDRIKDWKQH VSLTLSQEKD WMPLLMYHGE ADLKQGYQIE
NEVQLGAKNT VDVGLDYKGD HGINYESTIA YDWDRNQLGL TARMTSGQDK LKTDGQVNIN
YDMVAHVLDL YLEQKGLYLP EQNYYLKEKI SADRTFGKQK QGDLILSMWA GKVFDKGNIF
SKNNLEVNGG LIWRNNTEKG YNINTAATSL GLQYDIKPVI ISFKTQHWLQ DVVSFNELVA
DLGVIIPVKD WLELKVGYRN NSLLNDNALV PVLEKGFYIN ITGKDYNF
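
The sequences above are printed in space-separated groups of ten characters, so they need to be joined before any length or composition check. The following is a minimal sketch of that step for the gene sequence. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the sample input is only the first and last lines of the gene sequence, not the full record.

```python
def clean_sequence(block: str) -> str:
    """Join a sequence printed in whitespace-separated groups into one uppercase string."""
    return "".join(block.split()).upper()


def gc_fraction(dna: str) -> float:
    """Return the fraction of G and C bases in a DNA string (0.0 for an empty string)."""
    if not dna:
        return 0.0
    return sum(base in "GC" for base in dna) / len(dna)


# First and last lines of the gene sequence above, used only as sample input.
sample = """
GTGGGGGGTA AAAAACCTGG ATGGGATAAA TTTTTCCTCA TACTTATTTT CCTGTTATTT
ATTACCGGTA AAGATTATAA CTTTTAG
"""

dna = clean_sequence(sample)
print(len(dna), dna[:3], dna[-3:], round(gc_fraction(dna), 2))
```

Applying the same two functions to the full gene sequence would give a length and GC fraction that can be compared with the Gene Length and GC content fields.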