Gene Hore_00420 details


Gene Information

Locus tag: Hore_00420
Symbol:
ID: 7314259
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halothermothrix orenii H 168
Kingdom: Bacteria
Replicon accession: NC_011899
Strand:
Start bp: 47678
End bp: 50020
Gene Length: 2343 bp
Protein Length: 780 aa
Translation table: 11
GC content: 40%
IMG OID: 643610459
Product: Kojibiose phosphorylase
Protein accession: YP_002507798
Protein GI: 220930890
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1554] Trehalose and maltose hydrolases (possible phosphorylases)
TIGRFAM ID:
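
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the original record. It assumes that the start and end coordinates are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 47678
end_bp = 50020
gene_length_bp = 2343
protein_length_aa = 780

# Assumption: coordinates are inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# A CDS is read in codons of three bases.
codons, remainder = divmod(gene_length_bp, 3)

# Assumption: the last codon is a stop codon and encodes no amino acid.
coding_codons = codons - 1

print(span == gene_length_bp)              # span matches the listed gene length
print(remainder == 0)                      # length is a whole number of codons
print(coding_codons == protein_length_aa)  # codons minus stop match the protein length
```

Under these assumptions all three checks agree with the listed values: 50020 - 47678 + 1 = 2343 bp, and 2343 / 3 = 781 codons, one of which is the stop codon.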


Plasmid Coverage information

Num covering plasmid clones: 80
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCGATAG CTTATAATCT AGGTAAAGGA GAATATAAAA ACTGGATAAT TTCTGAAACA 
GAATTTAATG AAAACAATCT GCCTAAATTT GAAACTATCT TTAGTCTTGG AAATGGTTAT
ATGGGTTTAA GGGCGGCCAC AGAAGAACAC TATTTTAATG AAACAAGAGG TTGTTATGTG
GCCGGAATGT TTGACAGGTT TAAAGGGGAG GTTGCCGAGC TTCCCAATAT TCCTGATTAT
GTAGGGATGG AGATTAAGCT TGACGGAGAG AGATTTAACC TGAACCAGGG AAAAATTATA
TCTTATCACA GGTACTTAAA CGTTAAAGAC GGAGAACTGG TTAGAGAGGT GGAATGGCAA
AGTCCAGCCG GAAATATTAC AAAACTGGTC TTCAAACGCT TTGTTTCTCT TGCCAATCTC
CATCTGGCCG GTTTTAAGAT TAAAATAATT CCAGTTAACT ATTCCGGAAA GGTGGCCATT
AAAACTGGTT ATAACGGGCA GGTAACCAAT AGTGGTGTTC AACATTTCGT TGAAGGTGAT
AAAAGGGTTT TACCTGATGG TAAATCTTAT TTAACTGTCC GGACCCAGGA GTCGGGTATT
TTTACAATTG TAGCCGGTAA GTTCCGTTTC CTGATAAATG CCAGTGAAAT TAATCCACAA
CAGCAGATCG TTACCGGTAG GCGTCAGCTT TTCCTGAGAA GCGAATATGA ATTGAAAGAA
AATGAGTGCC TGGAAATGGA GAAGTGTGTA ATTGTATATA CCGGGCGGGA TCTGGAATTT
AAAGATAAGG ATATTGATTC TGGAGATATT GTTGAAACAG CTCTTACAAC TCTAGATAAA
GCAGGTACTA AGAGGTATGA AGAACTTTTT TCAGAACATC GCCAGAAATG GCACAAGCTA
TGGCATGAGA TAGATATTGA AATCGGTGGG CCTGATTTTG ATCAGCTTGC CGTAAGGTTT
GCCCAGTATC ATCTGGTACA GATGACCCCG TCCCATGACA GTCGTATCAG TGTTGCTGCC
AAGGGGCTTT CGGGGGAAGG GTATAAGGGT CATGTTTTCT GGGATACCGA GATTTTTATC
CTTCCTTTCT TTATCTATAC TTTCCCACAA ATAGCCCGGA AGTTACTCGA ATATCGCTAT
CATACCCTGG ATGGAGCTCG TAAAAAGGCC CGGGAAAATG GGTATAAAGG AGCCATGTAT
CCCTGGGAAA GTGCTGATAC CGGTGAGGAA ACCACCCCTG AATTTGGTGA AGTTGATATA
AAGACCGGGA AACCCATCAG GATCTGGTGT GGGGAAATAG AACAACATAT TACGGCAGAT
GTTGCATATG CAATCTGGCA TTATTATCAG GTTACCGGAG ACAAAGAATT TATGTATAAC
TATGGAACTG AAATTTTTAT GGAGACAGCC CGCTTCTGGG CCAGTAGACT GGAGTATAAT
CAGGGTCTGG ACCGTTATGA AATTAAAGAT GTCATCGGTC CTGATGAATA TAGTGAACAT
GTTAACAATA ATGCCTATAC AAATTATATG GTGAAATGGC ACCTTGAGAA GGCAATTGAT
ATCTACAACT GGTTATCAGA TGATAGCAGA GATATTCTGG AGAAAATAAT AAATAAAATT
GCTTTAAAAG AAGATGAACT AAATGAATGG AAGAAAAAGA AAGATAAAAT TTACCTTCCC
TTCCAGGAAG ACAGTAAAGT TATTCCTCAA TTTGATGGTT TTATGGACCA GGATGTCATA
GATATAAGCA GCTACCGTGG TGATGTCGGG GCTATAATGA AGGCTTATAG CTGGGATGAA
ATAACCAGCA GTCAGGTTAT AAAACAGGCC GATGTAGTGA TGTTACTCTA TCTTCTTGGG
GAAGATTTTA GCCATGAGGT TAAAGAGAAA AATTATCATT ATTATGAACC TAAGACCCTT
CACGATTCTT CGTTAAGTCC CAGTATTCAT GCCATTATGG GTAAGGAAAT CGGTGATTTA
GATGAGGCCT ATAGATACTT CAATAAATCT ACCACTATTG ACCTTGGCAG GAATATGAGA
AGCTGTGATG CTGGTTTGCA TTCAGCTTCT CTTGGAGGTA TCTGGCAGGC AGTTGTTTTA
GGCTTTGGTG GTGTTAAAGT TAAAGATAAT GTTCTCAATA TAGACCCCAT GTTACCTGAG
AAGTGGGATT ACCTGAACTT TAAGCTAAAA TGGCAGGGGA TGCCAATCAG GGTTGAAATT
CGTAATGACA GAGTAAGTGT CAGTTTTTTA AATGATGATA AAGCTATGCT AAAAGATGTT
TCAGTAATGG TCAAGGGCCG TAACCTTCAG CTAAACAATA ATAAAGCAGT AGTGAATCTC
TAA
 
Protein sequence
MAIAYNLGKG EYKNWIISET EFNENNLPKF ETIFSLGNGY MGLRAATEEH YFNETRGCYV 
AGMFDRFKGE VAELPNIPDY VGMEIKLDGE RFNLNQGKII SYHRYLNVKD GELVREVEWQ
SPAGNITKLV FKRFVSLANL HLAGFKIKII PVNYSGKVAI KTGYNGQVTN SGVQHFVEGD
KRVLPDGKSY LTVRTQESGI FTIVAGKFRF LINASEINPQ QQIVTGRRQL FLRSEYELKE
NECLEMEKCV IVYTGRDLEF KDKDIDSGDI VETALTTLDK AGTKRYEELF SEHRQKWHKL
WHEIDIEIGG PDFDQLAVRF AQYHLVQMTP SHDSRISVAA KGLSGEGYKG HVFWDTEIFI
LPFFIYTFPQ IARKLLEYRY HTLDGARKKA RENGYKGAMY PWESADTGEE TTPEFGEVDI
KTGKPIRIWC GEIEQHITAD VAYAIWHYYQ VTGDKEFMYN YGTEIFMETA RFWASRLEYN
QGLDRYEIKD VIGPDEYSEH VNNNAYTNYM VKWHLEKAID IYNWLSDDSR DILEKIINKI
ALKEDELNEW KKKKDKIYLP FQEDSKVIPQ FDGFMDQDVI DISSYRGDVG AIMKAYSWDE
ITSSQVIKQA DVVMLLYLLG EDFSHEVKEK NYHYYEPKTL HDSSLSPSIH AIMGKEIGDL
DEAYRYFNKS TTIDLGRNMR SCDAGLHSAS LGGIWQAVVL GFGGVKVKDN VLNIDPMLPE
KWDYLNFKLK WQGMPIRVEI RNDRVSVSFL NDDKAMLKDV SVMVKGRNLQ LNNNKAVVNL