Gene Hore_09460 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHore_09460 
Symbol 
ID7313439 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalothermothrix orenii H 168 
KingdomBacteria 
Replicon accessionNC_011899 
Strand
Start bp1019366 
End bp1021057 
Gene Length1692 bp 
Protein Length563 aa 
Translation table11 
GC content38% 
IMG OID643611385 
Productalpha amylase catalytic region 
Protein accessionYP_002508697 
Protein GI220931789 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0366] Glycosidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.0188478 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACTAAAA AAGAGAAAAG CTGGTGGAAA GAGGCGGTAG TTTATCAAGT ATACCCGCGT 
TCTTTTAATG ATACTACTGG TAATGGTATC GGGGACTTAA GGGGTATTAT TGAAAAGCTC
GATTATATAA AAGACCTTGG TGTTGATGTT ATCTGGTTAA ATCCCGTTTA CGAGTCTCCA
TGTGATGACA TGGGGTATGA TATTAGCAAT TATAGAAAAA TATTACCCCA GTTTGGCACC
ATGGAGGATT TTGATCTTCT CCTCTCTGAA ATGCATAAGC GGGGATTAAA ACTTGTCATG
GATCTGGTAG TGAATCATAC TTCTGATGAA CATCGCTGGT TTGTTGAGTC CAGGAAGTCC
AAAGATAATC CCTACCGGGA TTATTATATC TGGAAAAAGC CAAAAGCTGA TGGTAGCCCT
CCCAACAACT GGGTTTCCTA TTTCGGGGGT TCTGCCTGGG AGTATGATGA ACAAACCGGT
GAATATTATC TCCATCTGTT TTCCAAAAAA CAGCCTGATT TGAACTGGGA AAACCCAAAG
GTCAGAGAAG AAGTAAAGGA TATAATGCGT TTCTGGCTTG ATAAAGGTGT CGATGGATTT
AGAATGGATG TTATTGGATT TATTTCAAAG GATCCTGATT TTGAAGATTT TCCAACTGAT
AATCCCAGTG GGAAGGATCT TGGTGATAAA TATGCCAATG GGCCCAGATT ACATGAATTT
TTACAGGAAC TCCATGATGA TGTTCTTAGT CACTATGACT GTATGACCGT TGGAGAATGC
CCCGGAGTAT CACCTGAAGA TGCCCTGTTA ATAGTTGGTA AAGACAGGCG AGAACTCCAG
ACTCTCTTTC AATTTGAGGG AATGGACATT GATTATGGTA AAAATGGAAG CCGCTTCAGT
ATAGGTAACT GGGATGTTCA TGGTTTTAAA AAAGTATATA CAAAATGGCA TAAAAAGTTA
TATGGTAAGG CCTGGAACAG TATTTATCTT ATGAACCATG ACCAACCACG GGCAGTGTCC
AGGTTTGGTG ATGATAAAAA ATACCGCAAA GAATCTGCTA AAATGTTAGC AACCTTCCTG
CTGTCTATGT GGGGTACCCC CTATATCTAT CAGGGGGAAG AAATAGGTAT GACAAATTGC
CCCTTTGAAG GTGTGGAAGA ATTCCGGGAT ATTGAAATGA TTAATTATTA TAATGAACAG
ATAAGTAAGG GTAAAACTAA GGAAGAAATA ATGCCCGGAT TATTATACAG AGGACGGGAT
AATTCCAGGA CTCCAATTCA ATGGAATGAC TCCAGAAATG CAGTTTTTTC TGATGCTGAA
GAGACCTGGA TAAAGGTAAA CCCCAATTAT ACTGAAATTA ATGTTGAAGA AGCTGAAAAA
GACCCTGATT CAATTCTCCA TTATTTCCGT CGTATGATTA AAACCAGGAA AGATAATGAT
GTTCTAATAT ATGGTGACTA TGAACTGGTA GATGAAGGAA ATGACGATGT GTATGCCTAT
CGAAGATTTC TAGACAATGA AGAAATGCTT GTTCTTCTAA ACTTTACAGA TAAAGAGACA
AGCTGTGATG TTAGCCCTTA TAACTTAGAA GATAAAGAGC TGATTATCTC TAATTATAAG
GGGGGTCAAA AGGTCAAAGG AACTGAAGTG ACTTTAAGGC CTTATGAAGC CAGGATCTAT
AAGATAAAAT AA
 
Protein sequence
MTKKEKSWWK EAVVYQVYPR SFNDTTGNGI GDLRGIIEKL DYIKDLGVDV IWLNPVYESP 
CDDMGYDISN YRKILPQFGT MEDFDLLLSE MHKRGLKLVM DLVVNHTSDE HRWFVESRKS
KDNPYRDYYI WKKPKADGSP PNNWVSYFGG SAWEYDEQTG EYYLHLFSKK QPDLNWENPK
VREEVKDIMR FWLDKGVDGF RMDVIGFISK DPDFEDFPTD NPSGKDLGDK YANGPRLHEF
LQELHDDVLS HYDCMTVGEC PGVSPEDALL IVGKDRRELQ TLFQFEGMDI DYGKNGSRFS
IGNWDVHGFK KVYTKWHKKL YGKAWNSIYL MNHDQPRAVS RFGDDKKYRK ESAKMLATFL
LSMWGTPYIY QGEEIGMTNC PFEGVEEFRD IEMINYYNEQ ISKGKTKEEI MPGLLYRGRD
NSRTPIQWND SRNAVFSDAE ETWIKVNPNY TEINVEEAEK DPDSILHYFR RMIKTRKDND
VLIYGDYELV DEGNDDVYAY RRFLDNEEML VLLNFTDKET SCDVSPYNLE DKELIISNYK
GGQKVKGTEV TLRPYEARIY KIK