Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hore_23200 |
Symbol | |
ID | 7314203 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halothermothrix orenii H 168 |
Kingdom | Bacteria |
Replicon accession | NC_011899 |
Strand | - |
Start bp | 2536924 |
End bp | 2538888 |
Gene Length | 1965 bp |
Protein Length | 654 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 643612772 |
Product | alpha amylase |
Protein accession | YP_002510060 |
Protein GI | 220933152 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0366] Glycosidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.0000149088 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTAGCTA ATTTACATAA AAGAAGGTTA CTGGTCTTTT TGGTTATATC CTTAGTTGTC TTTACAGGAT TAAACCTTTA CACAGGGTCG GAAGCTATAG CCGGGGAAAC CACAGCTTTA GACTGGGAAC CGGCTGAATG GGCCCGTAAG GCAGTATTTT ATGAAGTCTT TGTCAGGAGT TTTTATGATG GAAATGGAGA TGGAATCGGG GATTTTGTGG GATTAAAGGA AAAGATACCA TACTTTAAAG AACTGGGGGT TGATACCCTC TGGTTGATGC CGGTTAATGA TTCTCAAAGT TACCACGGGT ATGATGTTGT TGATTACTAT AATACAGAAC CTGATTACGG AACCCTGGAA GAATTCCGGG AGTTTTTACA GGAGGCCCAT GCCAATGGTC TTAAGGTAAT CATGGACCTG GTTTTAAACC ACACCTCGGT CAACCACTAC TGGTTCCGTG AGGCTGTAAA CACCAGAGAC AGTAAATACC GTGATTATTA TGTCTGGGCT GAAAATGAGG AACAGGTTAA AGAACTGGGC CCCTGGGGCC AGCCGGTCTG GCACAGATCC CCGGACGGGG GTTATTATTA TGGCCTCTTC TGGAGTGGTA TGCCTGACCT TAACTACCGC AACCCTGAGG TCAGGGCTGA AGCTAAAAAA ATAGCGAAAT TCTGGCTGGA CCCCAATGGA GATGGAGATT TTTCCGATGG TGTTGACGGT TTCCGCCTCG ATGCAGCCTT ACATATTGAT AAAGACCTTA CAATAACCCA TCAGTGGTGG CAGGAGTTTA ATACCTTTGT CAAGGGGATT AATCCCGAAG CCTTTCTTGT TGGAGAGAAC TGGACAGACA CCCATACGGT AGGAGAGTTT TTCCGGGATT TAGATGCCTC TTTTAATTTT GCCCTGGCCG ATGAAATTCT GAGAATGGTT AACGGGGTCC CGGTCGATAT TCTGGAGGAG GTAAAGGAGA TACACGGAGT ATATAGCCAG TATTCTGACC AGTATATTGA TGCCATTTTT CTGAGAAACC ATGACCAGAA CCGGGTCGGT ATTGAGGTAT TGCGGAACCG GGCAAAGATG AAACTGGCGG CATCCGTTCT GTTTACTCTG CCCGGAACCC CCTTCATCTA TTATGGTGAA GAACTGGGAC AGCTGGGGGC CAAGCCCGAT GACAATATCA GGGAACCCTT TGACTGGTAT AAAGATAGTA AGGGACCGGG AATGACCACT ATGTCTAAAG GTGGTTTCTA CCACTCCATG AGGTTTACCA AACCCCATGA CGGAATATCC CTGGAAGAAG AACGGGGTAA AAGTGGGAGT GTTTATGAAC ATTATAAGAA ATTAATCCAC ATCAGGAAGG AACACCCTCA GTTTTTTACC GGAAACTATC AGAAGATGGT TACCCCGAAC CGGATGTATG GCTATAAGGT AACCGATTCT GAAGTAGATT ATAACCTGTA TGTTATCCAT AATTTATCCA ATAAAGAACG GGGGATAACC ATTTCTAACA GAGCCCGGGA TTTACTTTCA GGAAAAACCA TTAAGGCCCA TCAGGGGATT ACATTACCAC CCTATGGCTC CCTGATATTA AAAACAACAG CCGCCTCTTT GAAAATAACC GGTCAGGATT TAAGCACCAA AAACCCTGTT GTAACCTTTA TCGTTGAGCT AACTGATAAA GCCAGGACAG ATGAAGATAT ATACCTGGCT TCAAAGTTGA GTAACTGGCA GGTTGATGAT AAATTCAAAC TTAAGAAGAG ATCAGATGGC AAATATGAAA TCACCCTGGA ACAACCGGCC GGGGCAGCCC TTATTTTTAA GTTTAAAACC CCGGGTACCT GGGAGGATAC CTCCGGTAGA GAAAATGAGG GGGATAACCG TTTCCAGGGG AGTGGCTATA ATAACAGAAT ATATACTTTT GAAAACAAGG AGACAGTCTA CCTAACCATA ACCGGTTGGG AATAA
|
Protein sequence | MLANLHKRRL LVFLVISLVV FTGLNLYTGS EAIAGETTAL DWEPAEWARK AVFYEVFVRS FYDGNGDGIG DFVGLKEKIP YFKELGVDTL WLMPVNDSQS YHGYDVVDYY NTEPDYGTLE EFREFLQEAH ANGLKVIMDL VLNHTSVNHY WFREAVNTRD SKYRDYYVWA ENEEQVKELG PWGQPVWHRS PDGGYYYGLF WSGMPDLNYR NPEVRAEAKK IAKFWLDPNG DGDFSDGVDG FRLDAALHID KDLTITHQWW QEFNTFVKGI NPEAFLVGEN WTDTHTVGEF FRDLDASFNF ALADEILRMV NGVPVDILEE VKEIHGVYSQ YSDQYIDAIF LRNHDQNRVG IEVLRNRAKM KLAASVLFTL PGTPFIYYGE ELGQLGAKPD DNIREPFDWY KDSKGPGMTT MSKGGFYHSM RFTKPHDGIS LEEERGKSGS VYEHYKKLIH IRKEHPQFFT GNYQKMVTPN RMYGYKVTDS EVDYNLYVIH NLSNKERGIT ISNRARDLLS GKTIKAHQGI TLPPYGSLIL KTTAASLKIT GQDLSTKNPV VTFIVELTDK ARTDEDIYLA SKLSNWQVDD KFKLKKRSDG KYEITLEQPA GAALIFKFKT PGTWEDTSGR ENEGDNRFQG SGYNNRIYTF ENKETVYLTI TGWE
|
| |