Gene Hore_22970 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHore_22970 
Symbol 
ID7313049 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalothermothrix orenii H 168 
KingdomBacteria 
Replicon accessionNC_011899 
Strand
Start bp2508176 
End bp2511340 
Gene Length3165 bp 
Protein Length1054 aa 
Translation table11 
GC content41% 
IMG OID643612749 
Productalpha-mannosidase 
Protein accessionYP_002510037 
Protein GI220933129 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0383] Alpha-mannosidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAAAA TGCCTGAGGT AACAAATGCA CACATAATTT CTCATACCCA CTGGGATCGG 
GAATGGTTTT TAAATAGTAA ATATACCAAT GAATGGCTGG TACCTTTTTT TGACTCGTTA
TTTGATATGC TTGAAAAAGA AGAGGATTAT AGATTTATAC TGGATGGACA GACCCTGATG
GTAGAAGATT ATCTGGCTGA ACTTGAGAAA CAGGGTAAAA ATAGCAGCAA GTATAAAGAG
AAATTAAAAA AATACGTTAA AAAAGGGCAG ATAATAATAG GTCCGTATTA TCTTCAACCA
GACTGGCAGT TATTAAGTGA AGAATCACTG GTTAGAAACC TATTAATCGG ACATAAAGTT
GGGAATGAGC TGGGGGGAGT AATGAAAACA GGTTGGTTAT TGGATAATTT CGGTCAGATT
TCCCAGGCTC CCCAGATTCA TAAACTCTTC GGCCTGAAAG GGCTTTTCCT GTGGCGAGGT
GTGGAAATGC CTCCGACTGA TATTGACTCT GAATTTATGT GGAAAAGCCC TGACGGCACG
GAAATGGTCT CAGTATATTT CCTCAGCAGT TATCGTAATG CCATGAGGCT GGCCGAATAT
AAGGAGATGT TCAAAGAACG GATTGAAAAT GAAGTTAATA AAATCAGTCC CTTTGCCACA
ACTTCTAATG TTCTTTTAAT GAATGGTTAT GACCAGGAGA TGATACCGGA CAACATTTTA
CCTGAACTGG AGAATTTATC TCTGGAGGGT ATCAGGGCTA AACAGAGTAC TCCCCCTGAA
TATATTGATG CCATTAGCAA AGAAAATCCG GATTTAAAGG TATTGGAAGG AGCTCTTTAT
AGCGGGCGTT ATATTTCAGT ATTTCCGGGT ATTCTCTCAG CCAGGATGTA CCTCAAATTG
AAAAATGACC GCTGTCAGAG GGAGCTTACA AATTATGCTG AACCTTTATC AGCCTTATCC
TGGCTTATGG GCGGAAACTA TGAAGATGAT ACCTTCTTAG AGGCATGGAA AACTCTACTC
AAAAACCATC CTCATGATAG TATCTGTGGT GTCAGTATTG ATGATGTACA TACAGATATG
GAAGAGAGAT TGGCCAGGGC CTACTCTATA AGCCGGAGGT TGACTGAAAG AGGAGTGGAA
GAATTAACTT TAAATATAAA GACAGAAATC AAAGATGACG AAAAGCAGCC ATTTGTGGTC
TTTAATACTT TCCCATCGGG ACGGGGAGGG GTAGTCAGCT TAGATTCAAC CGGGGATAAC
TGTATTGTAA CTGACTCCAC CGGGAAGGCT TTACCCACTC AGAAGGGAAC CGGGGGTAAC
CACTATGTTT ATCTAGATGA TATTCCCGCC ACAGGGTATA AAACCATTTA TCTGGATGAG
GGTTCTGATA GCTTTAATGC TGCCAATTAC ACTGGAGACT ATGACCGGCT GGTGGTTAAA
GATAACTGGA TTGAAAATAA ATACCTGGCT ATAGAGATAG AAAGTGATGG TAGTTTAAAT
GTTTTTGATA AGGTAAACCA GGAACATTAC AGTGGCCTCG GTGTTTTTGT TGATGATGCC
GATGCTGGAG ATACCTATAA TTATTCTTAT CCTGAAAATG ACCTCAAGAT AACCACCAGA
AATCAAAAAG CAGATATAGA AATTGTCGAG GCAGGGCCTT TAGTAGCTAC AATCAAAATT
AGTCAGGTTA TGAAATTACC CGCTTCCCTG AGTGAAGACA GGAAGACAAG GTCTGAAGAG
CTACTTGACC TTCCTGTAGT AAACTGGGTG AAGGTAAAGG CCAACTCTCC TCGAATTGAG
TTTAAGACAG AGGTGAAAAA CACAGTTAAA GATCACCGGT TACGGGTATT ATTTCCCACA
GGAATTGATA GTAAATATTC ATATGCCGGA ACCCAGTTTG ATATTACCAG GCATGAAATT
TACCCCGGGG TCAGTGATGA TAGTGAGATA CCGGATAATG TCAAAAGAAT CATCATCGGG
GCCCGGGAAT CAGAACCAAT TACAACCTTT CCCCAGTGCT ATTTTGTAGA TATTAATGAT
GGAGATAAAG GAGTGGCAGT TTTAAATAAA GGTTTACCTG AATACGAGAT ACTGCCGGAG
GATAATATTA TTGCCCTGAC CCTCTTCAGG TCAGTAGGCT GGTTGGCCAG GGGAGATTTA
CTGACAAGGG TTGGTGATGC CGGACCCGTA ATCTATACCC CTGATGCCCA GTGTTTAAGG
GAGATGACCT TTGAGTATTC CCTGTATTTC CACAAGGGTG ATTGCCTCCA GGGAGAGGTA
TACCATCAGG CCCGGGACTA TAATAATGAG CTGCTGGTTA TCAAAACAGA GCAACATGAA
GGAGTTTTAC CACCTGAGAA CTACTTTATT AAACTACACA GTCCTGATAA TGCCCTGCAG
GTTACTGCAA TAAAGAGATC CGAAGATGGA GAAGGTCTTA TTTTAAGGCT TTTCAACACA
GGGGATAAGG AGGTATCAGG GACATTAACT ACCTCCCTTG ATTGTAGTAA AGCTTATTAT
GCTAACCTTA ACGAAGAAGT TGAAGAGAAA ATATCTATTG ATAGAAGTAA TGAACTATCA
ATAAATGTAA AACCCAGAGA GATAAAAACA ATAAAGATGG TACTTGAGTC CAGTAATATT
CTTGATACCG GCAGGGCAGG GGAGACAGAA ATTATAGAAA ACAGGTCTGC ACAGACTGAC
TTCAGTCAAT ATAAATCCCT GCCAATTGTA GAAAAAGAGG ATATTGAAAA AGAGAAAATC
CGGCTTCAGA GAGTAGAGGA GAAACTTAAT CAGTCTATGA AACGGGTGGA AAGCCTTAAA
AATAAGCTGG ATAAAGCCAC TAACCTGTCA TTAGTAGAAC TGGCAGAGCT TGAAGCTGAA
TATCATCAGG CCCGGGGAGA TGTTACCAGC TACAGGCGAG CCGCTTTAGA GGCCAGGCTT
TCGGTGGTCC TGACAAGGAA AAAATACCTG ACCCTGTACA GGAAGGATGA AGAAGGATAT
AAAGAGGCTA TGGATAAAAT TGATGAGGAA CTAAGAGAAA TAGGATATGC CCTAAATCAG
GCCAGAGTAG ATAAAAGGGT TTATGAATAT ATTGTTGAAT ATTATCAGCA TAGATTAAAA
TTAGCAAAGG AAAGTTGTTT AACTGAATCA GGCCATTTGT CCTGA
 
Protein sequence
MKKMPEVTNA HIISHTHWDR EWFLNSKYTN EWLVPFFDSL FDMLEKEEDY RFILDGQTLM 
VEDYLAELEK QGKNSSKYKE KLKKYVKKGQ IIIGPYYLQP DWQLLSEESL VRNLLIGHKV
GNELGGVMKT GWLLDNFGQI SQAPQIHKLF GLKGLFLWRG VEMPPTDIDS EFMWKSPDGT
EMVSVYFLSS YRNAMRLAEY KEMFKERIEN EVNKISPFAT TSNVLLMNGY DQEMIPDNIL
PELENLSLEG IRAKQSTPPE YIDAISKENP DLKVLEGALY SGRYISVFPG ILSARMYLKL
KNDRCQRELT NYAEPLSALS WLMGGNYEDD TFLEAWKTLL KNHPHDSICG VSIDDVHTDM
EERLARAYSI SRRLTERGVE ELTLNIKTEI KDDEKQPFVV FNTFPSGRGG VVSLDSTGDN
CIVTDSTGKA LPTQKGTGGN HYVYLDDIPA TGYKTIYLDE GSDSFNAANY TGDYDRLVVK
DNWIENKYLA IEIESDGSLN VFDKVNQEHY SGLGVFVDDA DAGDTYNYSY PENDLKITTR
NQKADIEIVE AGPLVATIKI SQVMKLPASL SEDRKTRSEE LLDLPVVNWV KVKANSPRIE
FKTEVKNTVK DHRLRVLFPT GIDSKYSYAG TQFDITRHEI YPGVSDDSEI PDNVKRIIIG
ARESEPITTF PQCYFVDIND GDKGVAVLNK GLPEYEILPE DNIIALTLFR SVGWLARGDL
LTRVGDAGPV IYTPDAQCLR EMTFEYSLYF HKGDCLQGEV YHQARDYNNE LLVIKTEQHE
GVLPPENYFI KLHSPDNALQ VTAIKRSEDG EGLILRLFNT GDKEVSGTLT TSLDCSKAYY
ANLNEEVEEK ISIDRSNELS INVKPREIKT IKMVLESSNI LDTGRAGETE IIENRSAQTD
FSQYKSLPIV EKEDIEKEKI RLQRVEEKLN QSMKRVESLK NKLDKATNLS LVELAELEAE
YHQARGDVTS YRRAALEARL SVVLTRKKYL TLYRKDEEGY KEAMDKIDEE LREIGYALNQ
ARVDKRVYEY IVEYYQHRLK LAKESCLTES GHLS