Gene Hore_19090 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHore_19090 
Symbol 
ID7312724 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalothermothrix orenii H 168 
KingdomBacteria 
Replicon accessionNC_011899 
Strand
Start bp2040845 
End bp2043250 
Gene Length2406 bp 
Protein Length801 aa 
Translation table11 
GC content41% 
IMG OID643612356 
ProductAlpha-glucosidase 
Protein accessionYP_002509652 
Protein GI220932744 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1501] Alpha-glucosidases, family 31 of glycosyl hydrolases 
TIGRFAM ID[TIGR01563] phage head-tail adaptor, putative, SPP1 family 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.00110521 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACAGTG AGGTAATTAT GGACAAAGGC CCGGAAAAAT CAGGTTCAAA AAGTTATTTT 
ATCTTAAATG AAATTCTGGA TTATTGCAGG AAAGATAACA GGATTTCTTT TAAGCTGAAA
AAAGGAGAAG TTGTTTTAGA GTTTTTGACT GCTGATATAG TCAGGGTGGT TATGGGGAAA
AGGGATAAAG CCAGCCTGGA GACCACCTGT GCTGTCGTGG ATCACAACCT GGCCTACTCT
GATTTTACAA TTAATGAAAC AGAAAAGGTC CTTAACATTG AGACCGATCG GTTAATAGTC
AGGGTTAACC GACATAAATT TGGCCTCGGT TTTTATGATA AGGAGGGGAA TTTAATTAAC
CGGGATTACT CCAGACGGGC CCTGGGATGG AGTGGTAATG AAGTCCGGGC CTGGAAAGAA
GTTAAGCCCG GCGAAAGGTT TTACGGCCTT GGAGAAAAAA CAGGTTTCCT CGATAAAAGG
GGTAAAAAGT ATACAATGTG GAATTCTGAT GTCTTTGAGG CCCATGTTGA GAGTACTGAT
CCCCTCTATA AATCAATTCC CTTTTTAGTT GGGTTTAATA AAGGGAAGAC CTATGGTATT
TACTTTGATA ATACCTATAA AAGTCATTTC GATCTGGCTT CAGGTAATAA AGATTATTAT
TCATTCTGGG CCGAAGGAGG TAAAATGGAT TATTATTTCA TTTACGGTCC GGACCTTAAA
GAGGTAATAT CAAAATATAC CCTGCTAACA GGAAGGATGC CGTTACCACC TAAATGGTCC
CTGGGTTATC ACCAGTCCCG TTATAGTTAC CATCCTGACA GTGAAGTTAA AAGAATTGCC
AGAACATTGA GGAAAAAGGA TATTCCCTGT GATGTAATTC ACCTCGATAT TCATTATATG
GATGGCTACC GGGTTTTTAC CTGGAATGAA GAAGAGTTTC CCTGTCCTGG GGAAATGATA
TCTGATTTAT CAGAAGAGGG TTTTAAAATT GTTAATATCA TAGACCCCGG TGTTAAAGTA
GACCCCGAAT ATGAGGTGTA CCGGGAAGGA ATGAGAGAAG ACTATTTTTG TAAGTATCTT
GATGGAAGAC CCTTTGTCGG GAAAGTCTGG CCGGGTCAAA CAGTATTTCC CGATTTTACC
TGCCAGAAAG TCAGGGAATG GTGGGGTGAT TTGCACAAAA AATATGTTGA TCAGGGAGTT
AAAGGGATAT GGAATGACAT GAATGAGCCC TCTGTATTTA ACGAAACCTC AACCATGGAC
CTTAATGTCG TCCATGAAAA TGACGGTGAT ATGGGGACTC ATAGACGTTT TCATAATGTA
TATGGTCTCC TTGAAAATAA AGCAACCTAC CAGGGCCTTA AAAAACACCT TCAGGAGCGG
CCATTTATTT TATCAAGGGC TGGATTTGCC GGCATTCAGA GGTATGCAGC GGTCTGGACC
GGGGACAACA GAAGTTTTTG GGAACATTTA AAACTGGCAG TCCCCATGTT AATGAACCTG
GGGATGTCGG GAGTGACCTT TGCCGGAACT GATGTTGGTG GCTTTACCGG TGATTCCAAT
GGTGAGCTTT TGACCAGGTG GACCCAGCTG GGAGCTTTTA TGCCTCTTTT TAGAAATCAC
TGTACTATCG GGGCCCTTGA TCAGGAACCC TGGTCCTTTG GTGAAAAATA TGAGGCTATT
ATAAGGAAAT ATATTAAACT GCGCTATCGT TTATTACCAT ATACCTACGG TTTATTTTAC
CGGGCCAGTC AGGAAGGCTT ACCGGTAATG AGGCCTCTGG TTATGGAATA CCCTTTTGAC
CCCAGAACTT ATAATATCTC CGATCAGTAT CTGTATGGGG ACAGTATTAT GATAGCCCCG
GTTTACGAAC CAGACCGTAA AGAACGGCTG GTCTATTTAC CGGAAGGTAT CTGGTTTGAT
TTCTGGACCG GAGAGAAATA TGAAGGTGGA AAAAATATTA TAGCAAAAGC CCCTCTTGAT
ACTCTGCCCG TTTATATTAA GGCAGGGAGT ATTATACCAT TGACTGAATC CGTTAATTAT
GTGGGAGAAA AGGAGAATAG TGACCTGGAA TTAAATATAT ATCTCAGTTC TGAAGTTGAA
GAAGATAGCT ACCAGTTATA TGAGGATGAT GGGTATAGCT TTGATTATCA GAATGGAAAG
TACTCTCTGG TAGAATTTAA ATACAATTAT AGCGATAATG GCCTCACCTT TAACATAAAT
CCCTTCAAGA CCGGTTATAA GCTCCCTTAT CCTGATTATA TCCTGAACTT TAAAAACCTG
ACCCGGGAAC CTTCCAGTAT TATGGTAGAT GGTTCTGAAC TCAATGACTA TGTTTATGAT
GATCAGAGAG GTGAGTTAAG ATTAAAGGTC AATAAAAAAG CCAGGAAAAT AAAGGTTAAT
CTATAA
 
Protein sequence
MNSEVIMDKG PEKSGSKSYF ILNEILDYCR KDNRISFKLK KGEVVLEFLT ADIVRVVMGK 
RDKASLETTC AVVDHNLAYS DFTINETEKV LNIETDRLIV RVNRHKFGLG FYDKEGNLIN
RDYSRRALGW SGNEVRAWKE VKPGERFYGL GEKTGFLDKR GKKYTMWNSD VFEAHVESTD
PLYKSIPFLV GFNKGKTYGI YFDNTYKSHF DLASGNKDYY SFWAEGGKMD YYFIYGPDLK
EVISKYTLLT GRMPLPPKWS LGYHQSRYSY HPDSEVKRIA RTLRKKDIPC DVIHLDIHYM
DGYRVFTWNE EEFPCPGEMI SDLSEEGFKI VNIIDPGVKV DPEYEVYREG MREDYFCKYL
DGRPFVGKVW PGQTVFPDFT CQKVREWWGD LHKKYVDQGV KGIWNDMNEP SVFNETSTMD
LNVVHENDGD MGTHRRFHNV YGLLENKATY QGLKKHLQER PFILSRAGFA GIQRYAAVWT
GDNRSFWEHL KLAVPMLMNL GMSGVTFAGT DVGGFTGDSN GELLTRWTQL GAFMPLFRNH
CTIGALDQEP WSFGEKYEAI IRKYIKLRYR LLPYTYGLFY RASQEGLPVM RPLVMEYPFD
PRTYNISDQY LYGDSIMIAP VYEPDRKERL VYLPEGIWFD FWTGEKYEGG KNIIAKAPLD
TLPVYIKAGS IIPLTESVNY VGEKENSDLE LNIYLSSEVE EDSYQLYEDD GYSFDYQNGK
YSLVEFKYNY SDNGLTFNIN PFKTGYKLPY PDYILNFKNL TREPSSIMVD GSELNDYVYD
DQRGELRLKV NKKARKIKVN L