Gene Hore_20490 details


Gene Information

Locus tag: Hore_20490
Symbol:
ID: 7314373
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halothermothrix orenii H 168
Kingdom: Bacteria
Replicon accession: NC_011899
Strand:
Start bp: 2211551
End bp: 2213803
Gene Length: 2253 bp
Protein Length: 750 aa
Translation table: 11
GC content: 42%
IMG OID: 643612493
Product: Beta-galactosidase
Protein accession: YP_002509789
Protein GI: 220932881
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG3250] Beta-galactosidase/beta-glucuronidase
TIGRFAM ID:
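
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. The function names are illustrative only and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count for an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(2211551, 2213803)
print(length)                  # 2253
print(protein_length(length))  # 750
```

Both values match the Gene Length and Protein Length fields of this record.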


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00000040433
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAACGGA AAATATTAAA CTTTAATACG GACTGGTTTT TTCTTGATAA GGATATAGAA 
GGAGCAAAAG GAATAGATTT TAGTCAATCA GGGATGGAGA AAGTAAATCT ACCTCATCCC
AACAGGATTT TACCACACCA TTATTTTGAA GAATCTGATT ATCAGTTTGT TTCCTGGTAC
AGGCGCCCTT TTTACCTGGA AGAAGAGTAT AAAGGGAAAA GGGTGATAGT AGAGTTTGAT
GGAGTAATGA CGGTTGCTGA AATATATGTT AATGGGCAGT TTGTGGGTGA GCATAAAGGG
GGCTATACTT CTTTTAGTTT TGATATAACA GATTATCTGC TATCTGGAGA AAATAATCTG
CTGGCAGTCA GGGTGGATTC CAGCCAGAGA AAGGATATTC CCCCGGAAGG CAACCTGGTC
GATTACCTTT TGTTTGGAGG TATATACCGG GATGTTAAAA TGGTGATAGT AGACCCTGTT
TATATTAACT GGTCTTTTAT TGAACTTAAA GATGTAAACC TGGAAGCAGG TGTTATAAAA
CCCAGGTTTG AGCTCGTAAA CACAACAGGT AACCGGCAAA AAATAGTCTT AAATAGTCAG
GTTATTAATA AAGAAGGTAA GGTTGTGGCA ATGGTAGAAT CCAGACACCT GCTTGAGCCC
GGTGTAACCT CTCTGGAGCA GCCTGAGGTC AAGATAAAAG AACCTGAGTT ATGGCATCCT
GACCATCCCT ATCTCTATCA TGTTTATACT GAAGTTAAAG TTGAAGGAAA GCTGGTTGAT
GATTATAAAA CCAGAATCGG ACTCAGGAAA GTGGAATTTA AAGAGGATGG AAAGTTTTAT
ATCAACGATA AGCCCCTTAA ACTCAGGGGA CTTAACAGGC ATCAAATGTT TCCGTATCTA
GGTAATGCTA TGCCTGACCG GGGCCAGAGG AAGGATGCTG AGATTTTGAA GTATGAACTG
GGGTTAAATT TTGTCCGTTC TTCCCATTAC CCGGCTGATT CTTCGTTTTT AGATAAGTGT
GATGAAATAG GTTTATTAGT CCTGGAAGAG ATCCCCGGAT GGCAGCATAT CGGGAACAGG
GACTGGCAGG AGTTATCTAA AAGAAATGTT GAAGAGATGA TAGTCAGAGA CCGGAACCAT
CCCTGTATTT TCCTCTGGGG TGTTAGAATT AATGAATCTC CGGATAACCA TGATTTTTAC
CTTGAGACAA ATGAAATTGC CCACAGACTG GACAGTACCA GGCCGACCTG TGGGATAAGG
AATTTTCAGG ATAGTGAGTT TCTGGAAGAT GTATTTACTT ATAATGATTT TGAGTTAAAT
CTCGAAGGAA AAATTAAATT ACCTAACCAC CAACCATATA TGATAACCGA ATATATGGGT
CATATGTATC CAACCAAGGC CTATGATAGT GTCGAAAGGT TAATTAAACA CGCTGTCCGG
CACGCCCATA TACAGGATAA GCAGTATGGG GTACCTTATC TGGCAGGGGC CTCAGGGTGG
TGTGCCTTTG ATTATAATAC CCATGCTGAT TTTGGATCAG GTGACAGGGT ATGCTATCAC
GGAGTCTGTG ATATGTTCAG GTTACCCAAA TTTGCTGCTT ATTTTTATAA AAGCCAGATA
GACCCGGATG TGGAAAAGGT TGTATTTATT GCTCGATACC TGACCCCATC TTTTAATGAG
GATTATGGCG ATGAGGTTAT TGTTTTTAGT AACTGTGAAG AGGTTGAACT ATATGTTGGT
GATAAATTAA TAACATCAGC TAGACCAAAC CGGGTTGATT ACCCCAGTTT ACCCCACCCG
CCCTTTACCT TTAAAGACTG TACCTGGTGG GAGTGGGGGG CCAGCACCAT TTCCTGCCTG
AAAGCGGTCG GTAAAATAGA TGGGAAACAG GTTGCCGAGC ACACTATTTA TCCCTTTGGC
AGGCCGGAGA GGTTAGTATT AAAGCCGGAT TACACTAAAC TTACGGCAGA TGGTGCTGAT
TGTACCCGGG TTGTGGTTGA GCTTCAGGAT GAGCACGGAC AGGTCCTCCA TCTGGCCCAT
CATCCGGTTT TCTTTGAACT GGAAGGGGTG GGGGAACTAA TTGGAGAAAA CCCCTTTAGC
CTGGAAGTAG GGAGAGGTGC TGTCTTTATA AGGGCCGGGA GAACTCCAGG GAAAATACAG
CTGACAGGTA AGGTCCAGGG ATTACCACCG GTCACAATAG TTGTATCTAC TGAACCTCTG
GAAGATAAGA TAGTACCATT ACCCAGGAAA TAA
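
The gene sequence above is printed in space-separated blocks of 10 bases, so it has to be cleaned before any length or GC calculation. The following is a minimal sketch of that step. The helper names `clean_sequence` and `gc_percent` are hypothetical, and only the first line of the sequence is used here for illustration, so the printed GC value is not the whole-gene figure reported in the record.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence, copied from the record above.
first_line = "ATGAAACGGA AAATATTAAA CTTTAATACG GACTGGTTTT TTCTTGATAA GGATATAGAA"
seq = clean_sequence(first_line)
print(len(seq))                   # 60
print(round(gc_percent(seq), 1))  # GC content of this first line only
```

Passing the full sequence text to `clean_sequence` and then to `gc_percent` gives a whole-gene value that can be compared with the GC content field.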
 
Protein sequence
MKRKILNFNT DWFFLDKDIE GAKGIDFSQS GMEKVNLPHP NRILPHHYFE ESDYQFVSWY 
RRPFYLEEEY KGKRVIVEFD GVMTVAEIYV NGQFVGEHKG GYTSFSFDIT DYLLSGENNL
LAVRVDSSQR KDIPPEGNLV DYLLFGGIYR DVKMVIVDPV YINWSFIELK DVNLEAGVIK
PRFELVNTTG NRQKIVLNSQ VINKEGKVVA MVESRHLLEP GVTSLEQPEV KIKEPELWHP
DHPYLYHVYT EVKVEGKLVD DYKTRIGLRK VEFKEDGKFY INDKPLKLRG LNRHQMFPYL
GNAMPDRGQR KDAEILKYEL GLNFVRSSHY PADSSFLDKC DEIGLLVLEE IPGWQHIGNR
DWQELSKRNV EEMIVRDRNH PCIFLWGVRI NESPDNHDFY LETNEIAHRL DSTRPTCGIR
NFQDSEFLED VFTYNDFELN LEGKIKLPNH QPYMITEYMG HMYPTKAYDS VERLIKHAVR
HAHIQDKQYG VPYLAGASGW CAFDYNTHAD FGSGDRVCYH GVCDMFRLPK FAAYFYKSQI
DPDVEKVVFI ARYLTPSFNE DYGDEVIVFS NCEEVELYVG DKLITSARPN RVDYPSLPHP
PFTFKDCTWW EWGASTISCL KAVGKIDGKQ VAEHTIYPFG RPERLVLKPD YTKLTADGAD
CTRVVVELQD EHGQVLHLAH HPVFFELEGV GELIGENPFS LEVGRGAVFI RAGRTPGKIQ
LTGKVQGLPP VTIVVSTEPL EDKIVPLPRK
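
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below builds a codon table from the compact amino-acid string used for NCBI translation tables. Table 11 assigns the same amino acids as the standard code and differs only in which codons may act as start codons, which this sketch does not model. The function name `translate` and the stop-at-first-stop-codon behaviour are choices made for this illustration.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene, then its last line (which starts in frame).
print(translate("ATGAAACGGA AAATATTAAA CTTTAATACG"))         # MKRKILNFNT
print(translate("GAAGATAAGA TAGTACCATT ACCCAGGAAA TAA"))     # EDKIVPLPRK
```

The two outputs match the first and last ten residues of the protein sequence listed above.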