Gene Hore_17430 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHore_17430 
Symbol 
ID7313665 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalothermothrix orenii H 168 
KingdomBacteria 
Replicon accessionNC_011899 
Strand
Start bp1862024 
End bp1863724 
Gene Length1701 bp 
Protein Length566 aa 
Translation table11 
GC content35% 
IMG OID643612190 
ProductOrganic solvent tolerance protein OstA 
Protein accessionYP_002509487 
Protein GI220932579 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1452] Organic solvent tolerance protein OstA 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones48 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTTAAAA ATACAATGAT AATAAAGGAA TTTAGGCGGC TTTTTATTTT ATTTATAATT 
CTTATAATTA TAATAACAGG ATTTAGCTCT TATATTGTTC AGGCCAGCAG TGAAGAGGTA
GCCCCCTATG ACCTTAAAGC TGCCGGGGAT ATCCAGTATG ATTTAAAAAA TGGGCTTATT
TATGCCACAA AAGATGTTAG TTTTAAGATT AATAATTTAA ATATAGAATG CCAGAAACTA
AAGGTTAATC TGGCTGCAGA GGAAATAATA GCAGAGGGTG GTATCAAATT AATACTTAAA
GATGAGACCC TTATCGGAAA GAGTCTGGTT TACAATTATG GAACAGGTTC AGGAACTATG
GTTCAGGCAA AAACGAAAAT AGATAGCCTT AACTTCCGGG GAGGGGTAAT CCGGCTGGTT
GAAGGAGAAG AATACCGGGC TGAAATAGAG GACGCCAGCT TTACACCATG TATTCTGGAT
GAACCTCATT ACCAGATTAA AGCGAAGCGG GTTAAAATCT ATCCTGATGA GAAGGTAGTT
GGTGAAGATG TCGGGTTCTG GATAGGAAAA GTAAAGATAA TTTCTCTACC GGGTTATGTA
GTAGAATATA AAGAAGATGA AAAAACAGGG AAGAAGGAAC TATATAATAC CAGTCCTGTT
CCTTCTCTAG GTTATAATAC AGAGGATGGA CTTATTTTAA GTATATATTA TCCCTATCAA
TATGGAAATA ATCTCGAGGG TAAATTAAAT GCTTCAACAA CCCAGGTCGG AGGGCAGAAA
GCTATTCTGG AGAATAGATA TAAGATAACC CCGGACCTGG TAGCCACCAC CAGATATGTT
TACAGTAAGG AATTCGAAGA TGATGAGATC AATAAAGATA GTTTATTAAA GGGTGGCTTA
AGTTTTAAAA GGAATAGATT GAAAGTTACA GGACTACTGG GTTATGATTT TATAGATGAG
AAGCGCCAGG AAGAACTGGA TATAAAATAC CGTTATAGTA ATAGAGCTAA TATAACCTTA
TACCATGGTT TTACCAATGA AATGTTAGAA CAACAACTTT ATAAGTTAGA TGGACATATT
GGCCGATATC AATGGGAACT TAAGTATAAT AAAGGTTATG ATATTGATTC TTTCCCCTTT
ATCAGGTTAT ACTCACCCCG TTATAATTTA AATCTATTTA ATTTGAAGTT CATTACAGGT
GGTGGCCGGG TCACCAATAA AGGGATAACT ACAGATAAAG GCCTTGTCAA GCTGCTAATA
GACAAAAAAC TAAGGTTAGC GACAGGACTT GACCTTACTT TTCAGGAAAA GGTAACAACT
AATTTATATT ATAAAGATAG CAGTTTTTCT GATTATAAAG TTTATGATTC CAATCTTGGT
ATTGACTATG GAATTAACCT CGGGACCTCT TTAGATATAA ACTCAAATTT AAAATTTCAA
ATGGTTAATA CTGAAGGGGA CCCTTTCTTG CCTGATGATA CAGCAGAAGA AGTTAAATTA
TTAAAACCGG GCCTTAGTTT TAAGTATAAA CTACCGGAAC CCGGATCTAT GTGGATATTA
AAATTAAATG GTTCATATAA CCTCTACAGC AATTTTTGGG AAAGTGGAAC AGTGTTAGTT
CAGAGAAATT ATGATTGTTT CAATTATTCC CTGGAAGTTG ATTTAGTCAA TGAAAGCCTC
GGGGTCAATA TTAATTTTTA A
 
Protein sequence
MVKNTMIIKE FRRLFILFII LIIIITGFSS YIVQASSEEV APYDLKAAGD IQYDLKNGLI 
YATKDVSFKI NNLNIECQKL KVNLAAEEII AEGGIKLILK DETLIGKSLV YNYGTGSGTM
VQAKTKIDSL NFRGGVIRLV EGEEYRAEIE DASFTPCILD EPHYQIKAKR VKIYPDEKVV
GEDVGFWIGK VKIISLPGYV VEYKEDEKTG KKELYNTSPV PSLGYNTEDG LILSIYYPYQ
YGNNLEGKLN ASTTQVGGQK AILENRYKIT PDLVATTRYV YSKEFEDDEI NKDSLLKGGL
SFKRNRLKVT GLLGYDFIDE KRQEELDIKY RYSNRANITL YHGFTNEMLE QQLYKLDGHI
GRYQWELKYN KGYDIDSFPF IRLYSPRYNL NLFNLKFITG GGRVTNKGIT TDKGLVKLLI
DKKLRLATGL DLTFQEKVTT NLYYKDSSFS DYKVYDSNLG IDYGINLGTS LDINSNLKFQ
MVNTEGDPFL PDDTAEEVKL LKPGLSFKYK LPEPGSMWIL KLNGSYNLYS NFWESGTVLV
QRNYDCFNYS LEVDLVNESL GVNINF