Gene Hore_10150 details

Gene Information

Locus tag: Hore_10150
Symbol:
ID: 7314603
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halothermothrix orenii H 168
Kingdom: Bacteria
Replicon accession: NC_011899
Strand:
Start bp: 1105164
End bp: 1106384
Gene Length: 1221 bp
Protein Length: 406 aa
Translation table: 11
GC content: 37%
IMG OID: 643611454
Product: Sporulation integral membrane protein YlbJ
Protein accession: YP_002508766
Protein GI: 220931858
COG category: [S] Function unknown
COG ID: [COG3314] Uncharacterized protein conserved in bacteria
TIGRFAM ID: [TIGR02871] sporulation integral membrane protein YlbJ
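
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS is unspliced and ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length_aa(gene_len_bp: int) -> int:
    """Codon count minus one terminal stop codon, for an unspliced CDS."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(1105164, 1106384)
protein = expected_protein_length_aa(length)
print(length, protein)  # 1221 406
```

Both results agree with the Gene Length and Protein Length fields listed in the record.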


Plasmid Coverage information

Num covering plasmid clones: 29
Plasmid unclonability p-value: 0.206415
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCATTCTT ATAACCATGA TAAAAAAATA GTTATAACAG CTATAATTGC AGTTATAACT 
ACTATCCTAA TTATTATTTT TTCAGAAAAT GCCTTTAATG CAGCTTTGGA AGGGTTAGAA
GTCTGGTGGG AAGTCGTATT TCCTTCCCTG TTACCCTTTT TTATAATTGC CGAAATATTA
ATGGGACTGG GTGTGGTCCA CTTTATGGGA GCACTGATGG AACCACTTAT GAGACCATTA
TTTAAAGTAC CAGGGGTAGG GGCCTTTGCC ATGGCAATGG GGCTGGCTTC CGGTTACCCC
ATCGGTGCTA AAATAACTGC AGCCTTAAGA CGTAAAAAAT TATGCACCAA AACTGAAGCA
GAAAGGTTAG TTTCCTTTAC CAACACCGCT GACCCTCTAT TTATGATTGG AGCTGTAGCA
GTAGGTATGT TTCACCGGGC AGACCTGGGA ATAATTATTG CCGGAGCCCA TTATATATCC
AGTCTGATTA TTGGTTTTAT AATGAGATTT TATCGGGGAA GAGAACAAAG AAAAAATAAA
ACAGAAAAAA AACGAAAGAA AAACATATTT ATTTATGCCC TTGAAGAACT AATTGAAGCC
CGTAAAAATG ACGGAAGACC CCTGGGAGAG CTGGTTGGTG ATGCCGTAAA GGAATCAGTA
AACACCCTCC TTTTAATCGG TGGATTTATA ATCCTTTTTT CAGTTATAAC AGAGATTATA
ATTGTAACCG GCTTAATAAC AGTCTTATCA AACATTATAT CCTTTATTCT CCATCCCCTG
GGTCTATCTT CAGAAATGGT CTTACCGATG ATAAGTGGCT TTTTTGAGAT AACCAATGGA
AGTAATCTGG CCAGTCTGAC ACAAAGTCCC ATGTTACACA AAATGATTGT AGTAAATGCC
ATAATTGCCT GGAGTGGCCT ATCAGTCCAC GCGCAGGTAG CAACTATGGT CCATGGTACT
GATATTAATC TAAAACCATA TTTTTTGGCG AGAATTTTAC AGAGTGTTAT CGCCGGTGCT
GTTACTATAT TCCTCTTTAA ACCCTTTATT ACAGAAACAG AACCAACCAT GTTAACTGTA
GTTAACAACC TTGTGCCACA AAATGGTGTT ATTATTAGCT TTGGTTTTAT AGTTTTAATT
TTAATGATAA GTTTTATGTT ATCAATGATT TTGTATTTAC TACAGCGTAT AGAGGTCATA
TTCTTCCACT ACCGTGAATA A
 
Protein sequence
MHSYNHDKKI VITAIIAVIT TILIIIFSEN AFNAALEGLE VWWEVVFPSL LPFFIIAEIL 
MGLGVVHFMG ALMEPLMRPL FKVPGVGAFA MAMGLASGYP IGAKITAALR RKKLCTKTEA
ERLVSFTNTA DPLFMIGAVA VGMFHRADLG IIIAGAHYIS SLIIGFIMRF YRGREQRKNK
TEKKRKKNIF IYALEELIEA RKNDGRPLGE LVGDAVKESV NTLLLIGGFI ILFSVITEII
IVTGLITVLS NIISFILHPL GLSSEMVLPM ISGFFEITNG SNLASLTQSP MLHKMIVVNA
IIAWSGLSVH AQVATMVHGT DINLKPYFLA RILQSVIAGA VTIFLFKPFI TETEPTMLTV
VNNLVPQNGV IISFGFIVLI LMISFMLSMI LYLLQRIEVI FFHYRE
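
The gene sequence above is printed in space-separated blocks of ten bases, so the whitespace has to be stripped before any statistic such as GC content is computed. The following is a minimal sketch of that calculation. The helper name `gc_fraction` is hypothetical, and the example input is only the first two blocks of the gene sequence, not the full 1221 bp, so its result is not expected to match the GC content field of the record.

```python
def gc_fraction(sequence: str) -> float:
    """Fraction of G and C bases, ignoring spaces and line breaks."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# First two ten-base blocks of the gene sequence shown above.
first_blocks = "ATGCATTCTT ATAACCATGA"
print(f"{gc_fraction(first_blocks):.0%}")
```

Passing the complete gene sequence, with its line breaks and spaces left in place, would give the value to compare against the GC content field.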