Gene Hore_12560 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHore_12560 
Symbol 
ID7313577 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalothermothrix orenii H 168 
KingdomBacteria 
Replicon accessionNC_011899 
Strand
Start bp1349170 
End bp1350432 
Gene Length1263 bp 
Protein Length420 aa 
Translation table11 
GC content36% 
IMG OID643611696 
Productputative membrane CBS domain protein 
Protein accessionYP_002509001 
Protein GI220932093 
COG category[R] General function prediction only 
COG ID[COG1253] Hemolysins and related proteins containing CBS domains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones52 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATATATA ACGGTATTGG GTTAATCATA TTATTTATTT TATCTGGATT TTTTTCCGGT 
GCAGAAACTG CTTTGATGTC AGTTAATAGG ATTCGTATTA AAGAACTGGC CAATCAGGGC
GATAAAAGGG CAAGGCTGGT CGACAGTCTT TTAAATAATA AAACCAGATT ATTGACCACT
ATTTTAATAG GTAATAACCT GGTTAATATA TGGGCCTCGG CCATTGCTAC ATCTATAGCC
ATTTCCCTGT TTGGTAACAA AGGTGTCGGG ATCGCTACAG GTGTTGTTAC CCTGCTGGTT
CTTATCTTTG GTGAGATAAC TCCTAAAGCT ATGGGGAGTA AAAAGGCAGT CCGGTACTCA
AAATTTAGTT CAATTTATTT ATACTGGCTG GAAAGGGTTC TTTATCCAGT GGTTGTTTTT
TTTGAGTATT TAATAAAAAT ATTTGTTGAT AATGAAGATC TTCTATCATC AAAATTATTG
AGTGAAGAAG AGATTAAACG GTTTGTAAAT GTCAGTGAAG AAGAAGGGGT CATCAAAACC
GATGAACGTA GAATGATAAA TAGTATTTTT GAATTTGATG ATACAACGGT TAAGGAAATA
ATGGTTCCCA GAATAGATAT GGTCTGTATT AAAAGTGATA CTGAACTCTC TGAAGTTATA
AAAATAGCTG TAGACAGGGG TCATTCCCGT ATTCCGGTTT ATAAAAATAC TATTGATGAA
ATAATCGGTG TAGTTTATGT TAAAGATTTA CTCGGGTATT TAACCAAACC TGAGAATGAT
GCCAGACTGG CTGATTTTAT AAGGTCTCCT TATTATGTTC CTGAAAGTAA GAAAATTAAT
GAACTCTTAA CAGAAATGAA GAAAAAGAAA GTCCATATGG CCATTGTTCT TGATGAGTAT
GGGGGAACAT CGGGTCTGGT CACCATTGAA GATATCCTGG AAGAGATTGT CGGGGATATT
CAGGATGAAT ATGATACTGA GCCCAGCCAG ATAGAATTTA TCAATGATAA AGAATTATTA
ATTGATGCCC GGGTAGATAT AGATGACCTT AACGAGATCC TTCCAGAACC ACTACCAGGG
GAAGAAGATT ATGAAACTAT TAGTGGGTTT ATTTTACATT ATCTGGGGTA TGTCCCCAAA
ACGGGTGAAG AGCTTGAGCT GGATGGACTT CATATCCTGG TAGAAGAAAG CAGCAAACAT
CAGATTAAAA AAGTCAGGTT AAAAAGTTCT ACCAAACTTA ATAGAATAAA GGAAGGGGGG
TAA
 
Protein sequence
MIYNGIGLII LFILSGFFSG AETALMSVNR IRIKELANQG DKRARLVDSL LNNKTRLLTT 
ILIGNNLVNI WASAIATSIA ISLFGNKGVG IATGVVTLLV LIFGEITPKA MGSKKAVRYS
KFSSIYLYWL ERVLYPVVVF FEYLIKIFVD NEDLLSSKLL SEEEIKRFVN VSEEEGVIKT
DERRMINSIF EFDDTTVKEI MVPRIDMVCI KSDTELSEVI KIAVDRGHSR IPVYKNTIDE
IIGVVYVKDL LGYLTKPEND ARLADFIRSP YYVPESKKIN ELLTEMKKKK VHMAIVLDEY
GGTSGLVTIE DILEEIVGDI QDEYDTEPSQ IEFINDKELL IDARVDIDDL NEILPEPLPG
EEDYETISGF ILHYLGYVPK TGEELELDGL HILVEESSKH QIKKVRLKSS TKLNRIKEGG