Gene Hore_20690 details


Gene Information

Locus tag: Hore_20690
Symbol:
ID: 7314393
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halothermothrix orenii H 168
Kingdom: Bacteria
Replicon accession: NC_011899
Strand:
Start bp: 2238347
End bp: 2239498
Gene Length: 1152 bp
Protein Length: 383 aa
Translation table: 11
GC content: 40%
IMG OID: 643612513
Product: ROK family protein
Protein accession: YP_002509809
Protein GI: 220932901
COG category: [G] Carbohydrate transport and metabolism
              [K] Transcription
COG ID: [COG1940] Transcriptional regulator/sugar kinase
TIGRFAM ID:
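
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: the function name is hypothetical, and it assumes the coordinates are 1-based and inclusive and that the reported protein length excludes the terminal stop codon.

```python
def check_cds_lengths(start_bp, end_bp, gene_length_bp, protein_length_aa):
    """Return True if the record's length fields agree with its coordinates.

    Assumptions (not stated in the record itself):
      - start_bp and end_bp are 1-based, inclusive coordinates
      - the protein length does not count the terminal stop codon
    """
    span = end_bp - start_bp + 1      # inclusive span in base pairs
    if span != gene_length_bp:
        return False
    if span % 3 != 0:                 # a CDS should be a whole number of codons
        return False
    codons = span // 3                # includes the stop codon
    return codons - 1 == protein_length_aa


# Values copied from the Gene Information fields above.
print(check_cds_lengths(2238347, 2239498, 1152, 383))  # True
```

Here 2239498 - 2238347 + 1 = 1152 bp, which is 384 codons, or 383 amino acids plus one stop codon.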


Plasmid Coverage information

Num covering plasmid clones: 51
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAAAAA TAAACCAGCA GACAATTTTA GAATTAATAA ATAATAAAGG GCCTATTTCC 
AGGGCTGAAA TAGCTGAGAT TACTGGATTA ACGCCGGCTA CTGTCTCCAA CATAGTAAAG
GATCTCCTTA AGATGGATCT GGTCAGAGAA ACCCGGCAGG GAGAATCCCG GGGAGGAAGA
AAACCCATCT TACTGGAGGT AAATCCAGAA GGAGCCTACG TAATCGGCCT TGAATGGGGA
ATAGGGGAAA TAAAGGCTGT TCTACTTAAT TTAAATAAGA AGGTAATTAA AACTATAAAA
AAACAGGTAG ATAGTTTTAA ACCTGAGTGG TTTTTAAAGA CTACAGTAAC AATATTTGAA
GAGGTTACTG GTTATGTAGA AAATCCAGAT AAGGTATTTG GTCTCGGGAT AGGGATTCAT
GGTTTAGTTG ATCCAGATGA AGGTGTTTCC CTGTATGCCC CCCATTTTGG CTGGGAGAAT
ATTAAAATAG GTAAATTATT AAAACAGGAA TTACAGATTC CTATTATGCT GGATAATGAT
GTCAGGATGA TGGCCCTGGC TGAAAAATGG GAAGGCAGGG ATAATTTTAT ATTTATTAAC
ACCGGGCCAG GGATAGGTTC AGCTATAGTT ATTAAAGGAG AACTCCTCTA TGGTAGAGAT
TTCGGAGCCG GGGAATTCGG CCATATGACT ATTGTTGAAG ATGGGGCCCT CTGTAGTTGT
GGTAATCGCG GTTGTATTGA AGCCCTGGTT TCTGTTAATA ACCTTGTCAG GGAATATAAT
GATTCACTAC CGGAACATAT ATCATTCCAT GATATAAAGC GGGAGTGGAA TCTTTTAATA
GATTTAGCCC GTGAAGAAAA ATCCAGGGCC TATTCTATAA TTGAAAAGGC GGGCGTGTAT
CTAGGTAAGG GAATAGGAAA TGTGGTTAAT CTTTTAAACC CGGAAGCGGT AGTAATCGGA
GGAGACTTTT TACTGGCCAG GGATTTGATT TTTCCGGTTA TTAAAGAACA GGTATTAGAG
ACTGCCCTTA AGGTTCCGTC AAGGGACCTT GAAATAACAG GGACTGCTTT TGGTGAGAAG
GTTGGTGCTA TCGGGGCCGG TACCAGAGTC CTGCAGGAAA TTTTTAAATT AAAAAAGGAG
GAAGATAAAT GA
 
Protein sequence
MKKINQQTIL ELINNKGPIS RAEIAEITGL TPATVSNIVK DLLKMDLVRE TRQGESRGGR 
KPILLEVNPE GAYVIGLEWG IGEIKAVLLN LNKKVIKTIK KQVDSFKPEW FLKTTVTIFE
EVTGYVENPD KVFGLGIGIH GLVDPDEGVS LYAPHFGWEN IKIGKLLKQE LQIPIMLDND
VRMMALAEKW EGRDNFIFIN TGPGIGSAIV IKGELLYGRD FGAGEFGHMT IVEDGALCSC
GNRGCIEALV SVNNLVREYN DSLPEHISFH DIKREWNLLI DLAREEKSRA YSIIEKAGVY
LGKGIGNVVN LLNPEAVVIG GDFLLARDLI FPVIKEQVLE TALKVPSRDL EITGTAFGEK
VGAIGAGTRV LQEIFKLKKE EDK
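
The gene and protein sequences are printed in space-separated blocks of ten characters. As a minimal sketch of how such blocks might be processed, the code below strips the spacing, computes GC content, and translates the coding sequence. The helper names (`clean`, `gc_percent`, `translate`) are hypothetical and not part of any database tool. Translation table 11 uses the same codon-to-amino-acid assignments as the standard code (it differs in the permitted start codons), so a standard codon table is used here.

```python
# Standard codon assignments, listed in TCAG order for each codon position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the block spacing and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_percent(seq):
    """Percentage of G and C bases in the cleaned sequence."""
    seq = clean(seq)
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate codon by codon from the first base, stopping at a stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence above.
print(translate("ATGAAAAAAA TAAACCAGCA GACAATTTTA"))  # MKKINQQTIL
```

Passing the full gene sequence to `translate` should reproduce the protein sequence once its spacing is removed, and `gc_percent` can be compared against the GC content field.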