Gene Hore_19540 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHore_19540 
Symbol 
ID7312769 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalothermothrix orenii H 168 
KingdomBacteria 
Replicon accessionNC_011899 
Strand
Start bp2095419 
End bp2096594 
Gene Length1176 bp 
Protein Length391 aa 
Translation table11 
GC content40% 
IMG OID643612400 
ProductROK family protein 
Protein accessionYP_002509696 
Protein GI220932788 
COG category[G] Carbohydrate transport and metabolism
[K] Transcription 
COG ID[COG1940] Transcriptional regulator/sugar kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones60 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATGTTA AAGTGGGAAA CTCCACCTTT ATAAAAAATT TAAACCAGAA AAGGATTTTT 
AAGCTGATTC ATAAAGAGGG ACCAATAACC CGGAAAGAGT TAGCCGATAA TACTGATTAC
AGTGCCGGTA CTATTTCCAA CCATGTTAAG GCCCTGATTG ATCAGGGGTT TGTTATTGAA
ACTGAAAAGG GATATTCCAG TGGTGGAAGA AAACCGGTCT ATCTTACCGT TAACCCCGAT
AAGGGATATA TAATAAGTGT TGAAGTAGAA GTGACCCGGG TTAAAATTGT ATTATTTAAT
TTAAAAATTA AGGTTGTGGC CAAAACTGTA TTTCCAATAA ACGGGCCCGC CCAGGCGAAG
GAGACCATTT CTCAAATTTT TTCAGAAATT GAGAAAATCC TGTCAGAAAG GGAAATAAAG
CCAGATAAAA TACTCGGTAT CGGGGTGGCT GTTCCCGGCC TGATTGATAA AGAAGAAGGT
TTGCTGGAGT TTGCTCCCAA TTTAGGCTGG AGTAAAGTGC CTATTGTAAA ATATTTTGAG
GATAAATATG GGGTTCCGGT TGTTCTTGAA AATGAAGCCA ATGCAGCAGC AGTAGGTGAA
AAGGAGTTTG TTTATCCCGA TATTAAAGAT ATGGTTTATG TATCGATTAA TGAAGGTATC
GGGTGTGGGC TTATCTTTAA TGGACGGCTT TACCGGGGAG CCGGAGGTAA TGCCGGTGAG
TTTGGACATA TTATCATTGA TAGTGATGGA CCATTATGCC ACTGTGGTAA TAGTGGGTGC
TGGGAAACCC TGGCTTCTGA AAACCATATC CAGAAGGAAT ATCAGGAGTT GACCGGGGAA
GAAAATGTGG AAAAGAACGA AATTTACCGG AAAGCCATTG AAGGTGAGGA TAGGGCCTTA
AGTGTGGTTA AACAGGCTGC TCATAATATC GGCCTGGGCC TTGCCAATAT AGTTAACAGC
CTCAGCCCGA GGTTGATTGT CCTTGGAGGG GGTATAATTG AAGCCGACAC CTTAATTATA
GATACTGTAC AAAGTATTCT GAAAGAAAAA TGCCTGCTCC TTTCTTATGA TAAGGTTGAT
ATTGAGTTTA CCAGGCTTAA AGATCTGGCC TGTTTGTACG GGCTGGCCAG TTATGTTTTT
AATAAAAGTA TAGATTTTGA AACTAAAAAA AATTAG
 
Protein sequence
MNVKVGNSTF IKNLNQKRIF KLIHKEGPIT RKELADNTDY SAGTISNHVK ALIDQGFVIE 
TEKGYSSGGR KPVYLTVNPD KGYIISVEVE VTRVKIVLFN LKIKVVAKTV FPINGPAQAK
ETISQIFSEI EKILSEREIK PDKILGIGVA VPGLIDKEEG LLEFAPNLGW SKVPIVKYFE
DKYGVPVVLE NEANAAAVGE KEFVYPDIKD MVYVSINEGI GCGLIFNGRL YRGAGGNAGE
FGHIIIDSDG PLCHCGNSGC WETLASENHI QKEYQELTGE ENVEKNEIYR KAIEGEDRAL
SVVKQAAHNI GLGLANIVNS LSPRLIVLGG GIIEADTLII DTVQSILKEK CLLLSYDKVD
IEFTRLKDLA CLYGLASYVF NKSIDFETKK N