Gene Hore_00510 details


Gene Information

Locus tag: Hore_00510
Symbol:
ID: 7314268
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halothermothrix orenii H 168
Kingdom: Bacteria
Replicon accession: NC_011899
Strand:
Start bp: 56965
End bp: 59133
Gene Length: 2169 bp
Protein Length: 722 aa
Translation table: 11
GC content: 41%
IMG OID: 643610468
Product: RNA binding S1 domain protein
Protein accession: YP_002507807
Protein GI: 220930899
COG category: [K] Transcription
COG ID: [COG2183] Transcriptional accessory protein
TIGRFAM ID: [TIGR00426] competence protein ComEA helix-hairpin-helix repeat region
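
The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive (an assumption, though it matches the listed gene length) and that the protein length excludes the stop codon. The function names are illustrative and not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, not counting the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(56965, 59133)
print(length)                  # 2169
print(protein_length(length))  # 722
```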


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000000000000209068
Plasmid hitchhiking: No
Plasmid clonability: unclonable
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAATCAAA GAATAATTGG ACAAATCTCT AAAGAATTAA AACTGAAAAC TAACCAGGTA 
AAAGGAACGG TTAAACTTCT TGATGAAGGT AATACCGTTC CTTTTATTGC GCGTTACCGT
AAAGAAGTGA CTGGAGGACT TGATGAAGCA CAGATAAGGA CTATTGAAGA AAGACTGGAA
TACCTCCGCA GTCTTCAAAA GCGGAAGGAA GAAGTTATAA GGCTGATTGA AGAGCAGGGA
AAGTTAACTC CGGAACTTGA AGAAAAGATT AAAAAAGCTT CCATTTTACA GGAAGTAGAA
GATCTCTACA GGCCTTATAA GCAGAAGCGG CGGACCCGGG CTACCAGGGC CAAAGAAAAA
GGCCTTGAAC CCCTGGCTAA GTTAATGTGG ACACAAGAAC TTACTTCTGG TAATCCTGAA
GATATAGGTA AGGAATATAT CAACCCCGAA GTTGAACTGG AGAGCATTGA AGATGTTTAT
CAGGGAGCCA GGGATATTAT AGCAGAATGG GTTTCAGATG ATGCCGGAAT TAGAAAAGAA
ATCAGGAAAA TAACCTTTAA GCAGGGAGTT ATTCAGAGCA CCTGTAAAGA TAGTGAGACC
GATGATGAAG GCAAATATGA GATGTATTAT GATTACAGGG AACCTGTCAG TAAAATACCA
CCCCACCGGG TTCTGGCTAT TAACCGGGGG GAGAAAGATG AAGTGCTCCA GGTTAAGGTT
TTAGCTCCTG AAGAAGATAT TATAGAATTA ATCAAGGATA GGGTGGTTAA CAATCCTGAA
AGTATATTTT ACAATGATAT AATTGAAGCT ATTAAAGATG GATATAAAAG GTTAATTGCT
CCTTCCATTG AAAGGGAGGT TAGAAATAGT CTTACTGAAA AAGCAGAAGA GCATGCCATA
AATATTTTTT CTAAAAACCT TCGCAATCTG CTTTTGCAGC CACCACTCAG AGGTCATACT
GTTATGGGAA TTGACCCTGC CTATAGAACG GGTTGTAAAG TCTGTGTTGT GGACCCGACC
GGGAGGTTAC TGGATACAGC AACTATTTAC CCCCATCCGC CCCAGAGCCG GACAGGTGAA
GCTAAAAAGG TTGTTAAAGG TTTGATAAAT GAATACCAGG TTACTACGAT TGCTATCGGG
AATGGGACAG CATCCCGGGA AACCGAGTTT ATGGTTGCTG ATATAATTAA GGAACTTAAA
AACACTCAGG TTAACTATGT AATAGTAAAT GAAGCCGGGG CTTCAGTTTA TTCTGCATCC
AAACTGGCCA GAAAAGAGTT TCCTGAACTC GATGTAGCCA TGAGAGGAGC CATTTCCATT
GCGAGGCGGT TACAGGACCC CCTGGCTGAG CTTGTTAAAA TAGATCCCAA ATCCATTGGG
GTTGGTCTTT ATCAGCATGA TGTTAATCAA AAAAACCTTG AAAAATCCCT CGGTAATGTA
GTGGAATCGG CCGTTAATTA TGTTGGAGTT GATTTAAATA CAGCTTCGCC ATCCCTTTTA
AAATATGTGG CCGGTATTAA TAGCCGGGTG GCGTCAAATA TTGTTAAATA CCGTGAGGAA
AATGGTAAAT TTGAAACCAG GGATGAATTA TTAAAGGTGA AGGGTCTGGG TAAAAAAACA
TTTACCCAGG CAGCTGGTTT TTTAAGAATA CCGGATGGAA CAAATCCCCT GGATAATACC
CCAATCCATC CTGAATCCTA TCAGGCCGCT AAAGGTCTAT TACAGGATGT CGGGTTTAAA
CTGTTAGATA TTACTGATAA GGAAAAGCTT AAGGAAGTGC GTGAAGAGCT GGACTCCATC
AATATAAAAT CCAGGGCTGA AAAACTGGAG ACAGGAATAC CAACTTTAAA AGATATTGTA
GATGCTTTAA AAAAACCGGG ACGCGACCCG CGTGATGAAT TACCTAAACC TATCTTCAGG
TCTGATGTAT TGAAAATGGA AGATTTAGAG GCTGGCATGC TCCTTCAGGG TACGGTCCGG
AATGTAGTGG ATTTTGGTGC TTTTGTTGAT ATTGGGGTCA AGGTGGACGG GCTTGTTCAT
ATTTCTGAAA TGAGTCATGA TTATGTAGAT GATCCCCTCA AGGTGGTACA GGTAGGGGAT
ACTGTAAAGG TTAAAATATT AGAGGTAGAT GAGAGGCGAA ACAGGATTTC CCTGAGTATG
AAGTTGTAG
 
Protein sequence
MNQRIIGQIS KELKLKTNQV KGTVKLLDEG NTVPFIARYR KEVTGGLDEA QIRTIEERLE 
YLRSLQKRKE EVIRLIEEQG KLTPELEEKI KKASILQEVE DLYRPYKQKR RTRATRAKEK
GLEPLAKLMW TQELTSGNPE DIGKEYINPE VELESIEDVY QGARDIIAEW VSDDAGIRKE
IRKITFKQGV IQSTCKDSET DDEGKYEMYY DYREPVSKIP PHRVLAINRG EKDEVLQVKV
LAPEEDIIEL IKDRVVNNPE SIFYNDIIEA IKDGYKRLIA PSIEREVRNS LTEKAEEHAI
NIFSKNLRNL LLQPPLRGHT VMGIDPAYRT GCKVCVVDPT GRLLDTATIY PHPPQSRTGE
AKKVVKGLIN EYQVTTIAIG NGTASRETEF MVADIIKELK NTQVNYVIVN EAGASVYSAS
KLARKEFPEL DVAMRGAISI ARRLQDPLAE LVKIDPKSIG VGLYQHDVNQ KNLEKSLGNV
VESAVNYVGV DLNTASPSLL KYVAGINSRV ASNIVKYREE NGKFETRDEL LKVKGLGKKT
FTQAAGFLRI PDGTNPLDNT PIHPESYQAA KGLLQDVGFK LLDITDKEKL KEVREELDSI
NIKSRAEKLE TGIPTLKDIV DALKKPGRDP RDELPKPIFR SDVLKMEDLE AGMLLQGTVR
NVVDFGAFVD IGVKVDGLVH ISEMSHDYVD DPLKVVQVGD TVKVKILEVD ERRNRISLSM
KL
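
The gene and protein sequences above can be cross-checked against each other with a minimal sketch. It builds a codon table from the amino acid assignments of NCBI translation table 11, which are the same as those of the standard code; table 11 differs only in its permitted start codons, and this sketch does not handle alternative starts (the gene here begins with ATG). The helper names `clean`, `translate`, and `gc_percent` are illustrative. The sample inputs are the first and last lines of the gene sequence, with the 10-base spacing left in.

```python
BASES = "TCAG"
# Amino acid assignments in NCBI order (first base varies slowest, order TCAG).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to display the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for pos in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[pos:pos + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a non-empty sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


first_line = "ATGAATCAAA GAATAATTGG ACAAATCTCT AAAGAATTAA AACTGAAAAC TAACCAGGTA"
last_line = "AAGTTGTAG"
print(translate(first_line))  # MNQRIIGQISKELKLKTNQV
print(translate(last_line))   # KL
```

Passing the full gene sequence to `translate` should reproduce the listed protein sequence, and `gc_percent` can be compared against the GC content field in the same way.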