Gene Hore_15150 details

Gene Information

Locus tag: Hore_15150
Symbol:
ID: 7313108
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halothermothrix orenii H 168
Kingdom: Bacteria
Replicon accession: NC_011899
Strand:
Start bp: 1617298
End bp: 1619328
Gene Length: 2031 bp
Protein Length: 676 aa
Translation table: 11
GC content: 32%
IMG OID: 643611958
Product: CRISPR-associated protein, Crm2 family
Protein accession: YP_002509260
Protein GI: 220932352
COG category: [R] General function prediction only
COG ID: [COG1353] Predicted hydrolase of the HD superfamily (permuted catalytic motifs)
TIGRFAM ID: [TIGR02577] CRISPR-associated protein, Crm2 family
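
The length fields in this record agree with the coordinates if the start and end positions are read as 1-based and inclusive, and if the final codon is a stop codon that is not counted in the protein length. Both points are assumptions about the record's conventions, not something the record states. A minimal sketch of that arithmetic, with illustrative function names that are not part of any database API:

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(1617298, 1619328)
print(length, protein_length_aa(length))  # 2031 676
```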


Plasmid Coverage information

Num covering plasmid clones: 53
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGAAAAAGA TATTTAAAGA ACACCCTTTA TCAGGGGAAC AATTGATAAA AACTTTTGAA 
GACTACATAC CTGAGATAAG TTTGAAGAAT AAATCATTAG CTATTTTTAC AATAGGACCG
GTTCAATCTT TTATATCTGC GGCTAAGAAA ACTAAAGAAT ACTGGGCTGG AAGTTATTTG
CTCTCATATT TAATCTGGCA GGCACTAAAA CATATCATGG AAGAAAGCAA AGAAGGTGAA
AACAGTATTA TTTATCCCTA TTTAAAAAAT CAGCCTCTTT ATAAAGATTT AAGAAATGAA
GAGATTACAC TACCTACTTT ACCCAATCGC TTTCTGGCAA TTTTAGAAAA CAATGAAGCA
GAACAATTAT TAGAAGAATG CAAAAAAATG GTTGAGAATA AACTTTTGGA TATTGCCATT
AATCTTTTCA ATATCAGTAA GGAAGATAAA ATATATAAAC AGCTGGAAGA ACTACTTGAA
ATATACTGGG TATTATTACC TCTGGAAAAG AGCAAAAAAA CAGAAATAGA AAGATACAGA
AGTTTAACTG GCATGGAACC AGCAAATGAT ATTTCGCATT ATTCTCTGGT AACATCTATT
ATTGAAGAAT TAATGGGGGC TCGTAAAAAT ATTAGGGAAT TTAATTATAT AGAAGAAAAG
GGTGTTAAAT GCTATATCTG TGGAGAAAGA ACTGGATTTG AACCTGGAAT CATGAATATT
GAAGATGAGA AGAAAATCTG TGGAGTCTGT GGTTTAAAAA GGAAGTTCAG TGATTATTTA
AAAACAGAAT TTGGAATTGA AATTCATTAT CCGTCAGTAA TTGATATTGC CACAGCCGAT
TATAAGGAAG AACTGGTTAG CAGTCTGGAC AAAGATGAGT TAAATCAGTT ATTGAATTAT
ATGGATAAGG AGTTTGGTAA CCAGGATTAC CAAATGCCCG AATTGGTTAA TGGACTGGTA
GGAGATGTTA AAAGTAAGTC TCCGGAGGAA AAAAAGGTAT TAAAAATAAA AGGAACATAT
TTTGACATTG AAGGTAATGA ACTTAAGGAA AGTATAAAAG TAAAAAAGCT ACTTAAAAAG
ATTAATAAAA AATATAATCT TAAATTAAAT AAATATTATG CCTTGCTTTA TATGGATGGA
GATGACATGG GGAAATGGGT AAGTGGTGAG AAGTTAAAAC ATAGAGAAAT GACTCCAGAA
TTTCATTACC TGATTAGTAA GTCTCTTACC AATTATTCTT TAAAGTTTGT TAAAAGGATT
GTTGAAGGTA AAGGACGCAG GGGAAAATTG GTTTATTCAG GTGGTGATGA TGTAGTAGCT
TTTGTCAATT TAAAAAATCT TTTTGAAGTG CTCAGGGAGT TAAGGGGTTA TTTTTCAGGT
AATGTATTTG ATAATCAAGT TAATTATAAG GCCAGTGATG GAATTATACG AGAAAAGGGA
GAGGACATAT TAACCCTGGG TAGTAAAGCA AGTGTCAGTA TGGGTATTTG TATTGCCCAT
TATAAAGAAC CCCTTCATTT TGTTATTAAA AATGCAAGGG CAATGGAGAA AAAAGCCAAA
AAACATATTA TTTCAGGGAA TCGAAAGAAG AATGCCTTTG CCATTGGATT AATTAAACAC
TCCGGTGAAA CAAGGGAAGC GGTCTCAAAA TGGTTTTATG ATGATGGAGA AGATATAATT
GAAAAAGGGA TTAAGCCTCT GCTGAACTTA ATTCAAAAAG AGCATATTAG TCAAAGCTTT
ATATATACAT TAAAAAACGA AATGGAGTTA TTGGAAGGTC TAGAGGCTGG TGATATTCTG
GATGATGAAA TAATAAGATT AATGAAACGG AAAATTACTG TAGATGAAGA TGAGGAAAAA
GAAGATAGCA AAAGAAAGAA ACTGCAAAAA CAGACTGAAG CCAGTTTGAA AGTCTTACAG
CAGATTTATA AAAATAGATA CTGGATTGAG GATAATAAAG AGCCGAACAA TATCTACAAT
TTTATTAATT TACTGGAGAT TATATTTTTT ATAGGTAAGA GAGGTGAATA A
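
The gene sequence is printed in space-separated blocks of 10 bases, so it has to be joined before computing anything from it. The sketch below strips the whitespace and computes a GC fraction. The helper names are hypothetical, and only the first printed line is used as input, so the result is not expected to equal the GC content reported for the whole gene.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a sequence printed in blocks."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases among all bases in seq."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First printed line of the gene sequence only (60 bases).
first_line = "GTGAAAAAGA TATTTAAAGA ACACCCTTTA TCAGGGGAAC AATTGATAAA AACTTTTGAA"
seq = clean_sequence(first_line)
print(len(seq), f"{gc_fraction(seq):.2%}")
```

Passing the full sequence block to `clean_sequence` and then to `gc_fraction` gives the whole-gene value.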
 
Protein sequence
MKKIFKEHPL SGEQLIKTFE DYIPEISLKN KSLAIFTIGP VQSFISAAKK TKEYWAGSYL 
LSYLIWQALK HIMEESKEGE NSIIYPYLKN QPLYKDLRNE EITLPTLPNR FLAILENNEA
EQLLEECKKM VENKLLDIAI NLFNISKEDK IYKQLEELLE IYWVLLPLEK SKKTEIERYR
SLTGMEPAND ISHYSLVTSI IEELMGARKN IREFNYIEEK GVKCYICGER TGFEPGIMNI
EDEKKICGVC GLKRKFSDYL KTEFGIEIHY PSVIDIATAD YKEELVSSLD KDELNQLLNY
MDKEFGNQDY QMPELVNGLV GDVKSKSPEE KKVLKIKGTY FDIEGNELKE SIKVKKLLKK
INKKYNLKLN KYYALLYMDG DDMGKWVSGE KLKHREMTPE FHYLISKSLT NYSLKFVKRI
VEGKGRRGKL VYSGGDDVVA FVNLKNLFEV LRELRGYFSG NVFDNQVNYK ASDGIIREKG
EDILTLGSKA SVSMGICIAH YKEPLHFVIK NARAMEKKAK KHIISGNRKK NAFAIGLIKH
SGETREAVSK WFYDDGEDII EKGIKPLLNL IQKEHISQSF IYTLKNEMEL LEGLEAGDIL
DDEIIRLMKR KITVDEDEEK EDSKRKKLQK QTEASLKVLQ QIYKNRYWIE DNKEPNNIYN
FINLLEIIFF IGKRGE
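
The gene starts with GTG while the protein starts with M. That is consistent with translation table 11, assuming an alternative start codon in the first position is read as methionine. The sketch below translates the first 60 bases under that assumption. The codon table is the standard code written out in TCAG order, the set of start codons is listed from memory of the NCBI table 11 definition, and the function name is illustrative.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Assumed start codons for translation table 11 (bacterial, archaeal, plastid).
TABLE_11_STARTS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(seq: str) -> str:
    """Translate a CDS, reading a recognised start codon as M and stopping at '*'."""
    seq = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[i:i + 3]
        aa = "M" if i == 0 and codon in TABLE_11_STARTS else CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


first_line = "GTGAAAAAGA TATTTAAAGA ACACCCTTTA TCAGGGGAAC AATTGATAAA AACTTTTGAA"
print(translate_cds(first_line))  # MKKIFKEHPLSGEQLIKTFE
```

The output matches the first two blocks of the protein sequence in this record.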