Gene Hore_15140 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHore_15140 
Symbol 
ID7313107 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalothermothrix orenii H 168 
KingdomBacteria 
Replicon accessionNC_011899 
Strand
Start bp1616115 
End bp1617296 
Gene Length1182 bp 
Protein Length393 aa 
Translation table11 
GC content33% 
IMG OID643611957 
ProductCRISPR-associated protein, Cmr3 family 
Protein accessionYP_002509259 
Protein GI220932351 
COG category[L] Replication, recombination and repair 
COG ID[COG1769] Uncharacterized protein predicted to be involved in DNA repair (RAMP superfamily) 
TIGRFAM ID[TIGR01888] CRISPR-associated protein, Cmr3 family 


Plasmid Coverage information

Num covering plasmid clones48 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGCAGGTAA CTGTTAGTCC GCTGGATGTA TTATTTTTTA GGAATGGGAA ACCTTTTGAT 
GCTGATGACA GTCCCATAGG TGAAACTATA GATATGCCCT ATCCCTCTAC TTTTTACGGG
GCTTTCCGGA GTAGAGTGTT GCTGGATAAT AGCGAAAGAT ACTTTGAATT TCTAGAGGGA
AAAGCAGGGG AAATAACTGA AGTAATTGGA AGTCCTGATT TTAAAGGTTC CCTTAAAATT
AACTTTTTTT CTTTGATTAA GGAAGATAAG GTCTTTAAGG ATATTTTATT GCCATTACCG
CAGGATATGG TTGTGAAAAA AGGGGATAAA AGTTCTGGAC TGTTACATTT AAGATTTGTT
TCAAAAAAAA GCTGGATTAA AATGAATAAC TCTCTTTCTC ACCTTTTAAT TAATCCTGTA
AGCAAACAGG TGGAGTGGCC CGGACCAGCT TATATTAAAA TAAAAGATCT TGAATATTAT
TTGAACAATG AACTTGAGGA TGCAGAAGTA AAAGTCTTCG ATAGAATGAA TGATATTTTC
GATAAAGAGT ATCGAACAGG GATAGAAATA GATAATGTAA CAAAATTAGC CAAGGAAAAA
AAGCTGTACC GAAGAGAAGT ATTGAGATTT AAAAACAACA GAGATAAGTC TTATAGTTTT
TTTCTTGAAT TAACTGGAGA TAAAGGACTG CTATCCGAAA GTGGTTTATT AAAATTAGGG
GGTGAGCAAA AGGCAGCAGA GTATAGGAAA GTTAAAGATG TCAGTAACAA AATAGAACTA
TATGCTTCAA CTAAAAAAAG AATTTTAAAA AGCAAAAAGT TCAAAATTTA TCTTTCTACT
CCGACTGTCT TTAAAAGAGG ATGGTTACCT GAATGGATTA ATCCTGATGA CTTTACAGGC
AAATTACCTG CAAGTGGTAT CAGGGTAAAG TTATTAACTG CAGCTGTAGG AAAACATAAA
ATAGTGAGTG GCTGGGATAT GGCTAAAAAG ACTGACAAAA ATAAAAGGGG TAAAGCAAAA
ACAGGTTTTA GGGTAGTACC TGAAGGTAGC CTTTATTATT TCCAGATACT GGATAAAAAA
TTTGATATTG AGGAATTAAT AAATGAATTA CATGGACAAT CTATTTCTGA TTTAAAAAGC
AAAGAAGGTT TTGGTATCTC ATTTATAGGA GGGATTAAGT AG
 
Protein sequence
MQVTVSPLDV LFFRNGKPFD ADDSPIGETI DMPYPSTFYG AFRSRVLLDN SERYFEFLEG 
KAGEITEVIG SPDFKGSLKI NFFSLIKEDK VFKDILLPLP QDMVVKKGDK SSGLLHLRFV
SKKSWIKMNN SLSHLLINPV SKQVEWPGPA YIKIKDLEYY LNNELEDAEV KVFDRMNDIF
DKEYRTGIEI DNVTKLAKEK KLYRREVLRF KNNRDKSYSF FLELTGDKGL LSESGLLKLG
GEQKAAEYRK VKDVSNKIEL YASTKKRILK SKKFKIYLST PTVFKRGWLP EWINPDDFTG
KLPASGIRVK LLTAAVGKHK IVSGWDMAKK TDKNKRGKAK TGFRVVPEGS LYYFQILDKK
FDIEELINEL HGQSISDLKS KEGFGISFIG GIK