Gene Hore_15210 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHore_15210 
Symbol 
ID7313114 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalothermothrix orenii H 168 
KingdomBacteria 
Replicon accessionNC_011899 
Strand
Start bp1625089 
End bp1626066 
Gene Length978 bp 
Protein Length325 aa 
Translation table11 
GC content32% 
IMG OID643611963 
ProductCRISPR-associated protein Cas1 
Protein accessionYP_002509265 
Protein GI220932357 
COG category[L] Replication, recombination and repair 
COG ID[COG1518] Uncharacterized protein predicted to be involved in DNA repair 
TIGRFAM ID[TIGR00287] CRISPR-associated endonuclease Cas1 


Plasmid Coverage information

Num covering plasmid clones39 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAGTTGG TTATCAACAC ATACGGTAGT TACCTTCATG TAAAACAAAA ATCGTTTGAA 
ATAAAGACTG AAGAGGATAA AAAGAGGGTT TCTGCTAAAA AGGTTAGTTC AATATTAATT
ACCACAGGTG CTGCCATCAG TACAGATGCA GTTAAACTGG CCCTGGAAAA TAATATTGAA
ATACAGTTTC TGGATGAGTT TGGTTGTTCG TTAGGAAAAG TCTGGCACCC TAAACTTGGT
AGTACTACTT ATATTAGAAG GAAACAGCTT GAGCTGGCAG AAAGTGAAGA AGGTACTGAA
CTGGTAAAAG AATTTATGCT TGATAAAATA GATAATATGA TTAACCATTT ACATGATCTG
GCTATAAAAC GTTCTAAATC AAAAGAGAAA TATATAAATA AGAAGATAAA AGAGATCTGT
GAATTGCGTA ATAAATTAGA AAAAGTTACA GGGTATATTG AAGATGTAAG AAATACTATA
ATGGGGTATG AAGGTAATAT ATCCAGGAAG TATTTTGCCA GCTTAAGTTT TCTTTTGCCA
GATAGATATA AGTTTAATGG CAGGAGTTTC AGACCTGCTG AGGATGAATT TAATTGTTTG
CTGAATTATG GTTATGGTGT ATTATATGGC AAAGTAGAAA AAGCATTAAT TATTGCAGGG
TTAGATCCTT ATGTTGGCAT TCTACATACT GATGGTTATA ATAAAAAGTC TTTTGTCTTT
GATTTTATTG AACCCTACCG ACACCATATA GACAGAGTAG TAATGAAGTT ATTTAGTAGA
AAAAAAATCC GTAAGTTACA TTTTGATAAA ATTCAGGGAG GATTAACTCT TAACGATGAA
GGAAAAAAAT TGCTTCTTAC AGAATTAAAT GATTATTTTG ATAAAAAAAT TAGATATAAG
GGGAGAGAGA TAAAAATTAA TAATACGATT CAGTATGATT GTCACTCCCT GGCCAACAGA
ATTATAGAGG AAGGTTGA
 
Protein sequence
MQLVINTYGS YLHVKQKSFE IKTEEDKKRV SAKKVSSILI TTGAAISTDA VKLALENNIE 
IQFLDEFGCS LGKVWHPKLG STTYIRRKQL ELAESEEGTE LVKEFMLDKI DNMINHLHDL
AIKRSKSKEK YINKKIKEIC ELRNKLEKVT GYIEDVRNTI MGYEGNISRK YFASLSFLLP
DRYKFNGRSF RPAEDEFNCL LNYGYGVLYG KVEKALIIAG LDPYVGILHT DGYNKKSFVF
DFIEPYRHHI DRVVMKLFSR KKIRKLHFDK IQGGLTLNDE GKKLLLTELN DYFDKKIRYK
GREIKINNTI QYDCHSLANR IIEEG