Gene Huta_1449 details


Gene Information

Locus tag: Huta_1449
Symbol:
ID: 8383728
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorhabdus utahensis DSM 12940
Kingdom: Archaea
Replicon accession: NC_013158
Strand:
Start bp: 1419487
End bp: 1420479
Gene Length: 993 bp
Protein Length: 330 aa
Translation table: 11
GC content: 58%
IMG OID: 644972512
Product: CRISPR-associated protein Cas1
Protein accession: YP_003130358
Protein GI: 257052525
COG category: [L] Replication, recombination and repair
COG ID: [COG1518] Uncharacterized protein predicted to be involved in DNA repair
TIGRFAM ID: [TIGR00287] CRISPR-associated endonuclease Cas1
            [TIGR03641] CRISPR-associated endonuclease Cas1, HMARI/TNEAP subtype
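
The coordinates and lengths listed above can be cross-checked against each other. The following is a minimal sketch, assuming 1-based inclusive coordinates and a single terminal stop codon; the function name `cds_lengths` is hypothetical and not part of any database API.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumption: coordinates are 1-based and inclusive, and the CDS ends
    with exactly one stop codon that is not counted in the protein length.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Number of codons minus the terminal stop codon.
    protein_len = gene_len // 3 - 1
    return gene_len, protein_len


# Values taken from the record: Start bp 1419487, End bp 1420479.
gene_len, protein_len = cds_lengths(1419487, 1420479)
print(gene_len, protein_len)  # 993 330
```

Under these assumptions the result agrees with the listed gene length of 993 bp and protein length of 330 aa.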


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAACGACA ACTACCACAT CTTCTCGGAC GGACGGGTCG AGCGCCACAA CGACACGGTA 
CGGCTCGTCA CCGAGGACGA CGAGAAGAAG TACCTCCCGA TCGAGAACGC CGAGGCGCTG
TACCTCCACG GGCAGATCGA CTTCAATACG CGAGTGATCT CGTTTCTCGA CGATCACGGT
GTCGCGATGC ACGTCTTCGG CTGGAACGAC TACTATTCGG GCTCGATCAT GCCTGAGCGT
GGCCAGACGT CGGGTCAAAC GGTCGTCGAA CAGGTCCGGG CCTACGACGA CGAAGCCCAT
CGTGGCAACA TCGCCCGTGA GATCGTGGCG GGAAGCATCC ACAACATGCG GGCGAACGTC
ACCTATTACG ACAACCGGGA CTACGACCTC AGCGCGACAC TCGAGTCACT CGATCGTCGA
CGTGACGAGA TCAAGTCCGT GGCGTCAGTC GAGGAAGCAA TGGGCGTCGA AGCGAGTGCG
CGACGTGCCT ACTACGCGAT CTTCGATCAG ATTCTTCCCG ACGCCTTCGT CTTCGGGGGT
CGAAAGTACA ATCCGCCAAA CAACAAGGTG AACAGCCTGA TCTCGTTCGG GAACAGCCTG
GTCTACGCGA ATATCGTCTC GGCAATCCGG GCCACGGCAC TCGATCCGAC GATCAGTTAT
CTCCACGAGC CTGGCGAGCG ACGGTACTCA CTGGCGCTGG ATCTTGCCGA TCTCTTCAAA
CCCGTCCTGA CTGATCGCGT CGTCTTCCGA CTCGTGAATC GAGGGCAGTT GTCCGACGAT
GATTTCGATT CGGAGATGAA CGCGTGTCTG CTCACCGAGA GTGGCCGGGA GACGTTCTCG
AAGGAGTTCG AGCAGACGCT CGATCGGACG ATCGAACACC CAAATCTCAA CCGGAAGGTC
AGTTATCAGT ATCTCCTTCG GGTCGAGGCG TACAAACTCA AGAAGCACTT GTTGACTGGC
GAGTCCTACG AGTCCTTCGA GCGGTGGTGG TAA
 
Protein sequence
MNDNYHIFSD GRVERHNDTV RLVTEDDEKK YLPIENAEAL YLHGQIDFNT RVISFLDDHG 
VAMHVFGWND YYSGSIMPER GQTSGQTVVE QVRAYDDEAH RGNIAREIVA GSIHNMRANV
TYYDNRDYDL SATLESLDRR RDEIKSVASV EEAMGVEASA RRAYYAIFDQ ILPDAFVFGG
RKYNPPNNKV NSLISFGNSL VYANIVSAIR ATALDPTISY LHEPGERRYS LALDLADLFK
PVLTDRVVFR LVNRGQLSDD DFDSEMNACL LTESGRETFS KEFEQTLDRT IEHPNLNRKV
SYQYLLRVEA YKLKKHLLTG ESYESFERWW
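
The relationship between the gene sequence and the protein sequence above can be illustrated with a short translation sketch. It is a minimal example, not the database's own method: the helper names (`clean`, `translate`, `gc_fraction`) are hypothetical, and the codon table is built from the standard codon assignments, which translation table 11 shares for internal codons (table 11 differs in its permitted start codons, which does not matter here because the gene starts with ATG).

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
# Assumption: table 11 uses these same assignments for internal codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the 10-letter block layout."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First three 10-base blocks of the gene sequence from the record.
first_codons = "ATGAACGACA ACTACCACAT CTTCTCGGAC"
print(translate(first_codons))  # MNDNYHIFSD
```

The printed peptide matches the first block of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow the same comparison against the complete protein and the listed GC content.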