Gene Huta_1453 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHuta_1453 
Symbol 
ID8383732 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhabdus utahensis DSM 12940 
KingdomArchaea 
Replicon accessionNC_013158 
Strand
Start bp1424450 
End bp1425529 
Gene Length1080 bp 
Protein Length359 aa 
Translation table11 
GC content61% 
IMG OID644972516 
ProductCRISPR-associated protein, Csh2 family 
Protein accessionYP_003130362 
Protein GI257052529 
COG category[L] Replication, recombination and repair 
COG ID[COG3649] Uncharacterized protein predicted to be involved in DNA repair 
TIGRFAM ID[TIGR01595] CRISPR-associated protein, CT1132 family
[TIGR02590] CRISPR-associated protein, Csh2 family 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCGAAC CCACACAGAC CGTCGAGAAC CGTTCCGAGA TCGTGTTCCT GTACGATGCC 
GTCGACGCGA ACCCGAACGG CAATCCGCTG AGCGGATCGA ACAGGCCGCG GATCGATCCC
CAGACCCAGC AGGCGATCGT CACGGACGTT CGGCTGAAGC GGTACCTCCG CGACCAGCTG
GACGACGACG GCCACGGCGT CTACATCCGG AACGTTCAGG AAGAGGGCAC GCAGTACACA
CGCGCCGAAC TCCTCGAAGA CAGGCTGAAG GCCGTCGATC CCGACGATTA CGACCTCGAT
GACGACGAGG CAGCGGCGCA GTTCCGGGAC GACGTCTTCG GGGAATACCT CGAAGAGAGT
GCCGACGTAC GCTACTTCGG CGCGACGATG TCTGTCGATA CTGACAATGC GTACGCGAAA
CATCTCCCGG ACCACTTCAC TGGCCCAGTT CAGTTCTCGC CAGGCAAGTC GATCCACGCT
GTCAACGAAA ACGAGGAATA TGACAGTCTC ACCAGCGTGA TCGCCACCCA GGAGGGGAAA
GAACAGGGTG GGTTCGACCT CGACGACCAC CGCATCCAGT ACGGGCTCAT TCGGTTCCAC
GGACTCGTCG ACGAACACGG GGCCGCCGAC ACGAATCTCA CGCGAGCGGA CGTAGAGCGT
CTCGATACAC TCTGTTGGCG GGCGATCAAG AACCAGACCA TCAGTCGGAG CAAAGTCGGC
CAGGAGCCAC GTCTCTACTG TCGCGTCGAA TACGGCGAGG AAAGCTACCA TCTCGGCGGC
CTGGACAAGG ATCTCACCCT TGACGACGAG GCCTCGAAGG ACCACGACGA ACTCCGAAAC
ATCCGCGATC TTACGCTGGA GATCGATGAT TTCGTGGATC GGATCTCGAA CGCCAGCGAC
CAGATCGAAC GCATCCGGGT GGTCGCGAGC GACGTCCTGG AACTCTCTCA CGGGACCGAC
AGCGGTGGGC CGGACCTCCT GTACGACGCG CTTCGAACGG CAATCGGACC GGACCGAGTC
GATGTCGTCG ATGTCTACGA CGAGTATCCT GAAACGCTGC CACAGAGCAC CGGCGAGTGA
 
Protein sequence
MSEPTQTVEN RSEIVFLYDA VDANPNGNPL SGSNRPRIDP QTQQAIVTDV RLKRYLRDQL 
DDDGHGVYIR NVQEEGTQYT RAELLEDRLK AVDPDDYDLD DDEAAAQFRD DVFGEYLEES
ADVRYFGATM SVDTDNAYAK HLPDHFTGPV QFSPGKSIHA VNENEEYDSL TSVIATQEGK
EQGGFDLDDH RIQYGLIRFH GLVDEHGAAD TNLTRADVER LDTLCWRAIK NQTISRSKVG
QEPRLYCRVE YGEESYHLGG LDKDLTLDDE ASKDHDELRN IRDLTLEIDD FVDRISNASD
QIERIRVVAS DVLELSHGTD SGGPDLLYDA LRTAIGPDRV DVVDVYDEYP ETLPQSTGE