Gene Hhal_0628 details


Gene Information

Locus tag: Hhal_0628
Symbol:
ID: 4709442
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorhodospira halophila SL1
Kingdom: Bacteria
Replicon accession: NC_008789
Strand:
Start bp: 706750
End bp: 707901
Gene Length: 1152 bp
Protein Length: 383 aa
Translation table: 11
GC content: 73%
IMG OID: 639855092
Product: CRISPR-associated Cmr3 family protein
Protein accession: YP_001002215
Protein GI: 121997428
COG category: [L] Replication, recombination and repair
COG ID: [COG1769] Uncharacterized protein predicted to be involved in DNA repair (RAMP superfamily)
TIGRFAM ID: [TIGR01888] CRISPR-associated protein, Cmr3 family
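
The coordinate and length fields in this record are related by simple arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the terminal stop codon. Both assumptions are consistent with the listed numbers but are not stated by the record itself. The function names are hypothetical and chosen for illustration only.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 706750
END_BP = 707901


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count for a CDS, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_fraction(seq: str) -> float:
    """Fraction of G and C among the A/C/G/T characters of seq.

    Whitespace and any other characters are ignored, so a sequence
    printed in space-separated blocks can be pasted in directly.
    """
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no A/C/G/T characters")
    return sum(b in "GC" for b in bases) / len(bases)


if __name__ == "__main__":
    length = gene_length(START_BP, END_BP)
    print(length, protein_length(length))
```

Under these assumptions the coordinates give 1152 bp and 383 aa, matching the Gene Length and Protein Length fields. `gc_fraction` can be applied to the gene sequence listed in the Sequence section to compare against the reported GC content.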


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value: 0.75883
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCCGAGT ACCGCTACAT CGAGCCCCAG GACGTCCTCT TCTTCCGCGG CAACCGCCTC 
TTTGGCGAGC CCGGCAACGC CGGCGCCGCC CTCATGCCGC CCTGGCCCTC GGTCTTCGCC
GGGGCCCTGC GCAGCGCCAT GCTCAGCGCC GCCGGCGCGG ACCCGGCGCA GCTGCGCAGC
GGCGAGCTGC CTGCCCCGCT CGACACGGTG CTCGGCACCC CCGAGGAGCC GGGGACCTTC
ACCCTCACCG GCGTGACGCT GGCTCGCCGG CAGTCGTCCG GCACCGCCGA GCCCCTCCAC
CCCCTGCCGG CGGACCTGAG CGTCGAGCGT GATGAGGCCA CAGGCGAGTG CGCGGTCCAT
CGCCTGACGC CGCAGCCACT GCCCGCCGGC GTCGCCAGCA GCCAACCCCT GGAGCGCCTG
CCGGTGCTCC GGCGCAGCGA CCGCGGTAAG CCGGCCGCCG GCTACTGGCT CACCCATAGC
GGCTGGCAGC GCTACTGCCA GGGCGAGCCC CCACCGGCTG AGGCCCTCGT TCACCGCTCG
GCGATCTGGA GCAGCGACCC GCGCCTGGGC ATTGCCCTGA AGCCCGAGCA GCGCACGGCG
GCCGAGAGCC AGCTCTACAC CACCGAGGGC ATCCGGCTGT GCGAGGGATA CGGCTTTCTC
GCCGCCATCG CCGGCTCCGA CCCGACCAGC CTGCCTAAGC AGGCAACCCT GCGCCTCGGC
GGCGACGGCC GCGGCGCGTT CATGAGCGCC GTCGCAGCCC CGGTCGAAAC CTCGCCGGAG
CCGGCCGCCA TCGAGGCAGA GGGCCGCCTA CGGATCGTAC TCACCACGCC CGGCATCTTC
CCGGGGGGGT GGCAACTCCC CGGGCTCGAC GCCAATGGCC TCTGGCACTA TCCCGGCGGC
CGGGCGCGGC TGGTCGCCGC TGCTGTGCCC CGCGGCCAGG TCATCAGCGG CTGGGATCTG
GCTCACCACC GGCCCAAACC CGCCCAACGT GCGGCCCCAG CCGGGACCGT CTACTGGCTC
GAGGCGCTTG AGGGCGGGCT CAAGCCGCTT CGCAAGCTTG CCGAATCCGG TCTCTGGGGC
ATGACCCCGG AGAATGAAGA CCCAGCACGG CGAGCCGAGG GCTTCAACCG CTTCGCCATC
GCCAACGCCT GA
 
Protein sequence
MAEYRYIEPQ DVLFFRGNRL FGEPGNAGAA LMPPWPSVFA GALRSAMLSA AGADPAQLRS 
GELPAPLDTV LGTPEEPGTF TLTGVTLARR QSSGTAEPLH PLPADLSVER DEATGECAVH
RLTPQPLPAG VASSQPLERL PVLRRSDRGK PAAGYWLTHS GWQRYCQGEP PPAEALVHRS
AIWSSDPRLG IALKPEQRTA AESQLYTTEG IRLCEGYGFL AAIAGSDPTS LPKQATLRLG
GDGRGAFMSA VAAPVETSPE PAAIEAEGRL RIVLTTPGIF PGGWQLPGLD ANGLWHYPGG
RARLVAAAVP RGQVISGWDL AHHRPKPAQR AAPAGTVYWL EALEGGLKPL RKLAESGLWG
MTPENEDPAR RAEGFNRFAI ANA