Gene Hhal_0619 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHhal_0619 
Symbol 
ID4711443 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhodospira halophila SL1 
KingdomBacteria 
Replicon accessionNC_008789 
Strand
Start bp697188 
End bp698150 
Gene Length963 bp 
Protein Length320 aa 
Translation table11 
GC content71% 
IMG OID639855083 
ProductCRISPR-associated Cas1 family protein 
Protein accessionYP_001002206 
Protein GI121997419 
COG category[L] Replication, recombination and repair 
COG ID[COG1518] Uncharacterized protein predicted to be involved in DNA repair 
TIGRFAM ID[TIGR00287] CRISPR-associated endonuclease Cas1 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.0887113 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGGGCACCC TCTACATCGA CCGGCGCCGG ACGCGCCTGG AGCTCGCGCA CAAGGCGCTC 
ACCATTCGGG AACCGGAGGC CCAGCCCCGC TCGGTGCCGC TGAGCCTCAT CGACCGACTG
ATCGTCATTG GCCAGGTCGA GCTGAGCAGC GGCGTGCTCA CTACTCTCGC CGAGAGCGGC
GTCAGCCTGG TCTTCATGCC GAGCCGTGGA CAGCGGCGCA GCGCCTTCCT CCGCAGCGAG
GGCCATGGCG ATGCCGTCCG CCGCCTCGGC CAGTACAGGC TCATCCACCT CGAGGCTGAG
CGCCAGGCCT GGGCGCGCCG CCTCGTGCGA CTGCGTCTGG CCGGGCAGCA GCGGCTCCTC
GCGAGTGCGC TATACCGGCG TCCTGATCAA CGCCAGCCGC TCACGGCTGC CCACCGCGAG
ATCGAGGCGG CCCAGGCGAC CGTGCGCCGC GAGGCGCCCG CCGGTGAGCA ACTGCGGGGG
CAGGAGGGTA CGGCCGCGGC GGCCTTCTTC CGCGGCTACG GCGCTCTCTT CGCCGAAGCG
CTAGGCTTCT CCGGGCGAAA TCGCCGGCCA CCCCGGGATC CCGTCAACGC CGTCCTCTCG
CTCGGCTACA CCCTCGCGCA CGGCGATGCA CTGCGGGCCG TCACCGCTGC CGGCCTCGAT
CCGGCCATCG GCGTACTGCA CGAGCCTGCC TGGGGGCGAG ACTCCTTGGC CTGCGATCTC
ACGGAGATCG CCCGGGCCCG GGTGGAGCGG CTGACCTGGG AGCTATTCGC GAGCGAGACG
CTCCAGCGCA CGGACTTCAC CAACAGCACC GAGGGCGTAC GACTAGGCAA GGCTGCACGG
CAGACCTTCT TCGGCTGCTG GGAACGCCAT GCCGGGCTCC ATCGACGCTG GCAGCGCCGC
GCCGCTCAGG CCCTAGCCGC CGAGTGCGCC CACCACGGCG CCCAAACTAT TCCCGAGGCG
TAG
 
Protein sequence
MGTLYIDRRR TRLELAHKAL TIREPEAQPR SVPLSLIDRL IVIGQVELSS GVLTTLAESG 
VSLVFMPSRG QRRSAFLRSE GHGDAVRRLG QYRLIHLEAE RQAWARRLVR LRLAGQQRLL
ASALYRRPDQ RQPLTAAHRE IEAAQATVRR EAPAGEQLRG QEGTAAAAFF RGYGALFAEA
LGFSGRNRRP PRDPVNAVLS LGYTLAHGDA LRAVTAAGLD PAIGVLHEPA WGRDSLACDL
TEIARARVER LTWELFASET LQRTDFTNST EGVRLGKAAR QTFFGCWERH AGLHRRWQRR
AAQALAAECA HHGAQTIPEA