Gene Haur_2234 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_2234 
Symbol 
ID5734121 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp2840994 
End bp2842046 
Gene Length1053 bp 
Protein Length350 aa 
Translation table11 
GC content51% 
IMG OID641279375 
ProductCRISPR-associated Cas1 family protein 
Protein accessionYP_001545002 
Protein GI159898755 
COG category[L] Replication, recombination and repair 
COG ID[COG1518] Uncharacterized protein predicted to be involved in DNA repair 
TIGRFAM ID[TIGR00287] CRISPR-associated endonuclease Cas1 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.982085 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAAACAC TCTACCTTTC GGAGCAATAT AGCATTGTCA AACGCGAAGG CGAGGCCTTG 
CGCGTCGAGA TTCCCGAAGA TCAACAACTT GGTCGCCAAC GTCAGGTTGT GCGAGTACCA
TTAAACGTGA TTGAGCGGGT GGTAGTGCAG GGCGAAATCA CCCTAACTGC CTCGGCATTA
GCCTGCTTAT TGGAGCGACG CATTTGCACC CATTTTTTGA GCTACAGCGG ACGTTCCCAA
GGAGCACTAA CGCCTGATCC GACGCGTAAT GCAAGCCTGC GTTTAGCTCA ATATGCCGCG
CATACCAGCA TCCAACATCG ATTTAGCCTT GCACGAACCT TTGTCGATGG GAAATTGCGC
AATTTACGCA CCCAAATTTT GCGTTTCAAT CGTTCGCAGC GTGAGCCAAC TCTGACCCAA
GCGATCGAGC GTTTACGCGA TGCCCATCGC GATCTCCATG GATTAAGCAT TCCAGAGTAT
GTTGACCCGC TTGATCGCAT GCATGGAATG GGCCAGATTT TGGGCTGCGA AGGGCAAGGA
AGCGCCGCCT ACTGGGATTG TTGGGGAATG TTGCTCAATC AGCCGTGGGA GTGGCATGGC
CGTCGTCGTC GCCCACCGCC TGATCCAGTC AATGCCCTGT TATCGTATGG CTACGTGATT
CTGACCAGTC AAGTTTTGAG CCAATTAGCG ATTGTGGGCT TTGATCCCTA CATCGGCTTT
TTGCATCAAT CGAGTTTTGG CAAACCAGCC TTAGCACTTG ATCTCATGGA AGAATTTCGC
CCAGTGATCG TTGATTCAGT AGTTTTGACC GTGCTTAACA CCAAAATTCT GAACCAGCAG
CATTTTCAAC GTGAGCCTGG GAGCGTGCAA CTAAGCAAAG AAGGCCGTAA ACTCTTTCTG
ACCAAGCTCG AAGAACGCTT CAGTAGTGAA ATCCAACACC CAATTTTTGG CTATCGGGTG
AGCTATCGAC GCTGCATCGA ACTCCAAGCG CGGCTGCTTG CCAAAGCCCT GATGGGCGAG
ATTCAGCACT ATATTCCATT TCTCGTGAGG TGA
 
Protein sequence
MQTLYLSEQY SIVKREGEAL RVEIPEDQQL GRQRQVVRVP LNVIERVVVQ GEITLTASAL 
ACLLERRICT HFLSYSGRSQ GALTPDPTRN ASLRLAQYAA HTSIQHRFSL ARTFVDGKLR
NLRTQILRFN RSQREPTLTQ AIERLRDAHR DLHGLSIPEY VDPLDRMHGM GQILGCEGQG
SAAYWDCWGM LLNQPWEWHG RRRRPPPDPV NALLSYGYVI LTSQVLSQLA IVGFDPYIGF
LHQSSFGKPA LALDLMEEFR PVIVDSVVLT VLNTKILNQQ HFQREPGSVQ LSKEGRKLFL
TKLEERFSSE IQHPIFGYRV SYRRCIELQA RLLAKALMGE IQHYIPFLVR