Gene Gdia_0344 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGdia_0344 
Symbol 
ID6973738 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGluconacetobacter diazotrophicus PAl 5 
KingdomBacteria 
Replicon accessionNC_011365 
Strand
Start bp387014 
End bp387907 
Gene Length894 bp 
Protein Length297 aa 
Translation table11 
GC content58% 
IMG OID643389876 
ProductCRISPR-associated protein Cas1 
Protein accessionYP_002274755 
Protein GI209542526 
COG category[L] Replication, recombination and repair 
COG ID[COG1518] Uncharacterized protein predicted to be involved in DNA repair 
TIGRFAM ID[TIGR00287] CRISPR-associated endonuclease Cas1
[TIGR03639] CRISPR-associated endonuclease Cas1, NMENI subtype 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.0826483 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value0.0182764 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCATGGC GAGGCGTGCA TATCTCCCAC CCCTCCCGGT TGACGCATCG GAATCGGCAG 
CTCGTTGTTG CTCAGGATGG TGGCGAGGTA TCATTGGCGG TGGAGGACAT CGCGTGCCTT
ATCCTCGATA CGCGACAAGT GAGCATCACC GGGTCTCTTC TCTCTGCGCT TGCAGAAAAT
GGCGTTGCCA TGATCGTGCC CGATGCCAGG CATCATCCTG CCGGTATCCT GCTGCCTTTT
CACCAGCATC ATGCCCAGGC GCACATAGCA CATGCCCAGA TCTCGATCAG CCAACCATTG
AAGAAGCGCC TGTGGCAGAC ATTGGTCGTC GCCAAGATAC GTAATCAGGC TGCACTACTG
GACCAACTCG GCCGGCCGCA AGGACAAACG ATTGCAGCAA TGGCTGGACG GGTCGCTTCC
GGCGATCCGG GCAATGTGGA AGCACAGGCG GCCCGAGCTT ACTGGGCGAG CCTGTTTTCG
GATTTTACAC GCGCAAACGA GAATGATCGT CGTAATGCGT TGCTTAACTA TGGTTATGCG
ATCATGCGAG CCGCGATTGC ACGCGCATGC GTGGCGCTGG GATTGCTCCC AGCTTTCGGG
GTACATCACG CATCGAAAAC CAATGCGTTC AATCTCGTCG ACGATCTGAT CGAGCCGTTC
CGCCCCTTTG TGGACCGCAT GGCGCATGAC CGGGCTTTGG AACATGTAGG GGACACGCTG
TCTATCGAGG ATCGCCGTCA AATGTCGACG ATCCTCAATG ACAATGCGGC CATCGGTCGC
GAGCGAATGA CCGTCCTGGC CGCAACCGAA GCGGTAGCCA TGTCCGTGGT GCGCGCCATC
GAGCATGGCA GTGCCGCGCT TCTCTCGACT CCAACTCTGA AAGCCCGGGA TTGA
 
Protein sequence
MAWRGVHISH PSRLTHRNRQ LVVAQDGGEV SLAVEDIACL ILDTRQVSIT GSLLSALAEN 
GVAMIVPDAR HHPAGILLPF HQHHAQAHIA HAQISISQPL KKRLWQTLVV AKIRNQAALL
DQLGRPQGQT IAAMAGRVAS GDPGNVEAQA ARAYWASLFS DFTRANENDR RNALLNYGYA
IMRAAIARAC VALGLLPAFG VHHASKTNAF NLVDDLIEPF RPFVDRMAHD RALEHVGDTL
SIEDRRQMST ILNDNAAIGR ERMTVLAATE AVAMSVVRAI EHGSAALLST PTLKARD