Gene Avin_31580 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAvin_31580 
Symbolcas1 
ID7762058 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAzotobacter vinelandii DJ 
KingdomBacteria 
Replicon accessionNC_012560 
Strand
Start bp3266829 
End bp3267869 
Gene Length1041 bp 
Protein Length346 aa 
Translation table11 
GC content63% 
IMG OID643806032 
ProductCRISPR-associated protein Cas1 
Protein accessionYP_002800296 
Protein GI226945223 
COG category[L] Replication, recombination and repair 
COG ID[COG1518] Uncharacterized protein predicted to be involved in DNA repair 
TIGRFAM ID[TIGR00287] CRISPR-associated endonuclease Cas1
[TIGR03640] CRISPR-associated endonuclease Cas1, DVULG subtype 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGCCGGC AACTCAACAC CCTGTATGTC ACCTCGGAAG GTGCCTGGCT GCGTAAGGAT 
GGCGCAAACA TCGTCATGGA AATCGCCGGT GAAATTCGCG GCCGGATTCC GGTACACATG
CTGGAAAGCC TGGTCTGCAT CGGCCGGGTG CTAGTTTCCC CACCTTTGCT GGGCTATTGC
GCCGAGCAGG GCATCCGCAT CAGTTACCTG ACCCCCAATG GCAAATTCCT AGCCCGCGTC
GAAGGGCCGG TATCCGGTAA TGTGTTGCTG CGTCGCGAGC AGTACCGGCG CAGTGACGAT
CCGGCAGGCA GTGCCGCCAT CGTCGCCAAT CTGCTGGTCG GCAAGGTGTA CAACCAGCGC
GCCGTGATCG GCCGCGCCCT GCGTGATCAT GGCGACAAGC TGGCAGAGGA GGCGCGGATT
GCCCTGAACA TCACCCACAA GCGGTTGATG CGGATTGCCG ACCGCTTGTT GCTGGAGCAG
GACGTGGATG TGCTGCGTGG GTTGGAAGGG GAGGCAGCCC AGGCCTATTT CGGCGTGTTC
GATCACCTGA TCCGTGTGCC GGATGCCTCC TTGCGCTTCA GTGGCCGCAG CCGCCGGCCA
CCGCTGGATG CGGTCAATGC CCTGCTGTCG TTTCTCTACA CCCTGCTGAC CCACGACTGC
CGTTCGGCAC TGGAAACGGT CGGCCTCGAT CCGGCGGTCG GCTACCTGCA CCGCGACCGG
CCAGGCCGGC CGAGTCTGGC GCTGGATCTG GTCGAGGAGT TCCGTCCGGT ACTGGCCGAC
CGGCTGGCGC TGTCGCTGCT CAACCGGAGG CAACTGTCCG CTGCCGACTT CCAGACGCTG
GATAACGGTG CCGTGCTGCT GCGCGACGAA GCGCGCAAGC AGGTACTGAC AGCGTATCAG
GAGCGCAAAC GCGAAGAGCT GCGGCATGCC TTTCTGGAAG AGCAGGCTCC GCTCGGTTTG
TTCATGTTCA TCCAGGCGCA ACTGCTGGCC CGCCACCTGC GCGGCGATCT GGATGCCTAC
CCACCCTTCA TCTGGAAGTG A
 
Protein sequence
MRRQLNTLYV TSEGAWLRKD GANIVMEIAG EIRGRIPVHM LESLVCIGRV LVSPPLLGYC 
AEQGIRISYL TPNGKFLARV EGPVSGNVLL RREQYRRSDD PAGSAAIVAN LLVGKVYNQR
AVIGRALRDH GDKLAEEARI ALNITHKRLM RIADRLLLEQ DVDVLRGLEG EAAQAYFGVF
DHLIRVPDAS LRFSGRSRRP PLDAVNALLS FLYTLLTHDC RSALETVGLD PAVGYLHRDR
PGRPSLALDL VEEFRPVLAD RLALSLLNRR QLSAADFQTL DNGAVLLRDE ARKQVLTAYQ
ERKREELRHA FLEEQAPLGL FMFIQAQLLA RHLRGDLDAY PPFIWK