Gene HMPREF0424_0765 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHMPREF0424_0765 
Symbolcas2 
ID8709646 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGardnerella vaginalis 409-05 
KingdomBacteria 
Replicon accessionNC_013721 
Strand
Start bp868034 
End bp868975 
Gene Length942 bp 
Protein Length313 aa 
Translation table11 
GC content38% 
IMG OID646482867 
ProductCRISPR-associated endonuclease Cas1, ECOLI subtype 
Protein accessionYP_003373989 
Protein GI283783235 
COG category[L] Replication, recombination and repair 
COG ID[COG1518] Uncharacterized protein predicted to be involved in DNA repair 
TIGRFAM ID[TIGR00287] CRISPR-associated endonuclease Cas1
[TIGR03638] CRISPR-associated endonuclease Cas1, ECOLI subtype 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.0747275 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAAGA AATTCGGTGC CAAAAAAGCT GAAATACCAG AGTTTCCACG TATAAGCGAC 
AGAGTGAGTT TTATATACGT AGAGCATGCG AAAATTAATC GTCTTGATAG TGCTGTTACA
GTCTTTGATG CAAATGGAAC TATAAGAGTT CCTGCTGCAA TGATTGGCGT TTTACTTTTA
GGTCCTGGCA CTGAGATTAC TCACAGAGCT ATGGAATTAC TTGGAGATGT TGGCGCAAGC
ATAGTTTGGG TTGGCGAGCA TGGTGTTCGC AATTACGCTC ATGGTAGAGC ATTGTCAAGA
AGCTCAAGAT TATTGGAAAA ACAATCAAAA CTTGTTACTA ATAGTAGATC AAGATTGAAC
GTTGCGAGAA AAATGTATCA AATGCGTTTC CCTAATGAAA ACGTATCGTC ATATACTTTG
CAACAATTGC GAGGGAGAGA GGGCGCTCGT GTTAGACATT TATACAGGGA AATGTCTAAC
AAATACAATG TCCAATGGAA TGGTCGCGAT TATAAAGTCA ATGATTTTGA AAGTGGCACT
GTTGTTAATA AAGCATTGTC TGTAGGTAAT GTATGCCTTT ACGGTTTAGT ACATAGCATA
ATATCTGCTT TAGGTTTAGC ACCTGGATTA GGTTTCGTGC ATACTGGACA TGATCTTTCT
CTTGTATACG ATATTGCAGA CTTATATAAA GCTGAATTAA CTATTCCTGC ATCTTTTGAA
ATTGCTGCTC GGTGTGAATC CGACGATGAC ATTGAACAGT TAATGCGTTT GAAGATGCGC
GATTGCTTCG CAAATTGCAA TATAATGTCT CGTATTGTTA ATGATATACA AAATCTTCTT
GAAATCCCGA TTGACGACCA AATTACTGTT GATGTAATAC ATCTTTGGGA CGATAAGGAA
CTTCTAGTAG CTTCTGGCGT AAATTACAGT GAGGTGAATT GA
 
Protein sequence
MKKKFGAKKA EIPEFPRISD RVSFIYVEHA KINRLDSAVT VFDANGTIRV PAAMIGVLLL 
GPGTEITHRA MELLGDVGAS IVWVGEHGVR NYAHGRALSR SSRLLEKQSK LVTNSRSRLN
VARKMYQMRF PNENVSSYTL QQLRGREGAR VRHLYREMSN KYNVQWNGRD YKVNDFESGT
VVNKALSVGN VCLYGLVHSI ISALGLAPGL GFVHTGHDLS LVYDIADLYK AELTIPASFE
IAARCESDDD IEQLMRLKMR DCFANCNIMS RIVNDIQNLL EIPIDDQITV DVIHLWDDKE
LLVASGVNYS EVN