Gene HMPREF0424_0766 details

Gene Information

Locus tag: HMPREF0424_0766
Symbol: cas1
ID: 8709011
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Gardnerella vaginalis 409-05
Kingdom: Bacteria
Replicon accession: NC_013721
Strand:
Start bp: 868977
End bp: 870056
Gene Length: 1080 bp
Protein Length: 359 aa
Translation table: 11
GC content: 30%
IMG OID: 646482868
Product: CRISPR-associated endoribonuclease Cas2
Protein accession: YP_003373990
Protein GI: 283783236
COG category: [L] Replication, recombination and repair
COG ID: [COG2176] DNA polymerase III, alpha subunit (gram-positive type)
TIGRFAM ID: [TIGR00573] exonuclease, DNA polymerase III, epsilon subunit family
            [TIGR01873] CRISPR-associated endoribonuclease Cas2, E. coli subfamily
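
The length fields above are consistent with the listed coordinates if Start bp and End bp are read as inclusive positions: the gene then spans End - Start + 1 bp, and a complete CDS of that length encodes one amino acid fewer than it has codons, because the final codon is a stop. A minimal Python sketch of that arithmetic follows. The inclusive-coordinate convention is an assumption (the record does not state it), and the function names are illustrative.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming inclusive start and end coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Amino acids encoded by a complete CDS, excluding the terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Coordinates taken from the Start bp and End bp fields of this record.
length = gene_length(868977, 870056)
print(length, protein_length(length))  # prints: 1080 359
```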


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.0695815
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCGTTAA CTGTGATTAC GATGACGAAC TGTCCACTAT CTTTAAGAGG TGATTTAACT 
AAGTGGATGC AAGAAATAGC ATCAGGCGTT TACGTCGGCA ATTTTAATAG TCGAGTCCGA
GAAGAATTAT GGAAGCGTAT AGAAGATAGT GTTGGCAATG GTGCTGTAAC AATGAGCTTT
TCTTCTAGAA ATGAAATTGG CTATGATTTT AAAACTATCC ATTCGCATCG TGAAGTGGTT
TATTCAGATG GATTGCCTCT AGTACGTATT CCAACAGTTG ATACACTAGA AAACGATATT
AAACATGGAT TTAGCGATGC AGCACATTTT CATAATGCAA AGAAATTTTC AAATATAAGG
CGTAAAAATA TTAATACTCC TTCTAGCACA ATTGAAAATA ATCTTGCTAA TGAAAATATT
AGTTATCTAA GTTATGATAA TGCTTCTCAA ACTGCAAAAT TTGCTACGGA TAATGAAGAG
ATAACTTGTA CCGATTACTG GAAGGATTTT ATAGATAAAA TAGATAAAAA TTTTGTTGTA
ATAGATTGCG AAACTACAGG TTTAAATAGC AAAGTTGATA AAATCATTGA AATTGGGGCC
GTTAAGTTCA TTGATGGAAA AATTGAAGAA TTCCAGAAAT TAATAAATAT TAATTGCGAT
AGCGTTAGTA ATTTTAACAA AAATAATTTT ATTCCTAATG AAATTATTGA GTTAACTGGT
ATTACATCCA ACATGCTTGA ATCTTATGGT GTCAGATTAG ATTTAGCATT AAAAGAATTT
ATTAATTTTG TCGAAAGATT ACCTGTACTT GGATATAACG TACAATTTGA TATGTCATTT
TTAAATTCTG CTTTATCTTC ATTAGATGAA AGATTTGATC ATTCTCTTTC AATTAACAAA
ATTTTTGATA TTTCTCGTTT TGTAAAGAAA GAAAAGAAAT TTTTAAAAAA CTACCAATTA
AAAACAGTGC TGCACGAATA CAACATTGCT GAATCTGTTC CACATAGAGC TCTTCTGGAT
GCTAGACTAA CAGCGCATCT TGTTTTTAAT TTGACAGAAT TGTTAAAATT TCTTGCATAA
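
The gene sequence is displayed in blocks of ten bases, so the spaces and line breaks have to be stripped before anything is computed from it. The sketch below shows one way a figure such as the GC content field could be recomputed from the pasted text. The helper names are illustrative, and how the database itself derives or rounds its value is not stated in this record.

```python
def clean_sequence(text: str) -> str:
    """Remove display spaces and line breaks, and upper-case the result."""
    return "".join(text.split()).upper()


def gc_fraction(sequence: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence (0.0 if empty)."""
    if not sequence:
        return 0.0
    gc = sum(1 for base in sequence if base in "GC")
    return gc / len(sequence)


# First line of the gene sequence above, pasted with its display spacing.
first_line = "ATGCCGTTAA CTGTGATTAC GATGACGAAC TGTCCACTAT CTTTAAGAGG TGATTTAACT"
seq = clean_sequence(first_line)
print(len(seq), round(gc_fraction(seq), 2))
```

Passing the full 18-line block to `clean_sequence` in the same way gives the complete gene sequence, whose length can be checked against the Gene Length field.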
 
Protein sequence
MPLTVITMTN CPLSLRGDLT KWMQEIASGV YVGNFNSRVR EELWKRIEDS VGNGAVTMSF 
SSRNEIGYDF KTIHSHREVV YSDGLPLVRI PTVDTLENDI KHGFSDAAHF HNAKKFSNIR
RKNINTPSST IENNLANENI SYLSYDNASQ TAKFATDNEE ITCTDYWKDF IDKIDKNFVV
IDCETTGLNS KVDKIIEIGA VKFIDGKIEE FQKLININCD SVSNFNKNNF IPNEIIELTG
ITSNMLESYG VRLDLALKEF INFVERLPVL GYNVQFDMSF LNSALSSLDE RFDHSLSINK
IFDISRFVKK EKKFLKNYQL KTVLHEYNIA ESVPHRALLD ARLTAHLVFN LTELLKFLA