Gene Dtur_0619 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDtur_0619 
Symbol 
ID7081659 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDictyoglomus turgidum DSM 6724 
KingdomBacteria 
Replicon accessionNC_011661 
Strand
Start bp620788 
End bp621783 
Gene Length996 bp 
Protein Length331 aa 
Translation table11 
GC content30% 
IMG OID643457695 
ProductCRISPR-associated protein Cas1 
Protein accessionYP_002352521 
Protein GI217967015 
COG category[L] Replication, recombination and repair 
COG ID[COG1518] Uncharacterized protein predicted to be involved in DNA repair 
TIGRFAM ID[TIGR00287] CRISPR-associated endonuclease Cas1
[TIGR03641] CRISPR-associated endonuclease Cas1, HMARI/TNEAP subtype 


Plasmid Coverage information

Num covering plasmid clones33 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCGAAA GTATTTATAT TTTTTCAAGT GGGGAACTAA AAAGAAAGCA CAATACAATA 
TTTTTCGAAA CGTCCGACGG TCAAAAGAAG TATCTTCCCA TAGAAAACAT TAGGGATATA
CACTTTTTTG GTGAGGTAAC CCTTAATAAA GAGTTTTTAG AGTTAATATC TCAAAAAGAA
ATAATATTGC ACTTCTATAA CTATTATGAA TATTATATAG GAAGCTTTTA CCCAAGGGAA
CATTATAACT CTGGTTTTAT GGTTCTAAAA CAAGCAGAGC ACTACTTAGA TAATGAAAAA
AGATTAAAGA TAGCAACAAA AATTGTAAAA GGAGCTTCGG GAAATATAAA AAGAGTGATT
AATTATTATC TAAATCGCGA AAAGAAAGAG TTAGAAGATT ATCTTGAGAG AATTCAAGAA
CTGGAAGAAA AAATAGATAT TACTAATAAT GTGGACGGAC TTATGGGAAT AGAAGGAAAT
ATAAGAGATA TTTATTACTC CTGTTTCAAC ATAATAACAG AAAAAGAAGA ATTCTTTATT
GATGAAAGAA CCAAAAGACC CCCCTCAAAC TATATGAATG CTCTCATAAG TTTTGGAAAT
TCCCTCCTTT ATACTACTAC ACTCTCTGAA ATATACAAAA CTCACCTTGA CCCAAGAATT
GGATATCTTC ATACTACTAA TTTCCGTAGA TTCACATTAA ATCTTGACGT AGCTGAGATA
TTTAAACCTA TCATTGTGGA TAGAATAATA TTTAGCCTTG TAAATAAAGG AGAGATTACT
CCTAAGGATT TTGAAGAAAA GTTAGATGGA GTTTACATGA ATGAAAAAGG CATGAAAATC
TTTGTACAAA ATTTTGAAGA AAGAATGAAA ACTACAATAC AGTATAAAAA TCTTGGCAAG
GTCTCGTATA GAAGACTTAT AAGACTTGAG CTTTATAAAC TTGAGAAACA CCTAATAGGA
GAGGAAGAAT ACGAACCTTA CATATCCTCT TGGTAA
 
Protein sequence
MSESIYIFSS GELKRKHNTI FFETSDGQKK YLPIENIRDI HFFGEVTLNK EFLELISQKE 
IILHFYNYYE YYIGSFYPRE HYNSGFMVLK QAEHYLDNEK RLKIATKIVK GASGNIKRVI
NYYLNREKKE LEDYLERIQE LEEKIDITNN VDGLMGIEGN IRDIYYSCFN IITEKEEFFI
DERTKRPPSN YMNALISFGN SLLYTTTLSE IYKTHLDPRI GYLHTTNFRR FTLNLDVAEI
FKPIIVDRII FSLVNKGEIT PKDFEEKLDG VYMNEKGMKI FVQNFEERMK TTIQYKNLGK
VSYRRLIRLE LYKLEKHLIG EEEYEPYISS W