Gene Information |
Locus tag | Dde_0862 |
Symbol | |
ID | 3755662 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfovibrio desulfuricans subsp. desulfuricans str. G20 |
Kingdom | Bacteria |
Replicon accession | NC_007519 |
Strand | - |
Start bp | 888895 |
End bp | 890025 |
Gene Length | 1131 bp |
Protein Length | 376 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 637781727 |
Product | CRISPR-associated Cse4 family protein |
Protein accession | YP_387358 |
Protein GI | 78355909 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01869] CRISPR system CASCADE complex protein CasC/Cse4 |
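
The coordinate, gene length, and protein length fields above can be cross-checked with simple arithmetic. The sketch below is a sanity check only and not part of the record. It assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length. The variable names are illustrative.

```python
# Values copied from the Gene Information fields above.
start_bp = 888895
end_bp = 890025
gene_length_bp = 1131
protein_length_aa = 376

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# A complete CDS is a whole number of codons.
codons, remainder = divmod(span, 3)

# Assumption: the final codon is a stop codon and is not translated.
expected_protein_length = codons - 1

print("span matches gene length:", span == gene_length_bp)
print("length is a multiple of 3:", remainder == 0)
print("protein length matches:", expected_protein_length == protein_length_aa)
```

Under these assumptions all three checks agree with the record: 890025 - 888895 + 1 = 1131 bp, which is 377 codons, leaving 376 residues once the stop codon is excluded.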
|
|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCGATT TCATCCAGCT GCACGTATTG ACAAGCTACC CTGCCTCCAA CCTTAACCGC GATGACCTCG GCCGTCCCAA AAGTGTTGTC ATGGGCAATA CAACCCGTCT GCGCATTTCC TCTCAGTGCC TCAAGCGGGC ATGGCGCACT TCGGATATAT TCCAGAATAT CGGGGCTGAA CATGTAGGCA TCCGCACCAG AGAAATGGGA CGTTATATTT TTAAGGCTCT GACCGAGGGC TGCACATTGA ACGAAGCGCT GGCCGGAAAA GCCGGCGGCA GCCTCGCCAC GGTCAAAGAA AAGGACGCAG TGGCTATCGC CCGCGCCATG GCGGGTGTGT TCGGAAAGCT GAAGGCGGAA TACAAGCCCA AAAAAGATGA TAAGGCCGAA GCCGCACGCA CGCAACGGGA AGAATCACTT GAGATAGAAC AGCTTGCCCA CTTTTCGCAG GAAGAGGTGA ATGCCGTCGC CGCTCTGACG GAAACCTGCA GAACCGGCGG CAAAGCCCCC GCCAAAGAGC AGCTTGCCCT GCTCCGCAAA GACATCAAAA CGGTGGACAT AGCCATGTTC GGCCGCATGC TGGCAGCTGA AAAGGCATTC AACGTGGAAG CAGCTGTGCA GGTGGCACAC GCCATGACCG TGCACCCCGT TACTGTGGAC GACGACTTCT TCACCGCGGT GGATGATCTG AATCGCGACG ACAGCGGCGC AGGCCATATG GGCGTTTCGG AATTCGGTGC CGGAATTTTC TATCTGTACC TGTGCATAGA CCGCGGGCTG CTCAAGCACA ACCTGCAGGG CGACACAGAG CTGACCAACC GCGCGCTGGC TGCCCTGTTG CAGGCCGTGG CACAGGTCAG CCCCTCGGGC AAACAAAACA GCTTCGGTTC CAGAGCCTAT GCCTCGTACA TTCTTGCAGA ACGTGGCAAT GACCAGCCGC GAAACCTTTC CGTCGCATAT CTGGACGGTG TGAGCAAGAA ACAGAACATC ATGCAGGAAG CCATAAAGCT GCTTACCGAA ACCCGCGCAA ACATGAACAC CGTGTACGGA CAGACATTCG AATCCGCAGA GATCAACACG CTCACCGGCA GCGGCAGCCT CAAAGAACTT GCTGCCTTCA TCGCGGGTTA G
|
Protein sequence | MSDFIQLHVL TSYPASNLNR DDLGRPKSVV MGNTTRLRIS SQCLKRAWRT SDIFQNIGAE HVGIRTREMG RYIFKALTEG CTLNEALAGK AGGSLATVKE KDAVAIARAM AGVFGKLKAE YKPKKDDKAE AARTQREESL EIEQLAHFSQ EEVNAVAALT ETCRTGGKAP AKEQLALLRK DIKTVDIAMF GRMLAAEKAF NVEAAVQVAH AMTVHPVTVD DDFFTAVDDL NRDDSGAGHM GVSEFGAGIF YLYLCIDRGL LKHNLQGDTE LTNRALAALL QAVAQVSPSG KQNSFGSRAY ASYILAERGN DQPRNLSVAY LDGVSKKQNI MQEAIKLLTE TRANMNTVYG QTFESAEINT LTGSGSLKEL AAFIAG