Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dvul_2973 |
Symbol | |
ID | 4661890 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfovibrio vulgaris DP4 |
Kingdom | Bacteria |
Replicon accession | NC_008741 |
Strand | - |
Start bp | 33096 |
End bp | 34127 |
Gene Length | 1032 bp |
Protein Length | 343 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 639813893 |
Product | CRISPR-associated Cas1 family protein |
Protein accession | YP_961172 |
Protein GI | 120586827 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1518] Uncharacterized protein predicted to be involved in DNA repair |
TIGRFAM ID | [TIGR00287] CRISPR-associated endonuclease Cas1 [TIGR03640] CRISPR-associated endonuclease Cas1, DVULG subtype |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGAAAC TGCTTAACAC GCTCTATGTG ACCACGCAGG GCACCTACCT CGCCAAAGAG GGCGAGTGCA TCGTCGTGCG CGTGGGAGAC GAGGTGAGAC TGCGTGTGCC GGTGCACTCC CTTGGTGGCG TGGTCTGTTT CGGGCAGGTC TCGTGCAGTC CTTTCCTGAT GGGCTTCGCC GCCGAACGCG GGCTTGGCTT CAGCTTCCTG ACCGAGCACG GGCGATTCCT TGCGCGGGTT CAAGGCCCCG TATCCGGGAA TGTCCTTCTG CGGCGGGAGC AATACCGCAG GGCCGACTCG CCGGAAGCCA GCGCCGAGGT GGCGCGAAGC ATCGTCAGCG CCAAGGTCGT CAACGCCCGC GGCATCCTGC AACGGGCCAT GCGCGACCAT GGTGACAAGG TCAACGGGGC GGCGCTGGAA GCGGAAGTGC TCCACTTGAG GGACTGCCTC ATGCGCTTGC AACAGCCCGC AGGGCTGGAT GCCGTGCGCG GCATCGAGGG CGAAGCTGCC AAAGGCTACT TCAGCGTGTT CGACAACCTC ATCCTGACAC GGGAGGCGGC GTTCCGCTTC GAGGGACGCA GCCGGCGCCC ACCGCTGGAC AGGGTCAACT GCCTGCTGTC GTTCATCTAT ACGTTGCTCG GGCATGATGT ACGAAGCGCG CTGGAAGGCG TCGGGCTTGA TTCGGCAGTG GGGTTCCTGC ACCGCGACCG CCCCGGCAGG CACGGCCTTG CACTTGATGT CATGGAAGAG TTCAGGGCCG TTGTCGCCGA CAGGCTGGCG CTTTCGCTCA TCAATCTCGG CAAGCTCAAG AAGAACGACT TCGAGATACA GGAGACGGGT GCCGTCCGGA TGACGGATGA TGCCCGCAAG GCATTGCTCG TGGCATACCA GAAGCGAAAA CAGGATGAAA TCGTCCATCC CTTCCTGAAA GAACGCATCC CCTTGGGGCT TGTCTTCCAT GTACAGGCCA TGCTCATGGC AAGGTGGCTT CGCGGCGACA TCGACGGCTA CCCACCCTTT GTATGGAAAT GA
|
Protein sequence | MKKLLNTLYV TTQGTYLAKE GECIVVRVGD EVRLRVPVHS LGGVVCFGQV SCSPFLMGFA AERGLGFSFL TEHGRFLARV QGPVSGNVLL RREQYRRADS PEASAEVARS IVSAKVVNAR GILQRAMRDH GDKVNGAALE AEVLHLRDCL MRLQQPAGLD AVRGIEGEAA KGYFSVFDNL ILTREAAFRF EGRSRRPPLD RVNCLLSFIY TLLGHDVRSA LEGVGLDSAV GFLHRDRPGR HGLALDVMEE FRAVVADRLA LSLINLGKLK KNDFEIQETG AVRMTDDARK ALLVAYQKRK QDEIVHPFLK ERIPLGLVFH VQAMLMARWL RGDIDGYPPF VWK
|
| |