Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sfum_2830 |
Symbol | |
ID | 4458828 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Syntrophobacter fumaroxidans MPOB |
Kingdom | Bacteria |
Replicon accession | NC_008554 |
Strand | + |
Start bp | 3487709 |
End bp | 3488629 |
Gene Length | 921 bp |
Protein Length | 306 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 639703604 |
Product | CRISPR-associated Cas1 family protein |
Protein accession | YP_846943 |
Protein GI | 116750256 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1518] Uncharacterized protein predicted to be involved in DNA repair |
TIGRFAM ID | [TIGR00287] CRISPR-associated endonuclease Cas1 [TIGR03638] CRISPR-associated endonuclease Cas1, ECOLI subtype |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAGTCGC AGTTGCCGCC CCTCAGGCCG ATTCCCATAA AGGCTCGTGT TTCTGTCCTC TTTCTTGAAA AAGGCAACCT GGACGTTTTG GACGGAGCCT TCGTGGTTGT GGACAAGAAC GGCGTAAGAA CCCATATCCC GGTCGGAGGC GTTGCCTGCC TCATGCTCGA GCCCGGAACA CGGGTTTCCC ACGCTGCAGC GGTGCTTGCC TCAAGGGTGG GATGCCTTCT CATCTGGGTG GGAGAGGCGG GGGTCCGGCT CTATGCCTCC GGACAGCCGG GCGGCGCACG CGCCGACCGT CTGCTCTACC AGGCGCGGCT CGCGCTTGAT GATACGGCGC GTCTGAAAGT CGTCAGAAAA ATGTATGCCG TGAGGTTTGG CGAGGAACCG CCTGCACACC GGAGCATCGA GCAACTGAGG GCCATCGAAG GGAGCCGTGT GAAGAATCTC TATAAGCTTT TCGCCAAGCA GTACGGGGTA CCGTGGCATG GCAGGAAATA CGATCCCGAT GAGTGGGGCA GCGGGGACAC GGCCAACCGT TGCCTGAGCG CTGCCACGGC TTGTCTCTAC GGAGTGGCGG AGGCGGCCGT CCTGGCGGCA GGGTACGCCC CGGCGGTGGG CTTCATTCAT ACGGGAAAGC CGCTTTCCTT CGTCTATGAT ATTGCGGACA TCTTCAAATT CGAGACGGTC GTGCCCGTTG CATTCAAAGT TGCGGCCCAA AGGTCGGGCG AACCGGAGAG AGCGGTTCGC ATTGCATGTC GAGACGCTTT CAGGAAAAGC CAGCTTCTTA CGCGCATCAT CCCCACCATT GAAACCATTT TGAACGCAGG CGGGCTGAAT CGCCCCGATC CGCCGGAAGA TTCCGTACCG CCGGCTATCC CGAACAAGGA GCAATCAGGC GATGCTGGTC ATCGTGGTTG A
|
Protein sequence | MQSQLPPLRP IPIKARVSVL FLEKGNLDVL DGAFVVVDKN GVRTHIPVGG VACLMLEPGT RVSHAAAVLA SRVGCLLIWV GEAGVRLYAS GQPGGARADR LLYQARLALD DTARLKVVRK MYAVRFGEEP PAHRSIEQLR AIEGSRVKNL YKLFAKQYGV PWHGRKYDPD EWGSGDTANR CLSAATACLY GVAEAAVLAA GYAPAVGFIH TGKPLSFVYD IADIFKFETV VPVAFKVAAQ RSGEPERAVR IACRDAFRKS QLLTRIIPTI ETILNAGGLN RPDPPEDSVP PAIPNKEQSG DAGHRG
|
| |