Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Daud_1811 |
Symbol | |
ID | 6026615 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Desulforudis audaxviator MP104C |
Kingdom | Bacteria |
Replicon accession | NC_010424 |
Strand | - |
Start bp | 1914303 |
End bp | 1915298 |
Gene Length | 996 bp |
Protein Length | 331 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 641594628 |
Product | CRISPR-associated Cas1 family protein |
Protein accession | YP_001717939 |
Protein GI | 169831957 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1518] Uncharacterized protein predicted to be involved in DNA repair |
TIGRFAM ID | [TIGR00287] CRISPR-associated endonuclease Cas1 [TIGR03641] CRISPR-associated endonuclease Cas1, HMARI/TNEAP subtype |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.0537215 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGCAAAAAA CACTTTATCT CTTCGCGAGC GGGCGGCTGC GGCGTAAGGA CAATACCGTT TGCTTGGAAA GCGAAAACGC CACTAAATAC TTCCCGGTGG CCAGCCTGCG GGACATTTTC GTGTTTGGCG AACTCGACCT GAACAAGAAA CTGTTCGAGT TTATGGAAGA ACAGGAGATC GTCCTGCACT TTTTCGGCTA CTACGGCAAC TACGTGGGCA GTTTTTACCC GCGGGAGCAC TACAACTCGG GGTACGTCAT CCTGCGACAG GCTGAACACT ACCTCGATGG GGCACGCCGC CTGGAACTGG CGCGCCGGTT TGTGGAAGGG GCACTCGCCA ACATCCTCCA GATCCTGCGT TACTACCAGA ACCGGGGGAA GGAGTTGGCG GGCTCCATCG AGTCCATTTC GCGACTGCTC GAAGGAAGCC TGCCGGCCTG CGGCCGTGTG GAAGAACTGA TGGCCGTGGA GGGCAACGCC CGGGAATACT ATTACGAAAG CTTCAACGTT ATCCTGGACA AATCGCCTTT TACCATGCCC GGCCGCAGCA AGCGTCCGCC GACGGATCCG TTAAACGCGC TGATCAGCTT CGGCAACTCC CTGATCTACG CCAAGATCCT GACCGAAATC TACAAAACAC ACCTGGACCC GCGCATCGGC TACCTGCATT CCACTAATTT TCGCCGCTTC ACTTTGAACC TCGACCTGGC GGAGATATTC AAACCGGTGC TGGCCGACCG CGTGTTGTTC CACCTGTTGG GCAAAAAGAT GCTTGACCTG AAGGATTTCG AGCAGCAGGG CGGGGCCTAT CTCCTTAGAG AGCGGGGCCG GCGCCTGTAT GTCGAAACCC TGGAAGAAAA ACTGCAAAGC ACCTTTCATC ACCGGCGGCT GCGCCGCAAC GTGAGCTACC AGACCCTGCT CCGTTTGGAA CTCTACAAGA TTCAAAAACA CCTGATGGGC GAGCAGCCCT ACAAACCGTT CGTCAGTCGG TGGTAA
|
Protein sequence | MQKTLYLFAS GRLRRKDNTV CLESENATKY FPVASLRDIF VFGELDLNKK LFEFMEEQEI VLHFFGYYGN YVGSFYPREH YNSGYVILRQ AEHYLDGARR LELARRFVEG ALANILQILR YYQNRGKELA GSIESISRLL EGSLPACGRV EELMAVEGNA REYYYESFNV ILDKSPFTMP GRSKRPPTDP LNALISFGNS LIYAKILTEI YKTHLDPRIG YLHSTNFRRF TLNLDLAEIF KPVLADRVLF HLLGKKMLDL KDFEQQGGAY LLRERGRRLY VETLEEKLQS TFHHRRLRRN VSYQTLLRLE LYKIQKHLMG EQPYKPFVSR W
|
| |