Gene Information |
Locus tag | Tgr7_2433 |
Symbol | |
ID | 7317135 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thioalkalivibrio sp. HL-EbGR7 |
Kingdom | Bacteria |
Replicon accession | NC_011901 |
Strand | - |
Start bp | 2578761 |
End bp | 2580473 |
Gene Length | 1713 bp |
Protein Length | 570 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 643617337 |
Product | CRISPR-associated Cas1/Cas4 family protein |
Protein accession | YP_002514498 |
Protein GI | 220935599 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1468] RecB family exonuclease [COG1518] Uncharacterized protein predicted to be involved in DNA repair |
TIGRFAM ID | [TIGR00287] CRISPR-associated endonuclease Cas1 [TIGR00372] CRISPR-associated protein Cas4 |
| |
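The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal Python sketch, assuming that Start bp and End bp are 1-based and inclusive and that the unspliced CDS ends in a single stop codon. The function names `gene_length` and `protein_length` are illustrative only and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, excluding one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp rows of this record.
length_bp = gene_length(2578761, 2580473)
print(length_bp, protein_length(length_bp))
```

For Tgr7_2433 this yields 1713 bp and 570 aa, in agreement with the Gene Length and Protein Length rows.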
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0199418 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGACGATC AGCAGCAGCC CGAACTGCCG CTCCCTTTTC CGGAGCTATC CGGCGATATG CCCTTGCTCC CGGCGCGCAT GGTCAACGAA TACCAGTACT GCCCCCGTCT GGCGTATCTG GAATGGGTGC AGGGGGAGTG GTCCGAGTCC GCGGATACCG TGGATGGACG CTACCGTCAC CGTCGGGTGG ACAAAGGTTC TGGAGATCTT CCGGCCCCCG GCGTCGCCGA GGCGGGTGAA CGCTACCATG CCCGCTCCAT AACCCTGTCC TCCAACCGCC TGGGCCTCAT CGCCCGCATG GATCTGCTGG AAGGCGAGGG TGACCATGTC ACACCCGTCG ACTACAAGCG TGGCAAGCGG CCTCATGTGG CCCGCGGCGC CTACGAGCCC GAAAGGGTCC AGTTGTGCGT GCAGGGCATG ATTCTGGAGG AATGCGGCTA TACCTGCGAC GAAGGGGAGT TGTACTTCAC CGAATCCCGC GAGCGTGTAC GTGTGCCCTT CGACGAGGAA TTACGCCGGT TGACGCTCAA CGCCATCAAC GGACTACGAT TCATCGCCGC GGGTGGACAG ATCCCCGCAC CCCTGGAGGA CAGTCCCAAG TGTCCGCGCT GTTCGCTGGT AGGTATCTGC CTGCCCGACG AGGTCAACTA CCTCCGCCGG GAACAGACGC CGCCAAGGCC CCTGGCCGTC GCCCGGGATG AGGCCCTGCC ACTCTATATC CAGGCCCGGG GCGCCAAGCT GGCCAAGCGG GGCGAGACTT TGGAGGTCAC GGTGGATGAC GAAAAAGTAC AGTCGGTGCG CCTCATCGAC GTCTCCCAGG TGATCGTCAT GGGCAATGTC TATATCACCA CGCCGTGCCT TCAGGAACTC ATGCAGCGCG AGATCCCCGT CAGTTGGCAT TCCCATGGCG GCTGGTTCAT GGGCCACACC ATGGGCACCG GCCACAAGAA CGTGGAGATC CGGACTGCCC AGTACAAGGC GAGCTTCGAG GAGCACCAGT GTCTGCACAT CGCCAAGGGC CTGGTGGAGG CCAAGATCCA GAATTGCCGC ACGCTGTTGC GGCGCAACTG GAAAGGCGAG GACAAGCCCG TGGATCTGCT CGACGGCCTG CAGGTGGATA TCAGGAAATC CCGTCGTGCA TCCAATCTTC AGGAGCTGCT TGGCATCGAA GGGGCGGCCG CCTCCCGCTA CTTCGGTGCC TTCGCCCGGC TGTTGAAGCA CAGCGATGCC GGGCCCGAAC TGACATTCGA CTTCACCACG CGCAACCGCC GTCCGCCCAC GGACCCGGTC AACGCCCTGT TGTCCTACGC CTACGCCCTT CTGACCCGGT CCTGGACGGC TTCGCTCTCG GCAGTGGGCC TTGATCCGTA TCGAGGCTTC TATCATCAGC CCCGTTACGG CCGTCCGGCG CTGGCATTGG ACATGATGGA GCCGTTCAGA CCCTTGATCG CGGATTCCAG TGTCATTCAG GCGATCAACA ACGGTGAAGT GCGTCCATCG GATTTCCAGA GCGTGGCCGG CAGCGTCGCG CTGACCAACG ACGGACGCAA GCGCTTCATC GCCACCTTCG AGCGGCGCAT GAGTCATGAG ATCACCCATC CCCTGTTCGG ATACCGGCTC AGCTACCGCC GGCTGCTGGA GGTCCAGGGC AGGCTGCTCG CGCGCTATCT GTTGGGTGAG CTGCCCGACT ATCCGAACTT CACGACCCGG TGA
|
Protein sequence | MDDQQQPELP LPFPELSGDM PLLPARMVNE YQYCPRLAYL EWVQGEWSES ADTVDGRYRH RRVDKGSGDL PAPGVAEAGE RYHARSITLS SNRLGLIARM DLLEGEGDHV TPVDYKRGKR PHVARGAYEP ERVQLCVQGM ILEECGYTCD EGELYFTESR ERVRVPFDEE LRRLTLNAIN GLRFIAAGGQ IPAPLEDSPK CPRCSLVGIC LPDEVNYLRR EQTPPRPLAV ARDEALPLYI QARGAKLAKR GETLEVTVDD EKVQSVRLID VSQVIVMGNV YITTPCLQEL MQREIPVSWH SHGGWFMGHT MGTGHKNVEI RTAQYKASFE EHQCLHIAKG LVEAKIQNCR TLLRRNWKGE DKPVDLLDGL QVDIRKSRRA SNLQELLGIE GAAASRYFGA FARLLKHSDA GPELTFDFTT RNRRPPTDPV NALLSYAYAL LTRSWTASLS AVGLDPYRGF YHQPRYGRPA LALDMMEPFR PLIADSSVIQ AINNGEVRPS DFQSVAGSVA LTNDGRKRFI ATFERRMSHE ITHPLFGYRL SYRRLLEVQG RLLARYLLGE LPDYPNFTTR
|
| |
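Both sequences above are displayed in space-separated blocks of ten characters. The following is a minimal Python sketch for collapsing such a display into a contiguous string and running a basic sanity check on a coding sequence. `clean_sequence` and `looks_like_cds` are hypothetical helper names; the stop-codon set is that of translation table 11 (TAA, TAG, TGA). The check covers only reading-frame length and the terminal stop codon, not the start codon.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons of translation table 11


def clean_sequence(grouped: str) -> str:
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(grouped.split()).upper()


def looks_like_cds(seq: str) -> bool:
    """True if the length is a multiple of 3 and the last codon is a stop codon."""
    return len(seq) >= 6 and len(seq) % 3 == 0 and seq[-3:] in STOP_CODONS


# First two blocks of the gene sequence in this record, used only as a demo.
fragment = "ATGGACGATC AGCAGCAGCC"
print(clean_sequence(fragment))
```

Passing the full Gene sequence field through `clean_sequence` and then `looks_like_cds` is one way to confirm that the copied text is intact before translating it.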