Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tgr7_1800 |
Symbol | |
ID | 7317610 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thioalkalivibrio sp. HL-EbGR7 |
Kingdom | Bacteria |
Replicon accession | NC_011901 |
Strand | + |
Start bp | 1917250 |
End bp | 1919325 |
Gene Length | 2076 bp |
Protein Length | 691 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 643616692 |
Product | CRISPR-associated protein Cas1 |
Protein accession | YP_002513869 |
Protein GI | 220934970 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1518] Uncharacterized protein predicted to be involved in DNA repair |
TIGRFAM ID | [TIGR00287] CRISPR-associated endonuclease Cas1 |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATCACG ACGCCGACCA CTGGATCGCC CCCCTGTTGC CCCTGCGCGC ACTGGGCTTC ACCCTCGCCT TCACCGAGGA CACGCCGGCG AGGTCCTGGC TGGAGCCCGC CCTCACTGCC TTCCTGCGCA GCCTGCTGGG CAGCCCCGCC GATTTCGACC GGCTCATGAG CCTGCGCGTC GATCAGGTCC CCGAGGGAGA AACCTGGGTC GAGGGTGCCA CCCTGACTGC CACGCTCATC GCCCTGCCCG GCGGCGAGGC GCTGCTTGCC TGCGCCGTGG ATCGCCTGCG CGCCCTGCCC GGCAGCGCGC CGCGTCTTGA CCAGCCCCTT CCGGTGCGCG ACAACCTGCG CTGCCTGGAA CTGTTCGACG GCTTCACCCG TGCGTTCGTG GAGCATGCCA CCGACCTTCA GCCCTGGGAC GATGCAGCCT TGCAGCACGA AGTGCGGATC CTCGCCCGCC AGACCTCCCT GCAGCTGTGC TTCACCTCCC CGGTGCGTCT TCTCAAGGCC AAGGCACTGC GCGGCGATGC GAAAGGCGAG GCCCGCTACT GCAGCGAGGC GGCACACCTG ACCCCGGCAC TGTTGTTCGC CCGGCTCCAC GACACCGTGG CCGACATCCT GCGCCGCCAC GGGCATGCGC CACCGCCGCG CCTGGACCTC GGTCCTGTGC CGGGACAGGT CCTCGAGGCG CGCTGGATCG ACCATGGTTA CCGTGATGGT GCTGGCCGCG ACCAGCCCAT GGGCGGCCTG CTGGCCAGCC TGGAACTGCA ACCGGATCGG CTGGCGCCTG AACTGCTCAC GCTGCTGGCA CTTGGCCGCC ACCTGGGACT GGGCCAGCGG CGCAGCTTCG GCTGGGGACG CTACCGGCTG GAAGACGAAT GGGGCATCGA GCAGCCCGTA GGCAGTGATC GACCGGTAGG GCCCGCTGAA CCCGTCGCGC CAGCCGACCC CGTTGGCGCG CAAACCGGCG ATGACATCGA CTGGTTCGGA CTGGACGAAA CCCCAGGCAG CGCCGAGGAG ATCGACTGGT TCGGCGACAT GGAGGCCGAG AACCTGCCCG TGGCCCTGGG CCAGGCGGCC GCCGAAGGCA CCCAGCTGCT GCTGGCCGGG GAACCGGCCC GGGTCGGCCT CGATGGCGGA CGCCTGCGGG TGCAGCGGGG TGAAGCCGAA CTCCTCTCCG CGCCCCTGGA AACCCTCGCC GGCGTCACCC TCCTCGGCCC GCACCAGGTC AGCACCCAGT TGCTGGGCGC CCTGCTGGAC CGGGGCATCC CCCTGGCCCT GGCCACGGGC CAGGGTCGCC TGCGCGGCGT GCTCTGGAAC GGCGTCCCGG GCGATGCCGG TCCCGGGCTA TGGATGCGCC AGGCGGCCTG TTTCGAGGAT GACGCCCGGG CCCTCGAGGC CGCCCGCGCC GTGGTGGATG CGCGCCTGCG CCAGCAGCGT GAAGTACTGC GCAACCGCAT GTCACCGGAA CGCTGCGACG ACCTGTTGCC GCGCCTCGAC CGGCTCATCG CCAAGACCGC CGCGGCCTCC GACCGGGCCA GTCTCAACGG ACTGGAAGGC CAGGCCGCCC GACTCTACTT CGGTGCCCTC GCTGAACTGC TGCCCCCCGA ACTGGGCTTT ACCGGACGCA ACCGCCGCCC GCCCCGGGAC CCTTTCAATG TCCTGCTCTC CCTGGGCTAC ACCGTACTCC ACGCCCACGT GGACACGGTG GTGCGGCTCA ACGGTCTCTA CCCCTGGCGC GGCTTCTACC ACCAGCCCCA TGGCCTGCAC CCGGCGCTCG CCTCGGACCT CATGGAACCC TTCCGCCACC TGGTGGAGCG GGTTGCGCTC AATGTCGTCG CCCGGGGACG CATCCGTGTC AGCGACTTCG CGCAGCAGGG TGATGCCTGC CGCATCGAAG CCGGGGCCCG CCGTCGTTAC CTGGCCGATC TCAGCGAGCG CCTGCTCACC CCCGTGCGCG CCCGGGGTGA CAGCGAGGCC CGCAGCCTGC ATGATCACCT GCATCGCCAG GCCCGCAGCC TGATCGCCTG GATTCGCGAG GAGGCCCCGG CCTTCGAACC CTTCCGGACC CGTTGA
|
Protein sequence | MNHDADHWIA PLLPLRALGF TLAFTEDTPA RSWLEPALTA FLRSLLGSPA DFDRLMSLRV DQVPEGETWV EGATLTATLI ALPGGEALLA CAVDRLRALP GSAPRLDQPL PVRDNLRCLE LFDGFTRAFV EHATDLQPWD DAALQHEVRI LARQTSLQLC FTSPVRLLKA KALRGDAKGE ARYCSEAAHL TPALLFARLH DTVADILRRH GHAPPPRLDL GPVPGQVLEA RWIDHGYRDG AGRDQPMGGL LASLELQPDR LAPELLTLLA LGRHLGLGQR RSFGWGRYRL EDEWGIEQPV GSDRPVGPAE PVAPADPVGA QTGDDIDWFG LDETPGSAEE IDWFGDMEAE NLPVALGQAA AEGTQLLLAG EPARVGLDGG RLRVQRGEAE LLSAPLETLA GVTLLGPHQV STQLLGALLD RGIPLALATG QGRLRGVLWN GVPGDAGPGL WMRQAACFED DARALEAARA VVDARLRQQR EVLRNRMSPE RCDDLLPRLD RLIAKTAAAS DRASLNGLEG QAARLYFGAL AELLPPELGF TGRNRRPPRD PFNVLLSLGY TVLHAHVDTV VRLNGLYPWR GFYHQPHGLH PALASDLMEP FRHLVERVAL NVVARGRIRV SDFAQQGDAC RIEAGARRRY LADLSERLLT PVRARGDSEA RSLHDHLHRQ ARSLIAWIRE EAPAFEPFRT R
|
| |