Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tgr7_1803 |
Symbol | |
ID | 7317613 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thioalkalivibrio sp. HL-EbGR7 |
Kingdom | Bacteria |
Replicon accession | NC_011901 |
Strand | + |
Start bp | 1921651 |
End bp | 1922856 |
Gene Length | 1206 bp |
Protein Length | 401 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 643616695 |
Product | CRISPR-associated protein, TM1812 family |
Protein accession | YP_002513872 |
Protein GI | 220934973 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR02221] CRISPR-associated protein, TM1812 family [TIGR02549] CRISPR-associated DxTHG motif protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACCGAC ACACCCTCAT CACCTTCCTG GGCAAGGGCG CCCGGGACCC GAACACCGGT TATCGTAGCA CCACCTATCG CTTTGCCGAC CGGGACCGGA GCGGTGCCTT CTTCGGCCTG TTGCTCGCCC AGCACCTGGC GCCGGATCGT CTGCTCATAC TCGGCACACG TTCCAGCCAG TGGGATCTGC TGGTGGAATC CCTGGCCGGT TCAGGGGAGG ACGAGGAGGC TCGTCTGGCC CTCATGGAGG CGGTGTCCGA GGGCGCGGTG ACACCCCGTC TTCTGGAACG TGTCAGGCCC CTGATGGAAC GGGCCGTGGG GATTCCCCTC ACCCCCTGCC TGATTCCCTT CGGCGAGGCT CAGGACGAGC AGTTGGCGAT CCTCGACACC ATAGCCACCC ACGTGCCCGG TGGCGAGGTC AGCTTCGACC TCACCCACGG CTTCCGTCAC ATGGGGATGC TGGGCTTTCT GTCCGCCTTC ATGCTCGAGC GCATCGGCCA CCTGCGGGTG CGCGGGCTCT GGTATGGCGC CCTGGACATG AGCCGGGACA ACATCACGCC GGTAGTGCGT CTGGACGGGC TCTCCGCCGT GCGCCACTGG ATCGAGGCCC TCTCGCATTT CGAGGCCACT GGCGACTACG GCGTGTTCGC ATCGCTGCTG GTGGCCGATG GCGTGGCCGA GGACAAGGCC CGCTGCCTGG AAGAGGCGGC CTTCCACGAA CGCACCTTCA ATCTCCTGGA TGCCGTGCGC AAGCTGCGCA CCTTCCTGCC GGTACTCGAT ACGCCATTGG GCGGCGCCTC GGGACTTTTC CAGGCGGTGC TCCTCAGGCA CCTGGAATGG AGTCGGGAAA GCAGCCTGGA TGCTTACCAG AAACGCCTCT CCAGGGCCTA CCTGCACATG GGCGATTTCG TACGCGCCGT GATCTTCGGC TGGGAGGCGG TCATCACCCG GCATTGCCTG GCCACGGGCA GGGATCCACG AGATTTCGCT TTGGGCCGCA AGGCCGCCGC CGAGGATCTG GACGCGGATG TCAGAAATGG AATACGCGAT AGGGACTTCG TGCAGGCCTA CTGGACCATC AAGAACCTGC GCAACTCGCT GGCCCACGGA AACCCGCCCA AGGACAAGGA TCTGCTCAGG GTGCTGAAGT CCGAACAGGA CACCCGCAGG TTCATCACCG AGTCCCTCCA ACGTCTGTTG GATTGA
|
Protein sequence | MNRHTLITFL GKGARDPNTG YRSTTYRFAD RDRSGAFFGL LLAQHLAPDR LLILGTRSSQ WDLLVESLAG SGEDEEARLA LMEAVSEGAV TPRLLERVRP LMERAVGIPL TPCLIPFGEA QDEQLAILDT IATHVPGGEV SFDLTHGFRH MGMLGFLSAF MLERIGHLRV RGLWYGALDM SRDNITPVVR LDGLSAVRHW IEALSHFEAT GDYGVFASLL VADGVAEDKA RCLEEAAFHE RTFNLLDAVR KLRTFLPVLD TPLGGASGLF QAVLLRHLEW SRESSLDAYQ KRLSRAYLHM GDFVRAVIFG WEAVITRHCL ATGRDPRDFA LGRKAAAEDL DADVRNGIRD RDFVQAYWTI KNLRNSLAHG NPPKDKDLLR VLKSEQDTRR FITESLQRLL D
|
| |