Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GYMC61_1192 |
Symbol | |
ID | 8525031 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. Y412MC61 |
Kingdom | Bacteria |
Replicon accession | NC_013411 |
Strand | + |
Start bp | 1213948 |
End bp | 1215288 |
Gene Length | 1341 bp |
Protein Length | 446 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | |
Product | CRISPR-associated protein with DxTHG motif |
Protein accession | YP_003252327 |
Protein GI | 261418645 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGAAGAC GAGTGTTGTT GTCGTTTCTC GGGTTAGGTG ATTATGAGTA TTGTTACTAC ACATACGAAG GAAAAACGTC TACGTATACC CGCTTTATTC AAACGGCGGT GTATGAGCTG TTCCGCAACG ATGAACCGAT GGATGTCGTC GTGTTTGCGA CGAAAGAAGC GCAGGATCGG AATGGACAAG ATCGAAAAAA AGGGGACAAG CTTCTTGAAG GGATCGGCAC AGCGTTTAGT CGCATTGCGC CAGAAGCGAA CGTGAAAATT GTCGAGATCG AAAGCGGCCA AGACGAGCAG GCCAATTGGC GGCTATTTGA CCGTATCATG GATGAAATCA AGGAAGGGGA TACGATTTAT TTTGACATCA CCCACAGTTT CCGCTCCATC CCGTTTGTGG CGCTTATCGT GTTGAATTAT GCGCGTTTAG TGAAAAAGGC AGATATTGGA GCGATCATCT ATGGCTGGTT TGAAGTGCTT GGGCGCCCCA TTGATGTTGA GCAGATGCCA GAAGAACAGA GAGTCGCGCC GATTGTCAAC TTGACGAGCA TGGCGAAGTT GCTCGATTGG ACCAACGGGG TCGATCAATT TTTGCGCACG GGGGATGCCT CCATCATCCA GGCGCTGACC GCGAAAGAAA ACAGCGAAGT GTTCCGCAAT CCGTCTTTGA GCCAAGCGGT AAAGGACGAA GTGAAGGAAT TAAGAGAGTT GACGAAGCGG CTTGATCAGA CGGAAAAAGC GATCCGCACG TGCCGAAGCT TGCAAATCGA TGAAGAAGTG CAGAAATTTC ATGAACAGCT CGGCCGCGTT CGCTCAGCGT CGGCGGAAGC GATTAAACCG CTTGTCCCAT TGTTGGATGT GATGGAGAAA AAGTATGCGA TGTTTGATGA TGATCCGATC ATGAACAGCT GGAAGGCTGT GCGTTGGTGT TTGGACCACG GATTGATTCA GCAGGCGCTG ACGATGTTGG AGGAAAATGC GGTGACGGCC GTTTGCCGGG TGTTGGGGCT TGATTTGCGG AATGAGAAGG CCCGGGGAGA TGTTCATTCG GCTATTGAAA TTTTATTGCG AGATATTCCG AAAGAGGAAT GGCGCGTTCG TTCCGTAGAG CGGGTTGAAC AAATAATCGA TTTCTTGTCT CCTTATAAGG GCGACTTAAA ACGATTTAGC ACGCTTAAGG AGCGGCGCAA CGACATCAAT CATGCTGGGG CGCGGCCGCA ACCGCTAAAG GCGGAAAAGT TTTGGCCCGA TGCGGAGCAG TCGTTCCGAG AACTTGGTGC GTTTTTTGAA CGAATGTCCG CACTTGCAAA ATCGATGCAA ACCGTAAAGG GGAATGGGTG A
|
Protein sequence | MGRRVLLSFL GLGDYEYCYY TYEGKTSTYT RFIQTAVYEL FRNDEPMDVV VFATKEAQDR NGQDRKKGDK LLEGIGTAFS RIAPEANVKI VEIESGQDEQ ANWRLFDRIM DEIKEGDTIY FDITHSFRSI PFVALIVLNY ARLVKKADIG AIIYGWFEVL GRPIDVEQMP EEQRVAPIVN LTSMAKLLDW TNGVDQFLRT GDASIIQALT AKENSEVFRN PSLSQAVKDE VKELRELTKR LDQTEKAIRT CRSLQIDEEV QKFHEQLGRV RSASAEAIKP LVPLLDVMEK KYAMFDDDPI MNSWKAVRWC LDHGLIQQAL TMLEENAVTA VCRVLGLDLR NEKARGDVHS AIEILLRDIP KEEWRVRSVE RVEQIIDFLS PYKGDLKRFS TLKERRNDIN HAGARPQPLK AEKFWPDAEQ SFRELGAFFE RMSALAKSMQ TVKGNG
|
| |