Gene GYMC61_1192 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGYMC61_1192 
Symbol 
ID8525031 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacillus sp. Y412MC61 
KingdomBacteria 
Replicon accessionNC_013411 
Strand
Start bp1213948 
End bp1215288 
Gene Length1341 bp 
Protein Length446 aa 
Translation table11 
GC content50% 
IMG OID 
ProductCRISPR-associated protein with DxTHG motif 
Protein accessionYP_003252327 
Protein GI261418645 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGAAGAC GAGTGTTGTT GTCGTTTCTC GGGTTAGGTG ATTATGAGTA TTGTTACTAC 
ACATACGAAG GAAAAACGTC TACGTATACC CGCTTTATTC AAACGGCGGT GTATGAGCTG
TTCCGCAACG ATGAACCGAT GGATGTCGTC GTGTTTGCGA CGAAAGAAGC GCAGGATCGG
AATGGACAAG ATCGAAAAAA AGGGGACAAG CTTCTTGAAG GGATCGGCAC AGCGTTTAGT
CGCATTGCGC CAGAAGCGAA CGTGAAAATT GTCGAGATCG AAAGCGGCCA AGACGAGCAG
GCCAATTGGC GGCTATTTGA CCGTATCATG GATGAAATCA AGGAAGGGGA TACGATTTAT
TTTGACATCA CCCACAGTTT CCGCTCCATC CCGTTTGTGG CGCTTATCGT GTTGAATTAT
GCGCGTTTAG TGAAAAAGGC AGATATTGGA GCGATCATCT ATGGCTGGTT TGAAGTGCTT
GGGCGCCCCA TTGATGTTGA GCAGATGCCA GAAGAACAGA GAGTCGCGCC GATTGTCAAC
TTGACGAGCA TGGCGAAGTT GCTCGATTGG ACCAACGGGG TCGATCAATT TTTGCGCACG
GGGGATGCCT CCATCATCCA GGCGCTGACC GCGAAAGAAA ACAGCGAAGT GTTCCGCAAT
CCGTCTTTGA GCCAAGCGGT AAAGGACGAA GTGAAGGAAT TAAGAGAGTT GACGAAGCGG
CTTGATCAGA CGGAAAAAGC GATCCGCACG TGCCGAAGCT TGCAAATCGA TGAAGAAGTG
CAGAAATTTC ATGAACAGCT CGGCCGCGTT CGCTCAGCGT CGGCGGAAGC GATTAAACCG
CTTGTCCCAT TGTTGGATGT GATGGAGAAA AAGTATGCGA TGTTTGATGA TGATCCGATC
ATGAACAGCT GGAAGGCTGT GCGTTGGTGT TTGGACCACG GATTGATTCA GCAGGCGCTG
ACGATGTTGG AGGAAAATGC GGTGACGGCC GTTTGCCGGG TGTTGGGGCT TGATTTGCGG
AATGAGAAGG CCCGGGGAGA TGTTCATTCG GCTATTGAAA TTTTATTGCG AGATATTCCG
AAAGAGGAAT GGCGCGTTCG TTCCGTAGAG CGGGTTGAAC AAATAATCGA TTTCTTGTCT
CCTTATAAGG GCGACTTAAA ACGATTTAGC ACGCTTAAGG AGCGGCGCAA CGACATCAAT
CATGCTGGGG CGCGGCCGCA ACCGCTAAAG GCGGAAAAGT TTTGGCCCGA TGCGGAGCAG
TCGTTCCGAG AACTTGGTGC GTTTTTTGAA CGAATGTCCG CACTTGCAAA ATCGATGCAA
ACCGTAAAGG GGAATGGGTG A
 
Protein sequence
MGRRVLLSFL GLGDYEYCYY TYEGKTSTYT RFIQTAVYEL FRNDEPMDVV VFATKEAQDR 
NGQDRKKGDK LLEGIGTAFS RIAPEANVKI VEIESGQDEQ ANWRLFDRIM DEIKEGDTIY
FDITHSFRSI PFVALIVLNY ARLVKKADIG AIIYGWFEVL GRPIDVEQMP EEQRVAPIVN
LTSMAKLLDW TNGVDQFLRT GDASIIQALT AKENSEVFRN PSLSQAVKDE VKELRELTKR
LDQTEKAIRT CRSLQIDEEV QKFHEQLGRV RSASAEAIKP LVPLLDVMEK KYAMFDDDPI
MNSWKAVRWC LDHGLIQQAL TMLEENAVTA VCRVLGLDLR NEKARGDVHS AIEILLRDIP
KEEWRVRSVE RVEQIIDFLS PYKGDLKRFS TLKERRNDIN HAGARPQPLK AEKFWPDAEQ
SFRELGAFFE RMSALAKSMQ TVKGNG