Gene GYMC61_1194 details

Gene Information

Locus tag: GYMC61_1194
Symbol:
ID: 8525033
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacillus sp. Y412MC61
Kingdom: Bacteria
Replicon accession: NC_013411
Strand:
Start bp: 1216193
End bp: 1217833
Gene Length: 1641 bp
Protein Length: 546 aa
Translation table: 11
GC content: 51%
IMG OID:
Product: CRISPR-associated protein, Crm2 family
Protein accession: YP_003252329
Protein GI: 261418647
COG category:
COG ID:
TIGRFAM ID:
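
The coordinates and lengths listed in this record are mutually consistent, which a short sketch can confirm. It assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 1216193
end_bp = 1217833

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints: 1641 546
```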


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACCAATC GCTATGTTGT CATTTTTACG GTCGGGCCTG TGCAGTCGTT TATCGCTTCA 
GCCCGCAAAA CAGAAGATTT TTGGAGCGGA AGTTATATTT TGTCCCATCT CGTGCGGGAG
GCGATCAAGC GGTTTTATCA ATTTGACGCC AACTGCGAAA TGATCTATCC GCTTGTTACA
GAGAAAGAAC TGAGATCCCC TTCCCAACGG GATGTTCACA TTGCGTCGAT CCCGAATCGC
CTTACGGTGA TCATGGAGGG AACAGAAGAA GAAGTTTGCG GTTGGCTGCG CGAAGCGGAG
CAAGACGTTT GCAATGTGTT TCTCGATTTT TGCTTTCAAG CCGTCCGGCG CGTATTTCCG
CATCTCACTG ATCATGAGCA GGAATTACTG GAACGGATGG TCGAACAGCA AGTGCGGTCG
TTTTTAGAAA TCTACTGGGT AGCTGAACCG TATGAACCGT CTATGTCGTT CCGTGAGGTG
CGTGAACGTG CCGAACGGCG GTTAGGGGCG CTGAAAAACG AAAAACATTA CATTCCCGTT
TCTCAGCAGG GGCTCGTTTG TTCTGTATGC AAGGAACGTG AGGCGCTGCG TCTTGAGGAA
ATCGGGGAAT TTGATCATTA CGGCGAGATC AAAAAGAAAA CGATGAGCCT TTGGCAACGG
CGCGCCGTGA AATTTCAAGG AACGGGGGGT GATGAAGAGG ACGACCGCCG CGCGGGGCGC
ATCAAAGACA ATGAGTTTTT ATGCGGCATT TGTTTAGGAA AACGAGCGGC GCGCGACTAC
TTCGCGGACT TGTTCGGTAC ATCGTTCCCA TCTTTTCCCT CTGTCCTCGA CATCGGTCAA
GGAGATTATT ACGCGGTGTT GCTGATGGAT GGGGACGACA TGGGGAAGTG GTTTTCTGGG
GAACGGAAAG ATGAGTATAG CCGAACGAGC CAGAAACTGG CCCGTTTTGC CAAAGAAACC
GTTCCGTGGA TTGTGCAAGA ACGATGCAAA GGCAAACTTG TCTATGCGGG GGGTGACGAT
GTCCTTGCTT TTGTGCCAGT GGAGACGGCG TTGAAAGCGG CGGAGATGCT TCGATTGGCG
TTTGCCGACG AACGTCAAGG ATTAGGGAGC GGCGCAACCG CTTCGTTTGG AGTGGTCATC
GCCCATAAAA AAGCACCGTT GCAGCACGTG CTCAACGCCG CAAGGGCATT GGAACAAAAG
GCGAAACGGT ATTACAATGA CAAGACGGCG CAGAGAAAAG ACGCCCTAGC CTTAGCGGTG
CATACCCGCT CCGGAGAGAT TTCCGAAGCG GTTCTCCCGT GGATCGTTGG GGAGAAGATG
GTTTCAAAAC TATTGGAAGA ATGGCTTGGG CTATTGAAAA CATCGCTGTC ACCGAACTTT
ATTTTTCATT TTGCGTCAGC GTTTGCCCCG CTGTTGCATG AAAAAGACTG CCCGAAATGG
GAGAACCGCG ACATGCTCGC GACAGAGCTC CGCCGTCTGC TTCGGCGCTC CGTGAAAGAG
GGGAGTTGCC TAGATGCCCA AGACATCGCC CGCTATACAA GTACGCTTTT GGTTTTGCAT
GAAGCGGTTC GGAGCAGTTA TGACTTTTTG CATTTGTTAA AGATGTTGAC CTTTTTCAAA
CGAAGCAAGG GGAGAGGTTG A
 
Protein sequence
MTNRYVVIFT VGPVQSFIAS ARKTEDFWSG SYILSHLVRE AIKRFYQFDA NCEMIYPLVT 
EKELRSPSQR DVHIASIPNR LTVIMEGTEE EVCGWLREAE QDVCNVFLDF CFQAVRRVFP
HLTDHEQELL ERMVEQQVRS FLEIYWVAEP YEPSMSFREV RERAERRLGA LKNEKHYIPV
SQQGLVCSVC KEREALRLEE IGEFDHYGEI KKKTMSLWQR RAVKFQGTGG DEEDDRRAGR
IKDNEFLCGI CLGKRAARDY FADLFGTSFP SFPSVLDIGQ GDYYAVLLMD GDDMGKWFSG
ERKDEYSRTS QKLARFAKET VPWIVQERCK GKLVYAGGDD VLAFVPVETA LKAAEMLRLA
FADERQGLGS GATASFGVVI AHKKAPLQHV LNAARALEQK AKRYYNDKTA QRKDALALAV
HTRSGEISEA VLPWIVGEKM VSKLLEEWLG LLKTSLSPNF IFHFASAFAP LLHEKDCPKW
ENRDMLATEL RRLLRRSVKE GSCLDAQDIA RYTSTLLVLH EAVRSSYDFL HLLKMLTFFK
RSKGRG
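
The relationship between the gene sequence and the protein sequence can be spot-checked with a minimal translation sketch. It assumes that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two tables differ in which codons may serve as start codons), and it stops at the first stop codon. The names `CODON_TABLE` and `translate` are hypothetical helpers written for this illustration, not part of any database tooling.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons enumerated in TCAG order ("*" marks a stop codon).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    # Remove the spaces and line breaks used to group the sequence in blocks of 10.
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence as listed in this record.
first_line = "ATGACCAATC GCTATGTTGT CATTTTTACG GTCGGGCCTG TGCAGTCGTT TATCGCTTCA"
last_line = "CGAAGCAAGG GGAGAGGTTG A"

print(translate(first_line))  # prints: MTNRYVVIFTVGPVQSFIAS
print(translate(last_line))   # prints: RSKGRG
```

Both results match the corresponding ends of the protein sequence listed here; translating the last line on its own works only because it begins on a codon boundary (1620 preceding bases, a multiple of 3).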