Gene GWCH70_0310 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGWCH70_0310 
Symbol 
ID7977428 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacillus sp. WCH70 
KingdomBacteria 
Replicon accessionNC_012793 
Strand
Start bp356716 
End bp357717 
Gene Length1002 bp 
Protein Length333 aa 
Translation table11 
GC content41% 
IMG OID644797303 
ProductCRISPR-associated protein Cas1 
Protein accessionYP_002948503 
Protein GI239825879 
COG category[L] Replication, recombination and repair 
COG ID[COG1518] Uncharacterized protein predicted to be involved in DNA repair 
TIGRFAM ID[TIGR00287] CRISPR-associated endonuclease Cas1
[TIGR03641] CRISPR-associated endonuclease Cas1, HMARI/TNEAP subtype 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value0.753182 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAAGA CGCTGTATAT TTTTCAAAGC GGAGAGCTTC GCCGCAAAGA CAATAGTTTG 
TATTTCGAAA CAGAAGAGCG GAAACGATAT ATTCCTGTAG AGAACACTAA CGATATTTAT
ATTTTCGGCG AAGTAGATGT ATCAAAAAAG TTTCTTGAGT TTGCGGCGCA AAAAGAGATT
ATTGTTCATT ATTTCAACCA TCATGGGTAT TATGTCGGCT CTTTTTATCC ACGGGAACAC
TTAAATGCTG GTCATGTAAT TTTAAAACAA GCGGAACATT ACATTGATTC GAACAAACGG
CTCGATTTAG CACAGCGGTT TGTGGAAGGT GCTATTCAGC AAATGACGAG AGTAATCAAA
TACTACCGAA CTCGCCTTTC TGAGGCAGAT GTGTTAGCCC GTGCGTTAGA AGTGATCGAA
CAAGAAACAG AACGGATGAG ACAAGCGCAG TCGGTGGAGC AGCTAATGGC AGCGGAAGGG
CATATTCGGG AAATGTATTA TTCCACCTTT GATGTCATTA TTCGTCATCC CGATTTCGTG
TTTGAAAAAC GGACGAAGCG CCCTCCGCAT AACCGTCTAA ATGCTCTTAT TAGCTTTGGT
AATTCGATTG TGTATACGAT GTGTCTTAGC GAAATTTACA AAACGTATTT AGACCCACGG
ATCGGTTTTT TGCACAGCAC GAATTTTCGA CGGTTTTCAC TCAATCTCGA TGTGGCAGAA
ATTTTTAAAC CGATTATCGT GGACCGCCTT ATTTTCACTC TTGTGAATAA GAAAATGCTA
TCGGCAAAGC ACTTTGAAAA ATTGACAGAT GGGATTTTGC TTAATGAAGA AGGACGAAAA
TTATTCGTCA GTGAACTTGA GAAAAAAATG CAGACGACGG TGCAGCACCG TCACCTTGGC
AAATCCGTTT CCTATCGCCG ATTGATTCGG CTAGAGCTAT ATAAAATTCA AAAACATTTG
TTAGGTGAAA AAACGTACGA GCCATACGCG GCGAAATGGT AG
 
Protein sequence
MKKTLYIFQS GELRRKDNSL YFETEERKRY IPVENTNDIY IFGEVDVSKK FLEFAAQKEI 
IVHYFNHHGY YVGSFYPREH LNAGHVILKQ AEHYIDSNKR LDLAQRFVEG AIQQMTRVIK
YYRTRLSEAD VLARALEVIE QETERMRQAQ SVEQLMAAEG HIREMYYSTF DVIIRHPDFV
FEKRTKRPPH NRLNALISFG NSIVYTMCLS EIYKTYLDPR IGFLHSTNFR RFSLNLDVAE
IFKPIIVDRL IFTLVNKKML SAKHFEKLTD GILLNEEGRK LFVSELEKKM QTTVQHRHLG
KSVSYRRLIR LELYKIQKHL LGEKTYEPYA AKW