Gene Cag_1010 details


Gene Information

Locus tag: Cag_1010
Symbol:
ID: 3746738
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium chlorochromatii CaD3
Kingdom: Bacteria
Replicon accession: NC_007514
Strand:
Start bp: 1349524
End bp: 1351998
Gene Length: 2475 bp
Protein Length: 824 aa
Translation table: 11
GC content: 40%
IMG OID: 637773539
Product: CRISPR-associated helicase Cas3 family protein
Protein accession: YP_379315
Protein GI: 78188977
COG category: [R] General function prediction only
COG ID: [COG1203] Predicted helicases
TIGRFAM ID: [TIGR01587] CRISPR-associated helicase Cas3
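
The gene and protein lengths listed above follow from the start and end coordinates. The sketch below reproduces that arithmetic. It assumes the coordinates are 1-based and inclusive and that the gene length includes the stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Number of residues for a CDS whose length includes one stop codon (assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(1349524, 1351998)
print(length)                     # 2475
print(protein_length_aa(length))  # 824
```

Both values agree with the Gene Length and Protein Length fields of this record.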


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGATGCCT TCGATAACAA ATTATATGCT CATACGCTCG AAGGTGTTAA GAATAAATCG 
CAGTGGCAAA CACTTAGAGA GCATGCGTTG TCAACAGCTC ATTTGGCTTC GGATTATGCG
ACTTCATTTG GGTTAGCGGA ATGTGGCTAC TGGCTTGGAC TTATTCACGA TCTTGGTAAA
AGTTTGCCTC AATTCCAGCA GCGTTTGGAA GATGATCGGG TCAAGGCAGA TCACAAACAT
GCTGGCGGTC TCTTTCTGTG GGATAAGCTC AATAATGGCA CCAAACCTTC TCACTTGGCA
GCACAGTGCT TAGCCTTGTG CGTTATTTCC CATCACGGTG GTTTGGTTGA TTGTTTAAAT
CAACTCGGCG AAGACAACTT TATTAATACG ATAGAAAACA AGCTGTATCA GGCAAATTTG
AAGGATTCAT TAGAGAACTT GCAGCTTGAT ACTGAACTTG AAGACAATAT CAAGAAAATT
TCAGGAGCAC GAAGCCTTGT ACAAGATGAA ATAGATTTCT TTTTCCAAAA TATTTTACAG
CAAGCAAAAA AATACTGGCC GACTGAGGAT GATAAAAACA AGCGAAAAAA GCTGGAGCTT
TTCAGAATAG GCTTGATGAC AAAAATGCTT TTTAGTTGTC TTATAGATGC TGATCATACC
GACACCGCCA ACTTTCATGA CGAAGAGCGC AAGAATAAAA ATCTGCCACA TTTGCCAAAA
TGGGATGAAT TGAGAGATAT GGTGGAACGG TATCTGGAAA CGCTGCCACA AACATCTTCA
GTGGATATTG AACGAAAAAG AATCTCTGAT AAATGTATTC AAGCCTCAGT CTGCGAATCA
GGAACTTACT TGCTCACTGT GCCGACTGGA GGCGGAAAAA CCCTCGCAAG CATGCGGTTC
GCTTTGCATC ATGCCGTCAA CAGAGAGCCA TATATCCCTT TCAAACGTAT CATCTATGTT
ATCCCTTATA CAACCATCAT TGAGCAGAAT GCACAAGCTA TAAGGAAAGT GTTTGTATCA
CAACTTAACG AAGATGTATT AAATGAGATG ATTCTTGAGA GCCATTCCAA TGTTCTGCCA
AATGAAGAAA ACCGAAATAA CCGTGTGCTT GCAGAAAACT GGGATGCGCC AATAATTTTT
ACGACTAATG TTCAATTTCT TGAGGCTTTT TACGGTGTAG GAACCCGTAA TGCGCGAAAA
TTGCACAACC TTGCAAACTC TATCATCATT TTTGATGAAG CACAAACATT ACCAGTGCGC
TGCCTCCATC TATTTTGTCA TGCCGTGAAT TTCCTTGTTG AACATTGCAA CTGCACGGCT
ATTTTATGCA CTGCAACACA ACCTCTTTTG CATGAAATAC CGGCAGAGCA TGGCGCTCTA
TGGCTCTCCA AAAATTTTCA AATTCTTCCC GATAAATTTC GTAAGGATTC AGCAGATTCA
CTCAAACGAG TAACTGTAAT TGATGAATGC AAGCCTCAAG GATGGAGACT TGAAGAAGTT
GCTGACAAGG TGTCATGTAT TCACAAACAA GGAAATAGTT GCATCATTAT TCTCAATACG
AAAGCTGACA CAAGGGAACT GTACACGATA CTCCGCAAAC GTCATGGTGA AGAGTTAACG
TATCACCTAA GCACAGCCAT GTGTGCCGCT CATAGGATGG ATATTCTATC AGAAGTAAAA
ACACTTCTTC GTAATAATCA ACCTGTTATT TGTGTTAGCA CACAATTAAT AGAAGCTGGA
ATAGATATTG ATTTTGATAC GGGTATCCGT GCTTTGGCAG GTATAGATTC AATAGCTCAG
GCTGCCGGAC GTATAAATAG AAATGGTAAA AAGCCGGCTG ATTCTGCTCT ATATATCCAA
AATATCTCTG GTGAAAACCT TAAAAACCTG CAAGACATTG CGGTTGCTCA AGTTGAAGCC
CAAAAAGTTC TTCGTGAGTT CAAAGAAAAT CCAAATGAGT TCGGCAATAG TCTTCTCTCA
GAAGCCGTTA TGAAGCGGTA CTTTAAATTC TACATCTTCA ACAGGAAAGA TGAGATGACT
TACAAGATCA AATCAGATAA TCTTGTGAAT CTACTTTCTT CAAATGTAAA CGCGGTTGGC
GAATACAAGC GAACGCACAA AAACCAGCCT TATCCTAATA TCTTACGTCA ATCTTTTGCA
ACAGCCGCGA GGGAGTTCAA GGTCATTAAT TCAGATACAC AAGGCATTTT TGTTCCCTAC
AACGACGAAG CAAGAGGATT ACTAAATCAA CTTCGCAACA CAAAATCTTC AGAATTTCAG
CGCTATTTAT TTCGCCGGTT ACAACGTTAC ACCGTCAATG TTTATCCTTA TATGCTCAAA
AAATTAACGA AGATACATGC GTTAGAACCC TTGTGCGAAA ACAGCGGGAT TCTGGCTCTT
TATGAAATTT TTTATGATTC CCGTTTTGGT GTGAATATTA ACTCAACCAT TTCACCAGAC
ATGCTTATAC AGTGA
 
Protein sequence
MDAFDNKLYA HTLEGVKNKS QWQTLREHAL STAHLASDYA TSFGLAECGY WLGLIHDLGK 
SLPQFQQRLE DDRVKADHKH AGGLFLWDKL NNGTKPSHLA AQCLALCVIS HHGGLVDCLN
QLGEDNFINT IENKLYQANL KDSLENLQLD TELEDNIKKI SGARSLVQDE IDFFFQNILQ
QAKKYWPTED DKNKRKKLEL FRIGLMTKML FSCLIDADHT DTANFHDEER KNKNLPHLPK
WDELRDMVER YLETLPQTSS VDIERKRISD KCIQASVCES GTYLLTVPTG GGKTLASMRF
ALHHAVNREP YIPFKRIIYV IPYTTIIEQN AQAIRKVFVS QLNEDVLNEM ILESHSNVLP
NEENRNNRVL AENWDAPIIF TTNVQFLEAF YGVGTRNARK LHNLANSIII FDEAQTLPVR
CLHLFCHAVN FLVEHCNCTA ILCTATQPLL HEIPAEHGAL WLSKNFQILP DKFRKDSADS
LKRVTVIDEC KPQGWRLEEV ADKVSCIHKQ GNSCIIILNT KADTRELYTI LRKRHGEELT
YHLSTAMCAA HRMDILSEVK TLLRNNQPVI CVSTQLIEAG IDIDFDTGIR ALAGIDSIAQ
AAGRINRNGK KPADSALYIQ NISGENLKNL QDIAVAQVEA QKVLREFKEN PNEFGNSLLS
EAVMKRYFKF YIFNRKDEMT YKIKSDNLVN LLSSNVNAVG EYKRTHKNQP YPNILRQSFA
TAAREFKVIN SDTQGIFVPY NDEARGLLNQ LRNTKSSEFQ RYLFRRLQRY TVNVYPYMLK
KLTKIHALEP LCENSGILAL YEIFYDSRFG VNINSTISPD MLIQ
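
The sequences above are printed in space-separated blocks of ten characters. The following is a minimal sketch of how such blocks could be joined, translated, and checked for GC content using only the Python standard library. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code and differs only in its permitted start codons. Alternative start codons are not handled here, which is acceptable for this gene because it begins with ATG. All function names are illustrative.

```python
from itertools import product

# Codon -> amino acid mapping in NCBI order (bases T, C, A, G; third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq_text: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(seq_text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a non-empty DNA string."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate in frame 1 and stop at the first stop codon."""
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First and last rows of the gene sequence shown above.
first_row = "ATGGATGCCT TCGATAACAA ATTATATGCT CATACGCTCG AAGGTGTTAA GAATAAATCG"
last_row = "ATGCTTATAC AGTGA"

print(translate(clean(first_row)))  # MDAFDNKLYAHTLEGVKNKS
print(translate(clean(last_row)))   # MLIQ
```

The two translated fragments match the first twenty and last four residues of the protein sequence. Passing the full cleaned gene sequence to `gc_fraction` would give a value to compare with the GC content field.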