Gene Clim_0929 details

Gene Information

Locus tag: Clim_0929
Symbol:
ID: 6354166
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium limicola DSM 245
Kingdom: Bacteria
Replicon accession: NC_010803
Strand:
Start bp: 1012746
End bp: 1014941
Gene Length: 2196 bp
Protein Length: 731 aa
Translation table: 11
GC content: 53%
IMG OID: 642668556
Product: CRISPR-associated protein Cas1
Protein accession: YP_001942987
Protein GI: 189346458
COG category: [L] Replication, recombination and repair
COG ID: [COG1518] Uncharacterized protein predicted to be involved in DNA repair
        [COG3344] Retron-type reverse transcriptase
TIGRFAM ID: [TIGR00287] CRISPR-associated endonuclease Cas1
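
The coordinate and length fields in this record are related by simple arithmetic, which can be cross-checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are illustrative only and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(1012746, 1014941)
print(length)                     # 2196
print(protein_length_aa(length))  # 731
```

Both results agree with the Gene Length and Protein Length fields listed for this gene.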


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.400864
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGGATGGC TCTACAACCA GATGGCAATG CCCGAAACTA TTTTTCAGGC ATGGTACAAG 
GTGGCTTCGA ACGACGGCCG TCCCGGATGG GATAACAAAT CCATCGAGGA CTACTCCCTG
CAGCTTGAAG AGAACCTTAA AGCCCTATCG CAAGCTCTTC TGACAGGCAC CTACAAACAG
GGTCCGTTGA TGAAACTCGT GTTGCTGAAA CCTGATGGAA AGGATCGGGT TCTTTTGATA
CCAGGCGTAA TGGACAGGGT TGCCCAGACT GCGGCGGCAA TCGTGCTGAG CCCGATCATC
GAAGCCGAAC TGGGTAACTG TACCTTTGCC TACCGTCCCG GCATATCGCG CGAAGGAGCT
GCACGGGAGA TTGACCGGTT GCACCGTGAA GGGTATCAGT GGGTGCTCGA TGCAGATATC
CGCAGCTTTT TCGACAACGT CCGCCATGAC CTGCTCTTCC AGAGGCTGGT TGAGCTTATC
GACGACAAGG AAATGATTTC GCTGCTGCAC CGGTGGCTTA CCGCTGAAAT CGTTGACGGC
ATCAATCCCC GCATACAAAA TACCATGGGC CTGCCACAGG GATGCCCGAT CTCACCGGCT
CTGGCCAATC TCTATCTCGA CCGCTTCGAT GAAACAATGG AAAAAGAAGG GTTCAAACTG
GTTCGCTTCG CCGACGATTA TCTCGTGCTC TGCAAAACCC GTCCCAAAGC CGAAGCCGCG
CTGAAGCTTT CGGAAACCGC GCTTGCCGAA CTGAAACTTG AACTGCACAG CGATAAAACC
CGTATTACCA CCTTTGCCGA AGGGTTCAAG TATCTCGGTT ACCTCTTTAT TCGCGCACTG
GTTATTCCCA CCAAAATGCA CCCCGAAGAG TGGTACGACA AGCTCGGCAA GTTCAAGCTT
CGCAAAAAGA GCGAACATGC CCTGCCCTCC GACCCCGACG CAATGACCGG CGAAACAGCA
AAGTTCGAGC TCGAAACCGA TCAGGGCGAA AAAATAGAAC TCACCAAAAA CGAGCTTCTG
CAAACCGAGT TCGGGTGCAA GCTGCTTGAA AGTCTCGATA AAAAACAGTT GAGCGTTGAC
GAGTTTCTCG AAAAAGTTGC CCGGCAGGAC GAAGAACGGC AGAAAGAGAA GCGCGATGCG
CTGAAAAAAC TCTATTCACC TTTTCTGAAC ACCCTCTACC TGCAGGAGCA GGGGAGTCTC
ATGCGCAAGG ACGGAGAGCG GTTCAGCATT GAAAAGGATG GAGCGGTCAT CAACGAAGTG
ATCGTCCGCC GCATCGAACA GGTTGTGGTG TTCGGTAATA TCGCCCTCAC CACTCCGGTC
ATGCAGTACT GCATGCAAAA CGAAATTCCG GTTACCTTTC TTTCGCAGCA TGGCAAATAC
TTCGGCAGGC TTGAATCGAC CATGGCCGAC AATGCCGAAA TGCAGCGCTA TCATTTTCTT
CGTTCCATCG ACGAACCCTT TGCGCTTGAA ACCGCCCGCT CCATCGTTTC GGCAAAAATC
GGCAACAGCA GGACCATGAT TCGCCGCCGA AAATCCGTCA TGCAGGATTG CGACGGCACG
CTGCAGAGTA AAATGACCTG CAACCTCGAC ATCATGGCCG ATCTTCTCCT CAAGGCCGAA
ACCTCCACAG ACATCGACGT ACTGCGAGGT CTCGAAGGCA AGGCTTCGGC TCTCTACTTC
GAGTGTTACG GCATGTTCTT CAGCAAAAAC CTGCCGTTCC ATACCGCTTC GTTTCTGCGG
GTTCGACGTC CGCCAACCGA TCCTGTCAAC AGCCTGCTCA GTTTCGGTTA CTCGCTTTTG
CATACCAACG TATTCTCAAT GGTGCAAATG AGCGGGCTCA ACCCCTATAT CGGCTTTCTC
CATGCAGAAC GAAAAGGCAA TCCCGCCCTG GTCAACGATC TCGTCGAAGA GTTCCGCACC
GTAATCGATT CTCTGGTGCT CTATACCATC AACAGGGGTC TTCTGCATGA GAACGACTTC
TACTATCGCA AAGATCAGCC GGGCTGCTTT CTCTCGAACG ACGCCCGAAA ACGTTTTTTA
CAGATTTTCG AAACAAGAAT GTGGCAGGAA TCCCGGGACG GCTACACCGG CAAAACGCTC
AACTTTCGGC GGCACATAGA AAAGCAGGTG AGAATCATGC GGGATGTCAT ATCCGGAACC
CGAACGCAGT ACGATCCGTA CAAGCTGGTA TGGTGA
 
Protein sequence
MGWLYNQMAM PETIFQAWYK VASNDGRPGW DNKSIEDYSL QLEENLKALS QALLTGTYKQ 
GPLMKLVLLK PDGKDRVLLI PGVMDRVAQT AAAIVLSPII EAELGNCTFA YRPGISREGA
AREIDRLHRE GYQWVLDADI RSFFDNVRHD LLFQRLVELI DDKEMISLLH RWLTAEIVDG
INPRIQNTMG LPQGCPISPA LANLYLDRFD ETMEKEGFKL VRFADDYLVL CKTRPKAEAA
LKLSETALAE LKLELHSDKT RITTFAEGFK YLGYLFIRAL VIPTKMHPEE WYDKLGKFKL
RKKSEHALPS DPDAMTGETA KFELETDQGE KIELTKNELL QTEFGCKLLE SLDKKQLSVD
EFLEKVARQD EERQKEKRDA LKKLYSPFLN TLYLQEQGSL MRKDGERFSI EKDGAVINEV
IVRRIEQVVV FGNIALTTPV MQYCMQNEIP VTFLSQHGKY FGRLESTMAD NAEMQRYHFL
RSIDEPFALE TARSIVSAKI GNSRTMIRRR KSVMQDCDGT LQSKMTCNLD IMADLLLKAE
TSTDIDVLRG LEGKASALYF ECYGMFFSKN LPFHTASFLR VRRPPTDPVN SLLSFGYSLL
HTNVFSMVQM SGLNPYIGFL HAERKGNPAL VNDLVEEFRT VIDSLVLYTI NRGLLHENDF
YYRKDQPGCF LSNDARKRFL QIFETRMWQE SRDGYTGKTL NFRRHIEKQV RIMRDVISGT
RTQYDPYKLV W