Gene Clim_0441 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagClim_0441 
Symbol 
ID6354436 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium limicola DSM 245 
KingdomBacteria 
Replicon accessionNC_010803 
Strand
Start bp492171 
End bp493559 
Gene Length1389 bp 
Protein Length462 aa 
Translation table11 
GC content36% 
IMG OID642668072 
ProductCRISPR-associated CXXC_CXXC protein Cst1 
Protein accessionYP_001942513 
Protein GI189345984 
COG category 
COG ID 
TIGRFAM ID[TIGR01908] CRISPR-associated CXXC_CXXC protein Cst1 


Plasmid Coverage information

Num covering plasmid clones37 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTAGTT TATTTCAATA CACAGGAAAT CCTTTTGTGG ATGCGGGGAT TTCTGCACTC 
ACAAACTGGT GTGATAAAAA AACACCGCAA GAGCTAACTG AAGCGGATAT TAAAAAAGCC
TTGCCTGAAA TCGCTAATCT TTTTTCTCAA GGGGCATGGG TTAAAACTTT TTATACCACC
TTTTCTAACG GGGTGATGGT ACAGCCTTCA AATAAAGGGA AAGAAAGAGA AAAATGGTTG
GAGTTTATAG GTGATCTTGT AAAAGAATTA CAACCATTAG CTGACCATGG TTCTTGCGTT
GCTTGTGGAT GTCGAAATGC TATAAAAATT AAGAAAGAAA AAAGAGGACT GTTGAGAAGT
GAAGTCCCTA TGGCAAGCGG CTCACTTAAT TATTACTCTT TTGCTTCTAC TGGAGCCGAT
TATTGTGGGA CTTGTGCAAT TGCAATTCAA GTTTCACCTT TGGTTCTCTA TCGAAGTGGT
GGAAAGATGA TTTTGGTGCA TTCTAGCTCA GAGAAAGCTA TGTGCTCATG GGCAAAAATG
GCAATAAATG AAGTTCGATC TCAAATTAGC CTTAGGAATT ATACGGGATG TTTTACCGAA
AATTTTACAA ATCCACAAAA CGCCTTATTT CGTATTGCTA AAATACTCAT TCAGGATAAG
GATGATTGGA AGAGTGATCC GATTACTATT CGTATTTATT ATTTCACCAA CTATGGACAA
GGCGCAGAAT TGAAATACTA TGATTTGCCT AACCGGGTTT TTCATTTTTT AAATGAAGTT
CACCATAGTG AAGAATTGAA AGATTGGGAT AAAATTATCG GAAGTACATA CTTTTTTAAA
AAAAATAACT CTAAAATTTA TTTGAATACT GATGATAAAT CCGAGGAAGA ATACAAAAAT
AATAACAACG TTATTTATGA AGGCTTGTTG AAAGATGAGT GGATTGTTAA ATATTTTTAC
AACTTTCTGC AACGCAAGGC CTATGCGAAG TGGGAGCTTG TTCAACTGTA TTTAAAGGAG
GTTAGACAAA TGGATAAACA AAGAACAGAA GTGATTAAAA GGGTGGCTGA TGAAATATCA
TTAGTGATTC AAAGAGACGA ATCACATAAT CCAAAACGTC TGTGGCAGCT TGAGCGAGCA
AACAGCTATG GCACTTTTCG CAACGTTCTA CGCCTAATAA TAAAGGATCG TATTAAAAAT
GGTGCCGAGC GTCCGTTGTT CAGTATTGAA GATTATACAG AGAGACTTTT TCCTGATGGA
GCGCTTTGTT GGCGGGAAAC TCAAGACCTT ATTCTCTTTC GCTTGTACGA GATGCTACAT
GGCTGGTTAA AAGAAAGAGA TATTGTAATA GATGAAGTTG AAGAAAACAG TACAACTGAA
ATTGAATAA
 
Protein sequence
MSSLFQYTGN PFVDAGISAL TNWCDKKTPQ ELTEADIKKA LPEIANLFSQ GAWVKTFYTT 
FSNGVMVQPS NKGKEREKWL EFIGDLVKEL QPLADHGSCV ACGCRNAIKI KKEKRGLLRS
EVPMASGSLN YYSFASTGAD YCGTCAIAIQ VSPLVLYRSG GKMILVHSSS EKAMCSWAKM
AINEVRSQIS LRNYTGCFTE NFTNPQNALF RIAKILIQDK DDWKSDPITI RIYYFTNYGQ
GAELKYYDLP NRVFHFLNEV HHSEELKDWD KIIGSTYFFK KNNSKIYLNT DDKSEEEYKN
NNNVIYEGLL KDEWIVKYFY NFLQRKAYAK WELVQLYLKE VRQMDKQRTE VIKRVADEIS
LVIQRDESHN PKRLWQLERA NSYGTFRNVL RLIIKDRIKN GAERPLFSIE DYTERLFPDG
ALCWRETQDL ILFRLYEMLH GWLKERDIVI DEVEENSTTE IE