Gene Ccel_3120 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCcel_3120 
Symbol 
ID7311714 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium cellulolyticum H10 
KingdomBacteria 
Replicon accessionNC_011898 
Strand
Start bp3654189 
End bp3657254 
Gene Length3066 bp 
Protein Length1021 aa 
Translation table11 
GC content33% 
IMG OID643610023 
ProductCRISPR-associated protein, Csn1 family 
Protein accessionYP_002507391 
Protein GI220930482 
COG category[S] Function unknown 
COG ID[COG3513] Uncharacterized protein conserved in bacteria 
TIGRFAM ID[TIGR01865] CRISPR-associated protein, Csn1 family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0115398 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAATATA CATTAGGTCT TGATGTTGGA ATTGCTTCTG TAGGTTGGGC GGTAATTGAT 
AAGGATAATA ATAAAATTAT TGACTTAGGA GTTAGATGTT TTGATAAAGC AGAGGAATCT
AAAACGGGTG AGTCACTTGC AACAGCTAGA AGGATAGCTA GAGGTATGAG AAGAAGAATT
TCGAGAAGAT CCCAAAGGCT ACGTTTAGTT AAAAAACTGT TTGTTCAATA TGAAATTATT
AAAGATTCAA GTGAATTTAA CCGGATATTT GACACTTCGC GGGATGGGTG GAAAGATCCT
TGGGAATTAA GGTATAATGC TTTATCAAGA ATACTTAAAC CTTATGAACT TGTTCAGGTA
CTTACACATA TTACTAAGAG AAGAGGCTTT AAAAGTAACA GAAAGGAAGA CTTGTCTACT
ACAAAAGAAG GTGTAGTTAT AACTAGTATT AAAAACAATT CTGAGATGCT TCGTACGAAA
AATTACCGTA CTATTGGAGA AATGATTTTT ATGGAGACTC CTGAAAATAG TAACAAAAGG
AATAAGGTAG ACGAGTATAT TCACACTATT GCCAGAGAAG ATCTTCTCAA TGAAATAAAA
TATATATTTA GTATACAAAG AAAGCTTGGA AGCCCCTTTG TAACTGAAAA ATTAGAACAT
GACTTTTTGA ATATATGGGA ATTTCAACGT CCTTTTGCCA GTGGTGATAG TATACTTTCA
AAAGTGGGAA AGTGTACCCT GCTAAAGGAG GAGTTGAGAG CACCGACTTC CTGTTACACA
TCAGAATATT TTGGATTACT TCAATCGATT AACAATCTAG TTTTGGTTGA AGATAACAAT
ACATTAACAT TAAATAATGA TCAAAGAGCA AAAATAATAG AATATGCTCA TTTCAAGAAT
GAAATCAAGT ATTCTGAAAT AAGAAAATTA TTAGATATTG AACCTGAAAT TTTATTCAAA
GCACATAATT TGACACACAA AAATCCCTCA GGAAACAATG AGAGCAAAAA GTTTTATGAA
ATGAAGTCTT ATCATAAACT GAAAAGCACA TTACCTACAG ACATCTGGGG GAAATTGCAT
TCTAACAAGG AATCTCTTGA TAATCTTTTT TACTGCCTTA CGGTCTATAA AAACGATAAC
GAAATAAAGG ACTATTTACA AGCGAATAAT CTTGATTATT TAATTGAATA TATAGCAAAA
TTGCCAACTT TCAACAAATT TAAACATCTA TCTTTAGTTG CCATGAAAAG GATTATTCCG
TTTATGGAAA AAGGGTATAA ATATAGTGAT GCCTGTAATA TGGCGGAATT AGATTTTACA
GGTTCCAGCA AACTTGAAAA GTGTAATAAG TTAACTGTTG AACCAATTAT TGAGAATGTA
ACTAATCCAG TTGTAATAAG GGCTCTGACG CAAGCAAGGA AAGTTATAAA TGCGATTATA
CAGAAGTATG GTCTCCCGTA TATGGTAAAT ATAGAACTTG CACGTGAAGC GGGAATGACA
CGTCAGGATA GAGATAATTT AAAAAAAGAA CATGAAAACA ACCGAAAAGC AAGGGAAAAA
ATATCAGACC TAATACGCCA AAATGGTAGA GTTGCAAGTG GTCTGGATAT ACTGAAATGG
CGTCTTTGGG AAGACCAGGG CGGAAGATGT GCTTATTCCG GCAAACCAAT TCCTGTTTGT
GATTTATTGA ATGACTCACT GACTCAGATA GATCACATTT ATCCGTATAG TAGAAGTATG
GACGATTCAT ATATGAATAA AGTTTTAGTT TTAACCGACG AAAATCAAAA TAAAAGAAGT
TATACGCCAT ATGAAGTATG GGGGTCAACT GAAAAATGGG AGGATTTTGA GGCAAGAATA
TATTCTATGC ATTTACCTCA AAGTAAAGAA AAAAGGCTTT TGAACAGAAA CTTTATTACA
AAAGATTTGG ACTCATTTAT TTCAAGAAAT CTTAACGATA CTAGGTATAT ATCAAGGTTT
TTAAAAAACT ATATAGAATC GTATCTGCAG TTTAGTAATG ATTCACCTAA GTCTTGTGTG
GTCTGCGTTA ATGGTCAGTG TACCGCTCAG CTTAGAAGTA GATGGGGGTT AAATAAAAAT
CGAGAAGAGT CAGACCTGCA TCATGCCCTT GATGCAGCAG TAATTGCGTG TGCAGATAGA
AAAATAATTA AAGAAATAAC AAACTATTAT AATGAAAGAG AAAACCATAA TTATAAAGTT
AAATATCCTT TGCCTTGGCA TTCGTTTAGG CAAGATTTAA TGGAAACGTT AGCAGGTGTT
TTCATATCCC GGGCACCCAG AAGAAAAATT ACCGGGCCAG CCCATGATGA GACAATCAGA
TCTCCCAAGC ATTTTAACAA AGGTTTAACT TCGGTAAAAA TACCGTTAAC AACAGTGACG
TTGGAAAAAC TTGAGACAAT GGTAAAAAAT ACAAAAGGGG GAATTTCAGA TAAGGCAGTA
TATAACGTTC TAAAAAATAG ATTAATAGAG CATAATAATA AGCCATTAAA AGCTTTTGCT
GAAAAAATAT ATAAACCACT AAAAAATGGT ACAAACGGTG CAATAATTAG GAGTATTCGA
GTTGAGACAC CATCATATAC AGGAGTATTC AGAAATGAAG GAAAGGGGAT ATCTGATAAT
TCCTTAATGG TTAGGGTTGA TGTATTTAAG AAAAAAGATA AGTATTACCT TGTGCCAATA
TATGTAGCAC ATATGATAAA AAAAGAGTTA CCTTCGAAAG CTATAGTTCC TCTGAAACCT
GAATCTCAAT GGGAGTTAAT TGATAGTACT CATGAATTTC TTTTTTCACT ATACCAAAAT
GATTACCTTG TTATAAAAAC TAAAAAGGGT ATAACTGAGG GCTATTATAG ATCTTGCCAC
AGGGGTACCG GTAGCCTGAG CCTAATGCCT CATTTTGCTA ATAATAAGAA TGTTAAAATA
GATATTGGAG TTAGGACAGC AATAAGTATT GAAAAATATA ATGTGGATAT ACTTGGTAAT
AAAAGCATAG TAAAAGGAGA ACCAAGACGT GGGATGGAGA AATATAATAG TTTCAAATCC
AACTAA
 
Protein sequence
MKYTLGLDVG IASVGWAVID KDNNKIIDLG VRCFDKAEES KTGESLATAR RIARGMRRRI 
SRRSQRLRLV KKLFVQYEII KDSSEFNRIF DTSRDGWKDP WELRYNALSR ILKPYELVQV
LTHITKRRGF KSNRKEDLST TKEGVVITSI KNNSEMLRTK NYRTIGEMIF METPENSNKR
NKVDEYIHTI AREDLLNEIK YIFSIQRKLG SPFVTEKLEH DFLNIWEFQR PFASGDSILS
KVGKCTLLKE ELRAPTSCYT SEYFGLLQSI NNLVLVEDNN TLTLNNDQRA KIIEYAHFKN
EIKYSEIRKL LDIEPEILFK AHNLTHKNPS GNNESKKFYE MKSYHKLKST LPTDIWGKLH
SNKESLDNLF YCLTVYKNDN EIKDYLQANN LDYLIEYIAK LPTFNKFKHL SLVAMKRIIP
FMEKGYKYSD ACNMAELDFT GSSKLEKCNK LTVEPIIENV TNPVVIRALT QARKVINAII
QKYGLPYMVN IELAREAGMT RQDRDNLKKE HENNRKAREK ISDLIRQNGR VASGLDILKW
RLWEDQGGRC AYSGKPIPVC DLLNDSLTQI DHIYPYSRSM DDSYMNKVLV LTDENQNKRS
YTPYEVWGST EKWEDFEARI YSMHLPQSKE KRLLNRNFIT KDLDSFISRN LNDTRYISRF
LKNYIESYLQ FSNDSPKSCV VCVNGQCTAQ LRSRWGLNKN REESDLHHAL DAAVIACADR
KIIKEITNYY NERENHNYKV KYPLPWHSFR QDLMETLAGV FISRAPRRKI TGPAHDETIR
SPKHFNKGLT SVKIPLTTVT LEKLETMVKN TKGGISDKAV YNVLKNRLIE HNNKPLKAFA
EKIYKPLKNG TNGAIIRSIR VETPSYTGVF RNEGKGISDN SLMVRVDVFK KKDKYYLVPI
YVAHMIKKEL PSKAIVPLKP ESQWELIDST HEFLFSLYQN DYLVIKTKKG ITEGYYRSCH
RGTGSLSLMP HFANNKNVKI DIGVRTAISI EKYNVDILGN KSIVKGEPRR GMEKYNSFKS
N