Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ccel_3120 |
Symbol | |
ID | 7311714 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium cellulolyticum H10 |
Kingdom | Bacteria |
Replicon accession | NC_011898 |
Strand | - |
Start bp | 3654189 |
End bp | 3657254 |
Gene Length | 3066 bp |
Protein Length | 1021 aa |
Translation table | 11 |
GC content | 33% |
IMG OID | 643610023 |
Product | CRISPR-associated protein, Csn1 family |
Protein accession | YP_002507391 |
Protein GI | 220930482 |
COG category | [S] Function unknown |
COG ID | [COG3513] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | [TIGR01865] CRISPR-associated protein, Csn1 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.0115398 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAATATA CATTAGGTCT TGATGTTGGA ATTGCTTCTG TAGGTTGGGC GGTAATTGAT AAGGATAATA ATAAAATTAT TGACTTAGGA GTTAGATGTT TTGATAAAGC AGAGGAATCT AAAACGGGTG AGTCACTTGC AACAGCTAGA AGGATAGCTA GAGGTATGAG AAGAAGAATT TCGAGAAGAT CCCAAAGGCT ACGTTTAGTT AAAAAACTGT TTGTTCAATA TGAAATTATT AAAGATTCAA GTGAATTTAA CCGGATATTT GACACTTCGC GGGATGGGTG GAAAGATCCT TGGGAATTAA GGTATAATGC TTTATCAAGA ATACTTAAAC CTTATGAACT TGTTCAGGTA CTTACACATA TTACTAAGAG AAGAGGCTTT AAAAGTAACA GAAAGGAAGA CTTGTCTACT ACAAAAGAAG GTGTAGTTAT AACTAGTATT AAAAACAATT CTGAGATGCT TCGTACGAAA AATTACCGTA CTATTGGAGA AATGATTTTT ATGGAGACTC CTGAAAATAG TAACAAAAGG AATAAGGTAG ACGAGTATAT TCACACTATT GCCAGAGAAG ATCTTCTCAA TGAAATAAAA TATATATTTA GTATACAAAG AAAGCTTGGA AGCCCCTTTG TAACTGAAAA ATTAGAACAT GACTTTTTGA ATATATGGGA ATTTCAACGT CCTTTTGCCA GTGGTGATAG TATACTTTCA AAAGTGGGAA AGTGTACCCT GCTAAAGGAG GAGTTGAGAG CACCGACTTC CTGTTACACA TCAGAATATT TTGGATTACT TCAATCGATT AACAATCTAG TTTTGGTTGA AGATAACAAT ACATTAACAT TAAATAATGA TCAAAGAGCA AAAATAATAG AATATGCTCA TTTCAAGAAT GAAATCAAGT ATTCTGAAAT AAGAAAATTA TTAGATATTG AACCTGAAAT TTTATTCAAA GCACATAATT TGACACACAA AAATCCCTCA GGAAACAATG AGAGCAAAAA GTTTTATGAA ATGAAGTCTT ATCATAAACT GAAAAGCACA TTACCTACAG ACATCTGGGG GAAATTGCAT TCTAACAAGG AATCTCTTGA TAATCTTTTT TACTGCCTTA CGGTCTATAA AAACGATAAC GAAATAAAGG ACTATTTACA AGCGAATAAT CTTGATTATT TAATTGAATA TATAGCAAAA TTGCCAACTT TCAACAAATT TAAACATCTA TCTTTAGTTG CCATGAAAAG GATTATTCCG TTTATGGAAA AAGGGTATAA ATATAGTGAT GCCTGTAATA TGGCGGAATT AGATTTTACA GGTTCCAGCA AACTTGAAAA GTGTAATAAG TTAACTGTTG AACCAATTAT TGAGAATGTA ACTAATCCAG TTGTAATAAG GGCTCTGACG CAAGCAAGGA AAGTTATAAA TGCGATTATA CAGAAGTATG GTCTCCCGTA TATGGTAAAT ATAGAACTTG CACGTGAAGC GGGAATGACA CGTCAGGATA GAGATAATTT AAAAAAAGAA CATGAAAACA ACCGAAAAGC AAGGGAAAAA ATATCAGACC TAATACGCCA AAATGGTAGA GTTGCAAGTG GTCTGGATAT ACTGAAATGG CGTCTTTGGG AAGACCAGGG CGGAAGATGT GCTTATTCCG GCAAACCAAT TCCTGTTTGT GATTTATTGA ATGACTCACT GACTCAGATA GATCACATTT ATCCGTATAG TAGAAGTATG GACGATTCAT ATATGAATAA AGTTTTAGTT TTAACCGACG AAAATCAAAA TAAAAGAAGT TATACGCCAT ATGAAGTATG GGGGTCAACT GAAAAATGGG AGGATTTTGA GGCAAGAATA TATTCTATGC ATTTACCTCA AAGTAAAGAA AAAAGGCTTT TGAACAGAAA CTTTATTACA AAAGATTTGG ACTCATTTAT TTCAAGAAAT CTTAACGATA CTAGGTATAT ATCAAGGTTT TTAAAAAACT ATATAGAATC GTATCTGCAG TTTAGTAATG ATTCACCTAA GTCTTGTGTG GTCTGCGTTA ATGGTCAGTG TACCGCTCAG CTTAGAAGTA GATGGGGGTT AAATAAAAAT CGAGAAGAGT CAGACCTGCA TCATGCCCTT GATGCAGCAG TAATTGCGTG TGCAGATAGA AAAATAATTA AAGAAATAAC AAACTATTAT AATGAAAGAG AAAACCATAA TTATAAAGTT AAATATCCTT TGCCTTGGCA TTCGTTTAGG CAAGATTTAA TGGAAACGTT AGCAGGTGTT TTCATATCCC GGGCACCCAG AAGAAAAATT ACCGGGCCAG CCCATGATGA GACAATCAGA TCTCCCAAGC ATTTTAACAA AGGTTTAACT TCGGTAAAAA TACCGTTAAC AACAGTGACG TTGGAAAAAC TTGAGACAAT GGTAAAAAAT ACAAAAGGGG GAATTTCAGA TAAGGCAGTA TATAACGTTC TAAAAAATAG ATTAATAGAG CATAATAATA AGCCATTAAA AGCTTTTGCT GAAAAAATAT ATAAACCACT AAAAAATGGT ACAAACGGTG CAATAATTAG GAGTATTCGA GTTGAGACAC CATCATATAC AGGAGTATTC AGAAATGAAG GAAAGGGGAT ATCTGATAAT TCCTTAATGG TTAGGGTTGA TGTATTTAAG AAAAAAGATA AGTATTACCT TGTGCCAATA TATGTAGCAC ATATGATAAA AAAAGAGTTA CCTTCGAAAG CTATAGTTCC TCTGAAACCT GAATCTCAAT GGGAGTTAAT TGATAGTACT CATGAATTTC TTTTTTCACT ATACCAAAAT GATTACCTTG TTATAAAAAC TAAAAAGGGT ATAACTGAGG GCTATTATAG ATCTTGCCAC AGGGGTACCG GTAGCCTGAG CCTAATGCCT CATTTTGCTA ATAATAAGAA TGTTAAAATA GATATTGGAG TTAGGACAGC AATAAGTATT GAAAAATATA ATGTGGATAT ACTTGGTAAT AAAAGCATAG TAAAAGGAGA ACCAAGACGT GGGATGGAGA AATATAATAG TTTCAAATCC AACTAA
|
Protein sequence | MKYTLGLDVG IASVGWAVID KDNNKIIDLG VRCFDKAEES KTGESLATAR RIARGMRRRI SRRSQRLRLV KKLFVQYEII KDSSEFNRIF DTSRDGWKDP WELRYNALSR ILKPYELVQV LTHITKRRGF KSNRKEDLST TKEGVVITSI KNNSEMLRTK NYRTIGEMIF METPENSNKR NKVDEYIHTI AREDLLNEIK YIFSIQRKLG SPFVTEKLEH DFLNIWEFQR PFASGDSILS KVGKCTLLKE ELRAPTSCYT SEYFGLLQSI NNLVLVEDNN TLTLNNDQRA KIIEYAHFKN EIKYSEIRKL LDIEPEILFK AHNLTHKNPS GNNESKKFYE MKSYHKLKST LPTDIWGKLH SNKESLDNLF YCLTVYKNDN EIKDYLQANN LDYLIEYIAK LPTFNKFKHL SLVAMKRIIP FMEKGYKYSD ACNMAELDFT GSSKLEKCNK LTVEPIIENV TNPVVIRALT QARKVINAII QKYGLPYMVN IELAREAGMT RQDRDNLKKE HENNRKAREK ISDLIRQNGR VASGLDILKW RLWEDQGGRC AYSGKPIPVC DLLNDSLTQI DHIYPYSRSM DDSYMNKVLV LTDENQNKRS YTPYEVWGST EKWEDFEARI YSMHLPQSKE KRLLNRNFIT KDLDSFISRN LNDTRYISRF LKNYIESYLQ FSNDSPKSCV VCVNGQCTAQ LRSRWGLNKN REESDLHHAL DAAVIACADR KIIKEITNYY NERENHNYKV KYPLPWHSFR QDLMETLAGV FISRAPRRKI TGPAHDETIR SPKHFNKGLT SVKIPLTTVT LEKLETMVKN TKGGISDKAV YNVLKNRLIE HNNKPLKAFA EKIYKPLKNG TNGAIIRSIR VETPSYTGVF RNEGKGISDN SLMVRVDVFK KKDKYYLVPI YVAHMIKKEL PSKAIVPLKP ESQWELIDST HEFLFSLYQN DYLVIKTKKG ITEGYYRSCH RGTGSLSLMP HFANNKNVKI DIGVRTAISI EKYNVDILGN KSIVKGEPRR GMEKYNSFKS N
|
| |