Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ccel_2783 |
Symbol | |
ID | 7311410 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium cellulolyticum H10 |
Kingdom | Bacteria |
Replicon accession | NC_011898 |
Strand | - |
Start bp | 3340430 |
End bp | 3343894 |
Gene Length | 3465 bp |
Protein Length | 1154 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 643609682 |
Product | S-layer domain protein |
Protein accession | YP_002507061 |
Protein GI | 220930152 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.364691 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAATGGA AGGGTAAAAA TGTTTGTATT ATGGTACTTG TTTTTTGTTT TCTAATGATT GGCAATGCTT ACTCCAGTGC TAAGCAGTTC CCCGATGCAG TAAACCATTG GTCAGAGAAG GCGATCACCC GTCTAGCCAC AAAAGATGTA ATCAGCGGCT ATCCTGACGG AACAGTAAGA CCTGATGGAA TTGTCACACG AGGTCAGTTT GCTTCTATTC TGGCAAAAAG CTTGGGACTT GACACGTCAA AATCAGAAGA ATCTCAGCCC TTTAATGACA TCTATCACCA CTGGTCTGAG AGGAGCATCA AGGCCCTAGT TCAAAGCGGT ATTATTGCAA AGGCGGATTA TGAAGGAAAC TTCAAGCCTG ATAAGCCGAT TACCCGTATT GAAATTATAA GGATGATGGT AAGAGCAAGT GGAAAGAGTG ATGAAGCTAA AAGAACTAAC GACAATACAG GCTTTGCAGA CGAGGCTGAT ATAAGCAGAG CAGACAGGGG CTATGTAATA ATTGCAAAGC AAAACAATAT TATAGCCGGA TATCCTGATA ACACTCTTCG GCCGAAGAGC GAAGCCACAA GGGCGGAAGC GTTCCAGCTG ATCGATAACC AGATGAGACA AAACCAAAAG CTTGGGAACG AAGTACCGCC TATCGACGGT TCAGATGATT ATGGAAGCGG GGCAAGCAGC AGCCCTAACA GCTACCCTGA TGCTCAGATA GATTTTGAAT TGTCGAATAC GGCACATACT GACACAGAAA TCAGCATATC ACCTATTACC CAATACGCTC AGACACTTAA ATGGTCTCTT GCCAAGGAAA CTGAGGATGG AGTACAGGTC CCTGTCGATA TTTCACAGGT AATAAAAGGT TCATTGTCTC AAAGTGGAGG GAAAATCACA TTCAAGGAAA GTGGAAAATA TACCCTTACA GCTATTACCG AAAACTATAG CGGCAGAGAA ACGAAATGTT CAAAAGGTAT AACAGTGTAT CCGGTGTTTA TCCCTAAATT TGACCTTTCG GAATACAGTT ATATAGATGA AACAATCAAT ATCACGGTTA ATCCAGAGTT TAACGATGTT GACATTGTGT GGAGTGTAAC AAAAGAAGGA AAGGATGAAG CACTAGACAC CGTCATTGAT GGGAGTCTAA CCAATTCTGG AGGAAGCATT ACTTTTAAAG AAAAGGGCAC CTATGCACTT ACTGCGACTG TCACGGATAC AACAGGTAGG AGCTTTACTT GCAGTAAGGA GATTTTAGTC TATCCAGTAA TCAGCCCCGA ATTTGACCTT CCCGAATACA CTCATACGGA TAAAGTGGTC AATATTACGA TTGCTCCGAA GCTTGACGGC CTTGATGTAG TGTGGACTGC AACAAAGGAT GGCAAGGAGA CAGCAATAGA CAAAATTATT GACGGAAGCC TTACCAGTAC GGGAGGAAGC ATAACTTTTA AGGAAAAGGG TAGCTATGCA ATTACTGCCA CTGTCACCGA TGCCACAGGC AGAAGCTTTG CTTGCGGTAA GGGAATTTTA GTATATCCTG TTCCAAGTCT TGTTTTTAAG TACCCAGCTA CTGCATATAC AGACAGCAAT ATTATAATTA CTCAAACAGC AGAAATGGAC GGACTGATAG TGGAATGGCT GGTAGAAAGC ACGTCGGGCC CATTGGATTG GAATGATTAC ATTGACGGTA CACTGGACAA CGACGGCGGA ACCATTCAAT TCAAGCAGGA AGGTACGTAT CAGCTTACAG CCAAGGTAAC TGATAAAACA GGAAGAGTGT TTCTCTTAAA TTCTGAAAGC AGGATAGATG TGTATCCCGT ATCCGATATT AATATCACGC TTCCTGCTAA AGCCTATCCA GGAGAAACGG TTTCCGTCAA TGTGAGCGGA GATAATCTCA ATAACCTTAA ATGTCAATGG TCAATAGCAG CTGACGGAGG AAACCCCGAG GAATATGAAA GCCATGTCAG CGGTACCCTT TCCGATGATG GCGGTACGAT TACCTTTGCT AAAATGGGCA GCTATGCACT GACTGCTACC TTTACAGACA AACTAGGCAG GGCATTTACT TGCAGTAAAG TAATCACCAT ATATCCAATA CCTGATATGC AGATAAGCCT CCCTAAGCTA GCTTACAGCG GAGATGCTGT TTCAGTAGCC ACTGAAGAAA GCGGACTTAA GGGGCTGAAT GCCGTATGGA GTATTTCAAT TGACGGAGGT CCTGAAGTAC CATACAGACA GTATGCAAGC GATGTACTCA CAAGCACTGG TGGTGAAATC CGTATAAGTA CAAATAAAAC TATTGCAGTA AAGCTCACTG CTTCTGTAAC AGATGAAAAT AGCCGTACCT TTACATTTTC ATCTAATACA ATATCAATAA AGCCCAATAT AATCTGTTCA TTTGCAGCAC CATCCTCCGT ACATACCGGA GAAAGCTTTA GTGTCACTAT GGAGGAGGTT TCGGGCTTAG AGGAAAGCAA TATTACCTGG TCCCTGACCA AGGATGGCAG TTTGACTGAC TATACAGGCA GCCTTAGCAA CAATGGCGGG AATATTACTA TAAATGATGT CGGAGGTTAT ACACTGACAG CAGCTGTTAC GAACAGTGAG GGGAGAATAT TTTCGTATTC AGAAAATATA GCTGTTACAA ATACAGCTCC CAATGCACCA GAAGGATATG CAACAGTTAC TAGAACTGCT AAGGATCAAA TGCTCCTTGT AAATATTACA GCATCTGCAA CTGACCCTGA TGGGGATGAT GTTATCTATG AATATGAAGA CCAGAGTGAA GACGGTTATT ATCCTTTAGG TTCACATACT GTTAAGGTTA GGGCAAAGGA TTCCTTCGGG GCTGTTTCAA GCTGGACGGA AATTAACTTC AAGGTTGTCG GCTCGGCACC GTCTACACCT GTAATTACGA GAACACCAGA CGGTAACAGT GTTGCACCAA ATAAACCTAT AACCATAACA GCAGCGTCAA CAGATTTAGA CGGTGATACT ATTACCTATG TATGGGAGGG CAGACAAGCT GAGACCTCAG CATACCCTCT GGGTAGAAAT ACCATACGTG TTAAGGCTGT GGATTCAACA GGAATGGAAT CTCCATGGGC AGCAATAGTT TTCTTTGTTG CCGATTCAAA CAACGGCGGA GGAATGACAC TGAGTGGGCC TGAGTCAGTG ATAGTTGAAA AGGGAATAGA ATGTGCAACA ATTACGGAGT ATACCTTTTC TGTTCCGCCT GTTTTAGGAC ATTCAGGCAG TGACTATGGA CGTGTGCGAG GCTATAATAT TTTGACCGAT ACTTGGGATC AGTTGGATTA TCAATCTATA GCAAATGGGA TTACCCTCAG CCAGAAGCTT ACAGCAGGAG TGTATTCGCA ACTTGAATTC TACTATTATA CAAATCATGA CTGTATGTAC AACAAGAGTA ACATCACCTA CTCAGTATCC TATTATTTTG AATAA
|
Protein sequence | MKWKGKNVCI MVLVFCFLMI GNAYSSAKQF PDAVNHWSEK AITRLATKDV ISGYPDGTVR PDGIVTRGQF ASILAKSLGL DTSKSEESQP FNDIYHHWSE RSIKALVQSG IIAKADYEGN FKPDKPITRI EIIRMMVRAS GKSDEAKRTN DNTGFADEAD ISRADRGYVI IAKQNNIIAG YPDNTLRPKS EATRAEAFQL IDNQMRQNQK LGNEVPPIDG SDDYGSGASS SPNSYPDAQI DFELSNTAHT DTEISISPIT QYAQTLKWSL AKETEDGVQV PVDISQVIKG SLSQSGGKIT FKESGKYTLT AITENYSGRE TKCSKGITVY PVFIPKFDLS EYSYIDETIN ITVNPEFNDV DIVWSVTKEG KDEALDTVID GSLTNSGGSI TFKEKGTYAL TATVTDTTGR SFTCSKEILV YPVISPEFDL PEYTHTDKVV NITIAPKLDG LDVVWTATKD GKETAIDKII DGSLTSTGGS ITFKEKGSYA ITATVTDATG RSFACGKGIL VYPVPSLVFK YPATAYTDSN IIITQTAEMD GLIVEWLVES TSGPLDWNDY IDGTLDNDGG TIQFKQEGTY QLTAKVTDKT GRVFLLNSES RIDVYPVSDI NITLPAKAYP GETVSVNVSG DNLNNLKCQW SIAADGGNPE EYESHVSGTL SDDGGTITFA KMGSYALTAT FTDKLGRAFT CSKVITIYPI PDMQISLPKL AYSGDAVSVA TEESGLKGLN AVWSISIDGG PEVPYRQYAS DVLTSTGGEI RISTNKTIAV KLTASVTDEN SRTFTFSSNT ISIKPNIICS FAAPSSVHTG ESFSVTMEEV SGLEESNITW SLTKDGSLTD YTGSLSNNGG NITINDVGGY TLTAAVTNSE GRIFSYSENI AVTNTAPNAP EGYATVTRTA KDQMLLVNIT ASATDPDGDD VIYEYEDQSE DGYYPLGSHT VKVRAKDSFG AVSSWTEINF KVVGSAPSTP VITRTPDGNS VAPNKPITIT AASTDLDGDT ITYVWEGRQA ETSAYPLGRN TIRVKAVDST GMESPWAAIV FFVADSNNGG GMTLSGPESV IVEKGIECAT ITEYTFSVPP VLGHSGSDYG RVRGYNILTD TWDQLDYQSI ANGITLSQKL TAGVYSQLEF YYYTNHDCMY NKSNITYSVS YYFE
|
| |