Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ccel_2355 |
Symbol | |
ID | 7311027 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium cellulolyticum H10 |
Kingdom | Bacteria |
Replicon accession | NC_011898 |
Strand | - |
Start bp | 2773479 |
End bp | 2776529 |
Gene Length | 3051 bp |
Protein Length | 1016 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | 643609281 |
Product | protein of unknown function DUF214 |
Protein accession | YP_002506669 |
Protein GI | 220929760 |
COG category | [V] Defense mechanisms |
COG ID | [COG0577] ABC-type antimicrobial peptide transport system, permease component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGATTAATT TGCTGTATAA AAACACTCTT AAAAAAATAA AGAAATCCTT TGGAAGGTAT ATTTCCCTTT TTATTATAGT CATGATAGGT GTTGGTTTTT TTGCGGGAAT ACAAGCCACT ACGCCTGATA TTATAGACGT AGCGGATAAA TATTATAAAG AAAGTAATCT TTTGGACTTT AAAGTAGTCA GTACATTAGG TTTGACTGAC CAAGATGTTA ATGCAATTAA GAAACTTAGT AATGTAGGTG CAGTTATACC AAGCTATTCC CTAGATGTTT TGGATACGGA TAAAGCTACC AGAGTTCACG CAATAGAAGA TAAGGTTAAT ACCTTCAAGC TTGAGGACGG CAGAATGCCT CAAAAACATG ATGAGTGTAT TGCAGATAGT AAAACTTACA AAATTGGCGA CAAAATAAAA ATTACAAGCA ATACGGACAA AAAGCTAAAG ATCAAGGAAT ATACCGTTAC AGGTATAGTT CGATCAGTAT TGTATCTTGC TGAAGATTAT GGAAACACTA CTATCGGTGA CGGAAAGCTG TCCTCATTTA TATTTATAAA CAAGGATAAT TTTATTTTAG ATGCATATAC GGAAATATAT CTTATTGCAG CAGGCACAGA TAAAACAGAA GCATATTCTA AAGAGTATGA TACTGAAGTC TCAAAGCTCA AAGACGAACT TCTTAAAATA AAGTCAGACA GAGAAAATGC AAGATATCAG GAGATTTATA CGAAAGCAGA CAGTGAAATA AGAAAAAATG AAACAAAATT GAATGATGAA AAGAAAATGG GACAAAAGAA GCTGGCGGAT GCAAAAGCTA CTCTTGATGA AAATGCAGAG AAACTGGAAA AAGGAAAATC TGAACTTATT GCCAACGAGG CAGAACTTAA AAGAAATACC GAAAAGCAAA ATGCGGAGTT TGCTTCCGCA AAAGAAAAAA TATCGGCAGG CTGGAAGGAT ATAAATAATG CACTTGATCA AAATAAAATA AAAAAAGAAG AAATAGACAC TAAAATTAAT GAGCTGAATT CCGCCATTAG GGTAATGAAA GCACAGCTAA GTCAACTGCC CGTTGAAAGT CAGGAATATA TCCGGCTCAA TGCCACTATA AATCAATATT CCGAAATGCA AAAAGGCTTG CTAAAGTTAA AGCAATCAAT AACTACATTA ACCGCTCAAG AAGCAAAGCT GGATAACGGT ATTGACACCT TTAATAGCGA AATCGCTAAA GCTAAAAGAA AAATAGAACA GGGTAAAATT GAATTAGCAC AGAATGAAAA GAAATTGAAG GACGGATATA CAGAATACAG TAAAAATGTG GAAAAATTTA ATACAGAAAT AATAGATGCA CAGACAAAGC TAACAGATGC AAAAAAAGAA GTTTCTGATA TTGAAAAGCC AAAATGGCAC ATATTTGGCA GAGAGGCGGC TGGGGGTTAC AACGAATTAA AATCAGGTAT CGATGTGGTT ACTTCGGTTG CAGCACTATT TCCATTCTTT TTCATACTTA TTGTTATGCT GATGGCATCA AACTCAATGG TACGGATGAT AGAGGAAGAA AGAAGTGAGC TGGGTACATT AACCTCACTA GGGTACAAAG ACGGCAGTAT AATATCAACA TATCTGTTCT ATGTACTGTC TGCGTCTGGT TTGGGGGCAG TTGTAGGATT TTTCACTGGC TGTGGGATTA TCCCTCCGTT GATTTACTCA ACATTCAGGT TTAATCTGCC GCCCCTTGTT ATTAAATACA GCATGGGAAC ATTTTCGATA ATTTTACTTA TTACCTTTGC TCTTATGAGC ATAGTAACGG TTGTTTCATG CAACAAAGAG CTTAAGCAGA AGCCGTCGAC GCTGATGCGC CCTGTTCCTC CTGAAAACGG TAAAACAATT GTTCTTGAAA GAATCAAGCC TCTGTGGGGG CGTTTGTCCT TTACTTGGAA GGTAACTTTG AGAAATATGT TCAGGTATAA AAAGAGAGCC TTTATGACAA TTGTGGGGGT AGCAGGGTGT ACGGCCCTTC TGCTTGTAGG TTTTGGTCTG AGGGACAGCA TGAATGGCGT GGCGGAAAAG CAGTATGGAG AAATCTTCCG TTATGATAAT ATGTTTATTT TAAAAAATGA GATAAAAAAT ATACAAGGTG ACTTGGAGAA TCTTTTAACA AGGGAGCAAG TAAAAGAGCC ACTTTTGCTT AAACAGACTG CTTTAAAATG CGAAACAAAG AATAAATCCT ACGATGCTTT TTTAATAGTA CCTGCTAATG AGGATGTATT TTATAAATAT TTTAATCTCA AAACTCCTTC CGAGGGAGTA CAGTTAACCC TGAATGGCAG CGGTGTTATT ATTACGCAAA AACTGTCGGA GGTTTATAAT GCGGGGAAAG GAGATACTAT TACCGTAAAG GATGCCGAAA ATAATTCTTA TAAGTTGACC GTTACCGGTG TTGCAGAAAA CTATACAGCT GATTATATTT ATATGAACAA TCAAATGTAC AATAGGATTT TCGGCAAAGC CGCATCCTAT AATGCAATTG TGTCAAATCA TAAAACAGAT GAAACGGCTT TTGCAGAAAA ACTTATTGAC AGCGGCTTAG TTTTAAATGT GGTGTTTAAC GGGGACTTGA TAAAGAAGGT TCTTGACAGC AACGAAAGCC TTAATAGTAT AATTCTGTTG ATTGTGGTAG TTGCCTCACT GTTAGCAATC ATTGTTCTTT ATAACCTGAC ATCGATAAAT ATAAGCGAGC GAACCCGAGA GATTGCAACA CTAAAAGTAC TTGGCTTTAC CGATGAAGAA ACAAACGGTT ATATTTACCG TGAAGCCTTT ATACTTACAT TAATAAGTAT TGGAGTAGGC TTGGTATTGG GAATTTATAT CCATAGTTTA GTAATTGATG TAATTGGAGA AAACTCATTG GTGTTGTTCA AAAAAATAAA ATGGCTAAGC TTTTTACTGG CAGCGTTACT TACTGTCATA TTCTCTGTAG TAATGCAAAT AGTAACTTAT TTCAAATTGC AAACAATTGA TATGATAGAA TCACTGAAGT CAGTGGAATA G
|
Protein sequence | MINLLYKNTL KKIKKSFGRY ISLFIIVMIG VGFFAGIQAT TPDIIDVADK YYKESNLLDF KVVSTLGLTD QDVNAIKKLS NVGAVIPSYS LDVLDTDKAT RVHAIEDKVN TFKLEDGRMP QKHDECIADS KTYKIGDKIK ITSNTDKKLK IKEYTVTGIV RSVLYLAEDY GNTTIGDGKL SSFIFINKDN FILDAYTEIY LIAAGTDKTE AYSKEYDTEV SKLKDELLKI KSDRENARYQ EIYTKADSEI RKNETKLNDE KKMGQKKLAD AKATLDENAE KLEKGKSELI ANEAELKRNT EKQNAEFASA KEKISAGWKD INNALDQNKI KKEEIDTKIN ELNSAIRVMK AQLSQLPVES QEYIRLNATI NQYSEMQKGL LKLKQSITTL TAQEAKLDNG IDTFNSEIAK AKRKIEQGKI ELAQNEKKLK DGYTEYSKNV EKFNTEIIDA QTKLTDAKKE VSDIEKPKWH IFGREAAGGY NELKSGIDVV TSVAALFPFF FILIVMLMAS NSMVRMIEEE RSELGTLTSL GYKDGSIIST YLFYVLSASG LGAVVGFFTG CGIIPPLIYS TFRFNLPPLV IKYSMGTFSI ILLITFALMS IVTVVSCNKE LKQKPSTLMR PVPPENGKTI VLERIKPLWG RLSFTWKVTL RNMFRYKKRA FMTIVGVAGC TALLLVGFGL RDSMNGVAEK QYGEIFRYDN MFILKNEIKN IQGDLENLLT REQVKEPLLL KQTALKCETK NKSYDAFLIV PANEDVFYKY FNLKTPSEGV QLTLNGSGVI ITQKLSEVYN AGKGDTITVK DAENNSYKLT VTGVAENYTA DYIYMNNQMY NRIFGKAASY NAIVSNHKTD ETAFAEKLID SGLVLNVVFN GDLIKKVLDS NESLNSIILL IVVVASLLAI IVLYNLTSIN ISERTREIAT LKVLGFTDEE TNGYIYREAF ILTLISIGVG LVLGIYIHSL VIDVIGENSL VLFKKIKWLS FLLAALLTVI FSVVMQIVTY FKLQTIDMIE SLKSVE
|
| |