Gene Information |
Locus tag | Ccel_0252 |
Symbol | |
ID | 7309152 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium cellulolyticum H10 |
Kingdom | Bacteria |
Replicon accession | NC_011898 |
Strand | + |
Start bp | 281804 |
End bp | 283108 |
Gene Length | 1305 bp |
Protein Length | 434 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | 643607182 |
Product | protein of unknown function DUF21 |
Protein accession | YP_002504619 |
Protein GI | 220927710 |
COG category | [R] General function prediction only |
COG ID | [COG1253] Hemolysins and related proteins containing CBS domains |
TIGRFAM ID | |
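The coordinate and length fields in this table can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends with a single stop codon; neither assumption is stated in the record, and the function names are illustrative only.

```python
# Values copied from the Gene Information table.
START_BP = 281804
END_BP = 283108
GENE_LENGTH_BP = 1305
PROTEIN_LENGTH_AA = 434


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def residues_from_cds(gene_length: int) -> int:
    """Amino-acid count, assuming the CDS length includes one stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(residues_from_cds(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under those assumptions both comparisons hold: 283108 - 281804 + 1 = 1305 bp, and 1305 / 3 = 435 codons, one of which is the stop codon, leaving 434 residues.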
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.00000514588 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGTTAACTA AAGTTTTACT GCTTGTGGTT CTTGTGCTGT TAAATGCGTT CTTTTCGGGG TCTGAGATCG CCCTCATTTC TTTAAATGAC AAATTGATTA AGAAACAGGC GGAAGAGGGA GATAAAAAGG CAAAACAGCT TTATAGTTTT CTTAGCGAGC CAAGCAGGTT TTTGGCCACC ATCCAAATAG GTATAACATT AGCGGGGTTC CTTGCAAGTG CATTTGCAAC AGAGAGTTTT GTAGACGATT TGACAGGGTT ATTGGTAAAG ACAGGTTTTC CTGTTGCTGA ATCCGTTATC AGAAGTGTTT CACTTGTAGT AATTACAATA ATTCTTTCAT ACTTTACATT GGTTTTTGGT GAATTGATAC CTAAGAGACT GGCTATGCAA AAGTCTGAGT TCTTAGCCAA TATTGCTGTA GGTCCGTTGA TGTTCTTATC CCGTATAACA AATCCTTTTG TAAGGTTTCT GACGTTTTCT ACAAACTTTT TTATTAAGGT TTTTGGAGGA AATCCGGCTG GCGGGGACGA TGAAAAAGTA ACCGAGGAAG AAATCAGAAT GATGATGGAG GTTGGAGAAG AGAGAGGTGT TATTCAGGAT ACAGAGAAGG AAATGATTGA CAACATTTTT GAATTTGATA ACAAGAGTGT ATCTGAAATA ATGACTCATC GAACCAACAT TGTTGGAATA CCTGTGGATT CGGATATAAA TTATGTGCTA TACATCATGA ACAGAGACAA GTATACAAGA GTTCCTATTT ATAATGATAA CATTGATAAT ATAGTTGGTA TCCTGCATGT AAAGGACTTA TTGGAGTACA CTCAGAGCCA TAATAAGGAT TTTAGTCTAA AAAAAATAAT AAGAAGTGCT TACTTTGTTC CTGAATCAAA GAGAACAGAC GAGTTATTTA AGGAAATGCA GAAGAACAAA GTCCATCTGG CGGTTGTAAT CGACGAATAT GGTGGTACAG CCGGAATAGT TACCATAGAA GACTTGTTGG AAGAGATTGT CGGAAACATT TTCGACGAAT ATGATATAGA ACAAAAAGAT ATAGAATATC TTGAAAATAA TACTTATATT TTTGACGGTG CAATCGACCT TGACAAGGTT GAAGAAGTAC TGGACGAGGA TCTTCCTGTT GACGATTTCG ATACCTTGGG AGGTTTTATT CTAAAGCTAC TTGGCAGAAT ACCAAAGGTA GATGAGAAAC CTACAGTCCC GTATGAGAAT ATTGTTTTTA AAGTAGTCAA AATGGAAGGC AAGAGAATAG TAAAGGTACA GGCTAGTAAA CAGGAAAATA ATTAA
|
Protein sequence | MLTKVLLLVV LVLLNAFFSG SEIALISLND KLIKKQAEEG DKKAKQLYSF LSEPSRFLAT IQIGITLAGF LASAFATESF VDDLTGLLVK TGFPVAESVI RSVSLVVITI ILSYFTLVFG ELIPKRLAMQ KSEFLANIAV GPLMFLSRIT NPFVRFLTFS TNFFIKVFGG NPAGGDDEKV TEEEIRMMME VGEERGVIQD TEKEMIDNIF EFDNKSVSEI MTHRTNIVGI PVDSDINYVL YIMNRDKYTR VPIYNDNIDN IVGILHVKDL LEYTQSHNKD FSLKKIIRSA YFVPESKRTD ELFKEMQKNK VHLAVVIDEY GGTAGIVTIE DLLEEIVGNI FDEYDIEQKD IEYLENNTYI FDGAIDLDKV EEVLDEDLPV DDFDTLGGFI LKLLGRIPKV DEKPTVPYEN IVFKVVKMEG KRIVKVQASK QENN
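The gene sequence begins with TTG rather than ATG, while the protein begins with M. This is consistent with translation table 11, in which certain alternative initiation codons are read as methionine when they start a CDS. The sketch below illustrates this on only the first 30 and last 15 nucleotides of the gene sequence. The codon table layout (bases ordered T, C, A, G) and the start-codon set are written from memory of the NCBI table 11 definition and should be treated as assumptions; the helper names are hypothetical.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (same assignments as the standard code).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
# Assumed initiation codons for translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}

# Fragments copied from the Gene sequence row, with spaces removed.
FIRST_30_NT = "TTGTTAACTAAAGTTTTACTGCTTGTGGTT"
LAST_15_NT = "CAGGAAAATAATTAA"


def codon_to_aa(codon: str) -> str:
    """Look up one codon in the TCAG-ordered table."""
    i, j, k = (BASES.index(base) for base in codon)
    return AMINO_ACIDS[16 * i + 4 * j + k]


def translate(dna: str, first_is_start: bool = True) -> str:
    """Translate an in-frame DNA fragment; '*' marks a stop codon."""
    codons = [dna[p:p + 3] for p in range(0, len(dna) - len(dna) % 3, 3)]
    protein = []
    for n, codon in enumerate(codons):
        if n == 0 and first_is_start and codon in START_CODONS:
            protein.append("M")  # initiation codon is read as methionine
        else:
            protein.append(codon_to_aa(codon))
    return "".join(protein)


if __name__ == "__main__":
    print(translate(FIRST_30_NT))                        # start of the CDS
    print(translate(LAST_15_NT, first_is_start=False))   # end of the CDS
```

The first fragment translates to MLTKVLLLVV and the second to QENN followed by a stop, matching the first and last residues of the protein sequence listed above.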