Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ccel_3420 |
Symbol | |
ID | 7311980 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium cellulolyticum H10 |
Kingdom | Bacteria |
Replicon accession | NC_011898 |
Strand | + |
Start bp | 3974126 |
End bp | 3975415 |
Gene Length | 1290 bp |
Protein Length | 429 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | 643610325 |
Product | protein of unknown function DUF815 |
Protein accession | YP_002507688 |
Protein GI | 220930779 |
COG category | [R] General function prediction only |
COG ID | [COG2607] Predicted ATPase (AAA+ superfamily) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.00358176 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTATATTA ACACAAATGA ACTAATCCTC TACAAGCATT TCCATCGATC TGATATCTTT GAAGATATTT CGTGGGTTAT AAACAATTAC CAAAGCAAAG CTTTTTCTAA AGAGAATGTG AAGACAAGAC TATATGAGGG GTTGCATAAA TTAATTGAAT TTTCAGTTGA CCATTGTTTT GATGGCAATT TATTCCATTC ATACCTCACC TATGTTCTTA TAACCAATGA AAATGCCTTT GCAAGGGCTT GTGAGATATC GGGAAAACCA AGCGGTACAA TTAATGACCT TGCTTTAAGT GATTTTAGGA TTTTTAGAAA GCTATATGAT TTTGATTTAT CCTTGATTGA CGTAGTTCTC CAAGTACAGT GTTTTAGCAT CGTTACAAGC TACATGCCCT CGGATATCTG TAAGAACCGG AACAAAGGAA TACCTTATTT TAAAGAACTT GTTAACAACC TATCAACATC TGAGAATGAA GAATCATTCT ATGCAATACT GACTGAATTT TACAGGAAGT TTGGAGTTGG GTTACTTGGT CTTAACAAGG CATTCAGTGT TATTCATTAT GAAAATGACA TTAAACTCAA GCCTATTACA AATATTGAAT CTGTACTTCT AAATGATCTT GTCGGCTATG AGGCTCAAAA AAAAGAGCTT GTTATGAACA CAGAAGCTTT TGTGCAGGGC CGCAAAGCCA ACAATGTCCT TTTGTACGGT GACAGTGGTA CAGGTAAATC TTCAAGCGTT AAAGCAATCT TAAATGAATA CTATAAACAT GGCCTCAGAA TGATAGAGGT TTACAAGTAT CAAATGAAGG ATTTACCCCT TATAATAGGG CATATAAAAT ATAGGAAGTA TAAGTTTATA ATTTACATGG ACGATTTATC TTTTGAGGAC CATGAAACTG ATTATAAGTA CCTGAAGGCA GTCATTGAAG GCGGCTTGGA GGCGAAACCT GAAAACGTTC TCATATATGC AACTTCTAAC AGAAGACATA TTATCCGTGA GAAATGGAGT GACAAAAATG ACAGGGATGA TGACCTCCAT ACTAACGATA CAGTTCAAGA AAAACTTTCA CTATCAGCCA GGTTCGGATT ACCGATTCTA TATATAGCTC CCAGTAGAAA GGAGTTTCTT CATATTGTTA AGGTACTTGC AGATAAATAC CGTATTAATA TACCAGAAGA AGAACTATAC CTGGAGGCAA ACCGGTGGGA ACTACGTAAC GGTGGCCTAT CAGGAAGAAC CGCTCAGCAT TTTATCACTT ATCTGCTTGG AAAAAAATAA
|
Protein sequence | MYINTNELIL YKHFHRSDIF EDISWVINNY QSKAFSKENV KTRLYEGLHK LIEFSVDHCF DGNLFHSYLT YVLITNENAF ARACEISGKP SGTINDLALS DFRIFRKLYD FDLSLIDVVL QVQCFSIVTS YMPSDICKNR NKGIPYFKEL VNNLSTSENE ESFYAILTEF YRKFGVGLLG LNKAFSVIHY ENDIKLKPIT NIESVLLNDL VGYEAQKKEL VMNTEAFVQG RKANNVLLYG DSGTGKSSSV KAILNEYYKH GLRMIEVYKY QMKDLPLIIG HIKYRKYKFI IYMDDLSFED HETDYKYLKA VIEGGLEAKP ENVLIYATSN RRHIIREKWS DKNDRDDDLH TNDTVQEKLS LSARFGLPIL YIAPSRKEFL HIVKVLADKY RINIPEEELY LEANRWELRN GGLSGRTAQH FITYLLGKK
|
| |