Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ccel_3398 |
Symbol | |
ID | 7311960 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium cellulolyticum H10 |
Kingdom | Bacteria |
Replicon accession | NC_011898 |
Strand | - |
Start bp | 3942231 |
End bp | 3945155 |
Gene Length | 2925 bp |
Protein Length | 974 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 643610302 |
Product | hypothetical protein |
Protein accession | YP_002507666 |
Protein GI | 220930757 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.0000354514 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGAAATC TCAGAAAGCT TACTGCAGTT GTTATAGCCG TAGCATTGGT ACTAACATCT ATGACTGCTG CATTCGCTGC TTCAGGTTCA TATGAATTTG AAGATCAAGC AACAGTACTT AAGGATCTTG GCATTTGGCA GGGAGACACT ACCGGTGACT TGATGCTTGG TGAAGATTTA ACTAGAGCAC AAGGTGCTGT ATTAGTACTA AAGACCGTAT TAGGAAAGAC TGACAAAGAT ATGGAAGCTG CAGATGTTTC CAAAATCGCA AGCTTTGATG ATGCTGACGA AGTTCCAGCA TGGGCTGAAG GTTGGGTAGC TCTTGCTGTT CAAGAAGGCG TTATGAAGGG TGGCAACAAC AAGTTAGCTG CTGGCGATCC TTTAAAGGGA AAAGATCTGG CATCTATGTT CATGAACGCT CTTGGTTTTG CAGCTGAGAA CGATTATGCT ACATCAGTTG AATTGTTAGC TGCTAAGTCA GCTGGTAAAA TTCTTGTAGC TATCGCTGAT GATATTACTG ATGCAGATCT TACAAGAGAT GCTGCTTCCG CAGTAGTATT CGACACTTTA ACTGTTAAGG CTAAAGATGC AACTAAGACA GTTGTTGAAG TTTTAGTTGG AACTGACGCT ACTAAGAAGG CAGTTGCTGA AAAAGCTGGT TTGATAGTTG CTCCAGCTGC TCAGACAGTT ACTGACGTTA AGCCTTTAAA CTTGAAGCAG GTTCAAATTA CATTTGCAAA AGACCTTGTA AAGGCTGATG CAGAAAAGAT AGCAAACTAT GTTGTAACTG AAGGTACAAC TGACAAAGCT ACAGGTGGAA GTGTTGCTCT TCAGGCTGAT GGAAAGTCAG TTATAGTAAC ATTGGGCCAG GGTATCACAA ATGGTGCAAC TGCTGTTGTT GAAGTAAAGA ATTTTGCTAC ATACAAAAGT GATGCTGTGA AGTTTGAAGA CTCTACTGTA CCTACAGTAC TTGGTGTTAC AGTATCTGGT CCAAACACTC TTACTGTAGA ATATAGTGAA CCAGTTCAAC TTAAGTCATC TACGACACCT GTAACAGATG CTATCTCTGG TGGAGAATAC AAGATTGATG GTGGAAACTA TATCCTTACT GATATTGAAA TAAATATTAA TAAGGTTACT TTAACAGTAG GTGTTCCACT AACAGAGGGT GCACACAAGG TAAGCTTTGA GTCAAAGGGA TTCATATTTG ACTATGCAGG ATACAATGTA CTGCCTAAGA CTGTTGAATT CACAGTAACA AATGATAACA CAGCTCCTGT ACTTACATTA AAGTCAGCTG ATCCGAAGCA AATAGTTTTG ACTTCTAACA AGCCTTTGAA AGAAGATAGT GTTAAGAGCG GTAACGTTAG ATACAGACAT ACATACAATA CTGACACATA TGTTGTAAAA GGTAACGATA CAAAAACAGT TGATTTAGAG ACTATTAGTA AAGTTACTCT CACTGATTCA GGAACCACAC TTACAATTGA CTTCTCAGGT AATGTAATAC CATTGGGAGC AACTAATCTC TATATTGGAT ATGATGATGC AAATGGCACT CAAATTCAGG ACTTATGGGG AAATAAATTA CCAGCTACAA CTATTCCTTT GAACATAACT CTGGATACTG TTAAACCAAC AGTTACAGAA GTTAAGTTTG ATAATACATT GCAGTTAACA GTTGTTTTCT CAGAAAAACT AAATAAGGCA TCAGCAGAAG ATAAAGGAAA CTATGTAATC AAAGATTCAG CTGGAAAGGT TATAGCTGTT ACGGGTGCAA CACTTGTTAA TGACGATTCA CTTAATAAGG TACAGCTTGC GTTTACAGAA GAATTGGGTG GTGGATCCTA TACTATCGAA ATAAAGGGTG TTAAGGATGA CGCTTTTGTA AATAATGCAA TGGACACTTA TACATCAACA TTGAACTTTA CTGATAAAGT TGCTCCAAAA GTAACTGTTG CTTCAGCAAG AATTGTTATC AGTAAAGATT CAGCAAATAA TGCAGACAAG AAGGCATCCA TCTACATTCC ATTCAGTGAA CAAATGGATC CTACTACTCT TGTGAAGGCT AACTTCATGA AGGCAATTGG TGACCCTTTA TCAATAGATA CTAAGTTCGT AGCTTTGGGT GACAACGATA CAGTTACTCC TGCTGCTGAC GGAAAGTCAG TTACAATTGT GTTAGATAAG AATGCTGATG CTTTCGTATA TGATCAAGTT CAGATCAAGG TTGGTCTTGT AAAAGATGTT GCAGGAAATA CTCTGGCAAC TTATGTTCAA GATGTAAAAC CTGCAAAGGA TGCTATAAAA ATCGAAAAGG TTGAAGCTAT AGCTAAAAAG CAGATAAAGG TTACATTTGA TGGTAGACTT TCTACAATAA CTGCTAAAGG ATTTAAACTC GCAAATGAAG CTGGTGAGCA AATCGCGTTG TCAGTTGCAA GTGTGGCATT GAACGACGAT GGAAAGTCAG TAGTTGTATT CAACCTTGGA GCAGAACTCA AAGAGGATGC AACATATGCT AATAAAGAAG CTGTAACAGT AGTATTGTTG TCTGTTGATG AAGCTGTTGC TTTGGATACT AAATCATACT TAGGTGCGGT TATTTCAACT AGTTCTGAAA CCGCTTCAGA CGTAATTGTT CCTACTGTTG ATACAACAAC GGTTATGGCT GATGGTACTA TTCAAGTTAC ATTCTTTGAG AAAATTGATG CTTCAACTTT AGCAGCTAAA ACATTAAATG GTTTCTCAGT ATCAGGAGAT GTTAAGATAA AATCAGTTGG CGCTTCAGGT AAGGTAATAA CTCTTACACC TGAAGATGGT AAGAAGTTCT CAGACTCAAC TGTTGTTAAA TACAACTCAG TTGCTGGAAT TACTGATGAA TCTGGAAACA AGGTTGCTGA CTTCGAAAAA ACAGCTAAAA AATAA
|
Protein sequence | MRNLRKLTAV VIAVALVLTS MTAAFAASGS YEFEDQATVL KDLGIWQGDT TGDLMLGEDL TRAQGAVLVL KTVLGKTDKD MEAADVSKIA SFDDADEVPA WAEGWVALAV QEGVMKGGNN KLAAGDPLKG KDLASMFMNA LGFAAENDYA TSVELLAAKS AGKILVAIAD DITDADLTRD AASAVVFDTL TVKAKDATKT VVEVLVGTDA TKKAVAEKAG LIVAPAAQTV TDVKPLNLKQ VQITFAKDLV KADAEKIANY VVTEGTTDKA TGGSVALQAD GKSVIVTLGQ GITNGATAVV EVKNFATYKS DAVKFEDSTV PTVLGVTVSG PNTLTVEYSE PVQLKSSTTP VTDAISGGEY KIDGGNYILT DIEININKVT LTVGVPLTEG AHKVSFESKG FIFDYAGYNV LPKTVEFTVT NDNTAPVLTL KSADPKQIVL TSNKPLKEDS VKSGNVRYRH TYNTDTYVVK GNDTKTVDLE TISKVTLTDS GTTLTIDFSG NVIPLGATNL YIGYDDANGT QIQDLWGNKL PATTIPLNIT LDTVKPTVTE VKFDNTLQLT VVFSEKLNKA SAEDKGNYVI KDSAGKVIAV TGATLVNDDS LNKVQLAFTE ELGGGSYTIE IKGVKDDAFV NNAMDTYTST LNFTDKVAPK VTVASARIVI SKDSANNADK KASIYIPFSE QMDPTTLVKA NFMKAIGDPL SIDTKFVALG DNDTVTPAAD GKSVTIVLDK NADAFVYDQV QIKVGLVKDV AGNTLATYVQ DVKPAKDAIK IEKVEAIAKK QIKVTFDGRL STITAKGFKL ANEAGEQIAL SVASVALNDD GKSVVVFNLG AELKEDATYA NKEAVTVVLL SVDEAVALDT KSYLGAVIST SSETASDVIV PTVDTTTVMA DGTIQVTFFE KIDASTLAAK TLNGFSVSGD VKIKSVGASG KVITLTPEDG KKFSDSTVVK YNSVAGITDE SGNKVADFEK TAKK
|
| |