Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ccel_3098 |
Symbol | |
ID | 7311695 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium cellulolyticum H10 |
Kingdom | Bacteria |
Replicon accession | NC_011898 |
Strand | - |
Start bp | 3634475 |
End bp | 3635626 |
Gene Length | 1152 bp |
Protein Length | 383 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | 643610002 |
Product | exopolysaccharide biosynthesis protein |
Protein accession | YP_002507370 |
Protein GI | 220930461 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG4632] Exopolysaccharide biosynthesis protein related to N-acetylglucosamine-1-phosphodiester alpha-N-acetylglucosaminidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.0000562756 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAAAA ATAATCAGAA ACTGATTATA AATATTGGTT TGGTTTTAGT ATTGTCCTTC ATTTTAATTT ATGCAGGACA ATTTTTATTT TCCCCAAATA ATAATACTCC GGGAAAGGCT GACAACACCT TGAATTCACC GGATACATCG TCAAATTCAG TACAGCCTTT ACCTGTTCAA TACAAGAGTA CAACAGAAAC CATAAACGGG TATAAGCAGG AAATATATAT GCTGGAGTTT GATCCAAGAG ACGAAAGAGT TGAATTCAAG CCAGCATTGT CATTTGATAA TATTTTCGGA TTTGAAAAAT TATCTGATAT TTGTAAAAGG AATGGGGCAT ATGCAGCGGT TAATGGAGGA TTTTTTTACC AATTTGGTGA TCCGGCAGGG ATGGTTGCTA TAGACGGCCA GATGCTCACG ACATCAACGG GATTGAGTCC TGTACTTATT TTAGATAAAA TGGGTGCGAG ATTTGAAACC TTTTATTCCA ATATTTTTTT GGAATCTAAA GGTAATAGAG TTAAGATAAA TGAGATGAAC AGGGTAGGTA AAAATGATGA TATAATTTTA TATATTGACA AATTCGGAAA TACAAACAGA GCTGAAGTAA AAAGTACATC ACTTATAGTT GATAACAATA AAATAATTTC TATAATTGAA AGTACAAAAG AAGTTAACAT AAAAAAAGGT ATGTATGTCA TCAGCTTTTA CGGCGATAAA TCATCGCTGC CTGACAAAAT TGGTTTAAAA ACGGGTGATA AAGTAAATAT TAGGATAGAA CCGTATTTAG GTTATAATTA CCAGGCTTAT GAATGCGGGA GTATGCTTGT AAAAAACGGG AAATCAGTAG TGCCGGAACG TGACAAATGG GCGGGAACTT TAGGTAACCG TGACCCTAGG ACGGTTATTG GTATAAAAAC AAACGGCAAG ATAGTACTAG TGGTTGCCGA TGGTCGCCAG CCGGGATATA GTGAAGGAAT GACGGGTAAA GAAATGGGTG AATTCCTAGT GAAAATAGGT GTGAGGGATG CGGCAATGCT AGACGGCGGA GCCACTTCAC AGATGATAAT AAATGGCAGA ATCCAGAACA GACCGTCCTA TGAAGGGATT GAGAGGCCAG TAGCTGGATG TTTTATAGTT AAGATCAAAT AA
|
Protein sequence | MKKNNQKLII NIGLVLVLSF ILIYAGQFLF SPNNNTPGKA DNTLNSPDTS SNSVQPLPVQ YKSTTETING YKQEIYMLEF DPRDERVEFK PALSFDNIFG FEKLSDICKR NGAYAAVNGG FFYQFGDPAG MVAIDGQMLT TSTGLSPVLI LDKMGARFET FYSNIFLESK GNRVKINEMN RVGKNDDIIL YIDKFGNTNR AEVKSTSLIV DNNKIISIIE STKEVNIKKG MYVISFYGDK SSLPDKIGLK TGDKVNIRIE PYLGYNYQAY ECGSMLVKNG KSVVPERDKW AGTLGNRDPR TVIGIKTNGK IVLVVADGRQ PGYSEGMTGK EMGEFLVKIG VRDAAMLDGG ATSQMIINGR IQNRPSYEGI ERPVAGCFIV KIK
|
| |