Gene Information |
Locus tag | Ccel_1562 |
Symbol | |
ID | 7310323 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium cellulolyticum H10 |
Kingdom | Bacteria |
Replicon accession | NC_011898 |
Strand | + |
Start bp | 1911003 |
End bp | 1912640 |
Gene Length | 1638 bp |
Protein Length | 545 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | 643608491 |
Product | Mannosyl-glycoprotein endo-beta-N-acetylglucosaminidase |
Protein accession | YP_002505894 |
Protein GI | 220928985 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG4193] Beta-N-acetylglucosaminidase |
TIGRFAM ID | |
|
|
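The coordinate, length, and protein-length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it is consistent with the reported gene length) and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 1911003
end_bp = 1912640

# Assumption: coordinates are 1-based and inclusive, so add 1.
gene_length = end_bp - start_bp + 1

# A CDS is read in triplets; the last codon is assumed to be the stop codon.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")      # compare with the "Gene Length" field
print(protein_length, "aa")   # compare with the "Protein Length" field
```

Under these assumptions the computed values agree with the Gene Length (1638 bp) and Protein Length (545 aa) fields.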
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTATGCAA AAGCAAATGC CAAAAAGCAG GATAAAATTC AAAACAAAGG AGGTGTATTG CAAAAGGCTA AGTCTCCTAA CAGGCGAAAT ATTAAACAGC AAAACCCGAC GGAGATAATT CAACGTATCC GGATAAACCC TGAATCGATG ACACAAAAAG ATGTAGTACA ACTTCAACAT ACAATTGGGA ACCAAGCAGT TCAACGGTTA ATGTCTCGGC TCCATAATAA AAATGGTTAT AATGCAGAAG ATAAACCCAT ACAGAAGAAA GAAGAAAAAA AGAAAGAGAT TCAATCCATA TCAGAAAGGA ATTCCCCATT GGGTCTGCCT ATTAATCTAA AAGAAGGACT TGAATCACTC TCGAATATTG ATCTTTCAGA TGTTCAGGTA CACTATAATT CAGATAAACC TCAAGATGTA GGTGCTTTAG CTTTTACTCA AGGAAATAAT ATTCATATCG CACCCGGCCA AGAGAAGCTT CTTCCCCATG AAGGCTGGCA TACAGTGCAG CAAAAGCAGG GTAGAGTACA GCCCACCATG CAAATGAAAA CAGGAACACT TGTTAATGAA GATGCAGGTC TTGAAAAAGA AGCTGATGCT ATGGGGAGTA GAGCTGAAAG AGAAAGTTCA GGAAACAAAA CCTTACAATT TAAGGGATAT TCAAAATTAA ATCTGCCAGA TTATAATGAA AATGTTATAC AAAAGAAAGA CAATGCTTCC AAAATCGGCA AAAAAGTAAT ATACAACCAA CAAACAAAAA ACTATAAAAT AATTAATAGC AAGGATGGTT ACGAAAAGGA TTGGAGTGAA ACCCCGCCAA AATGCATGCA AATTGTATAT AGTAAAACTT TGAAATATGC ATTATGCAAC AATACTGGTG AAATTGCAAG TTACTTAACT TCTGGCTGGT ACACAGATCC ACTTTGGGCC CAAGGAATAA AAGTTAGTAG CTATGATGTA AGCTTACAAG ATTCATTAAA TAAACAGATG AAACTAAGTG CAAAACCTCA AACACAAAAG AATGGTAAAT GGGTTAATGC GGAAACCGAT CAGGTAAAAA AATATTTAGA TCCAAGTAAC TTTAATGATG GTGTAAGCAA ATACCAATTT CTTGATTTAT CGGCATCTGC TGATATAAGT GAGAAAGAGA TGACAAAATT CTTATCAGGT AAAGGAGTTC TTTCTGGTCA TGCAAAGACT TATTTGGACG CAGCTAAGAA ATATAATGTC AGCGAGGTGT ACTTAGCAGC ACATTCAGCA CTTGAAACAG GTAATGGCAC AAGTGAACTA GCTAAGGGGG TCAAGGTAGA AGGAGTAAAA GTTTATAATA TGTATGGCAT CAATGCCACT GACAAAGACC CTGTAGGTGA AGGTTCCAAA TATGCATATA AAATGAAATG GACTTCGATA GATAAAGCTA TCGATGGTGG TGCTGAATGG ATTTCTAAAA ACTATATAAA TAGTTCTTCA CATAGCCAAA ATACTTTGTA TAAAATGCGA TGGAACCCTG CTTCACCCGG TGAGCATCAA TATGCCACAG ATATAGCATG GGCTGTTAAT CAAACCTCTA GTCTTAAGAA AATGTACGAT TCTTTTCCTA GTGCTTCTTT AAAGTTTGAT ATTCCAGTAT ATAAGTAA
|
Protein sequence | MYAKANAKKQ DKIQNKGGVL QKAKSPNRRN IKQQNPTEII QRIRINPESM TQKDVVQLQH TIGNQAVQRL MSRLHNKNGY NAEDKPIQKK EEKKKEIQSI SERNSPLGLP INLKEGLESL SNIDLSDVQV HYNSDKPQDV GALAFTQGNN IHIAPGQEKL LPHEGWHTVQ QKQGRVQPTM QMKTGTLVNE DAGLEKEADA MGSRAERESS GNKTLQFKGY SKLNLPDYNE NVIQKKDNAS KIGKKVIYNQ QTKNYKIINS KDGYEKDWSE TPPKCMQIVY SKTLKYALCN NTGEIASYLT SGWYTDPLWA QGIKVSSYDV SLQDSLNKQM KLSAKPQTQK NGKWVNAETD QVKKYLDPSN FNDGVSKYQF LDLSASADIS EKEMTKFLSG KGVLSGHAKT YLDAAKKYNV SEVYLAAHSA LETGNGTSEL AKGVKVEGVK VYNMYGINAT DKDPVGEGSK YAYKMKWTSI DKAIDGGAEW ISKNYINSSS HSQNTLYKMR WNPASPGEHQ YATDIAWAVN QTSSLKKMYD SFPSASLKFD IPVYK
|
| |
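The relationship between the gene sequence and the protein sequence above can be sketched as follows. This is a minimal illustration, not the database's own pipeline: it builds the codon table by hand rather than using a library, and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ only in permitted start codons). The helper names `clean`, `gc_fraction`, and `translate` are hypothetical. Only the first three and last two 10-base groups of the gene sequence are used, to keep the example short.

```python
from itertools import product

# Amino acids for codons in TCAG order (NCBI layout). Table 11 uses the
# same codon-to-amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the spaces that separate the 10-base display groups."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three and last two display groups of the gene sequence above.
head = "ATGTATGCAA AAGCAAATGC CAAAAAGCAG"
tail = "ATTCCAGTAT ATAAGTAA"

print(translate(head))  # start of the protein sequence
print(translate(tail))  # end of the protein sequence, stop codon dropped
```

Pasting the full gene sequence into `translate` and `gc_fraction` would allow the whole protein sequence and the GC content field to be checked in the same way.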