Gene Ccel_1562 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCcel_1562 
Symbol 
ID7310323 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium cellulolyticum H10 
KingdomBacteria 
Replicon accessionNC_011898 
Strand
Start bp1911003 
End bp1912640 
Gene Length1638 bp 
Protein Length545 aa 
Translation table11 
GC content36% 
IMG OID643608491 
ProductMannosyl-glycoprotein endo-beta-N-acetylglucosamidase 
Protein accessionYP_002505894 
Protein GI220928985 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG4193] Beta- N-acetylglucosaminidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTATGCAA AAGCAAATGC CAAAAAGCAG GATAAAATTC AAAACAAAGG AGGTGTATTG 
CAAAAGGCTA AGTCTCCTAA CAGGCGAAAT ATTAAACAGC AAAACCCGAC GGAGATAATT
CAACGTATCC GGATAAACCC TGAATCGATG ACACAAAAAG ATGTAGTACA ACTTCAACAT
ACAATTGGGA ACCAAGCAGT TCAACGGTTA ATGTCTCGGC TCCATAATAA AAATGGTTAT
AATGCAGAAG ATAAACCCAT ACAGAAGAAA GAAGAAAAAA AGAAAGAGAT TCAATCCATA
TCAGAAAGGA ATTCCCCATT GGGTCTGCCT ATTAATCTAA AAGAAGGACT TGAATCACTC
TCGAATATTG ATCTTTCAGA TGTTCAGGTA CACTATAATT CAGATAAACC TCAAGATGTA
GGTGCTTTAG CTTTTACTCA AGGAAATAAT ATTCATATCG CACCCGGCCA AGAGAAGCTT
CTTCCCCATG AAGGCTGGCA TACAGTGCAG CAAAAGCAGG GTAGAGTACA GCCCACCATG
CAAATGAAAA CAGGAACACT TGTTAATGAA GATGCAGGTC TTGAAAAAGA AGCTGATGCT
ATGGGGAGTA GAGCTGAAAG AGAAAGTTCA GGAAACAAAA CCTTACAATT TAAGGGATAT
TCAAAATTAA ATCTGCCAGA TTATAATGAA AATGTTATAC AAAAGAAAGA CAATGCTTCC
AAAATCGGCA AAAAAGTAAT ATACAACCAA CAAACAAAAA ACTATAAAAT AATTAATAGC
AAGGATGGTT ACGAAAAGGA TTGGAGTGAA ACCCCGCCAA AATGCATGCA AATTGTATAT
AGTAAAACTT TGAAATATGC ATTATGCAAC AATACTGGTG AAATTGCAAG TTACTTAACT
TCTGGCTGGT ACACAGATCC ACTTTGGGCC CAAGGAATAA AAGTTAGTAG CTATGATGTA
AGCTTACAAG ATTCATTAAA TAAACAGATG AAACTAAGTG CAAAACCTCA AACACAAAAG
AATGGTAAAT GGGTTAATGC GGAAACCGAT CAGGTAAAAA AATATTTAGA TCCAAGTAAC
TTTAATGATG GTGTAAGCAA ATACCAATTT CTTGATTTAT CGGCATCTGC TGATATAAGT
GAGAAAGAGA TGACAAAATT CTTATCAGGT AAAGGAGTTC TTTCTGGTCA TGCAAAGACT
TATTTGGACG CAGCTAAGAA ATATAATGTC AGCGAGGTGT ACTTAGCAGC ACATTCAGCA
CTTGAAACAG GTAATGGCAC AAGTGAACTA GCTAAGGGGG TCAAGGTAGA AGGAGTAAAA
GTTTATAATA TGTATGGCAT CAATGCCACT GACAAAGACC CTGTAGGTGA AGGTTCCAAA
TATGCATATA AAATGAAATG GACTTCGATA GATAAAGCTA TCGATGGTGG TGCTGAATGG
ATTTCTAAAA ACTATATAAA TAGTTCTTCA CATAGCCAAA ATACTTTGTA TAAAATGCGA
TGGAACCCTG CTTCACCCGG TGAGCATCAA TATGCCACAG ATATAGCATG GGCTGTTAAT
CAAACCTCTA GTCTTAAGAA AATGTACGAT TCTTTTCCTA GTGCTTCTTT AAAGTTTGAT
ATTCCAGTAT ATAAGTAA
 
Protein sequence
MYAKANAKKQ DKIQNKGGVL QKAKSPNRRN IKQQNPTEII QRIRINPESM TQKDVVQLQH 
TIGNQAVQRL MSRLHNKNGY NAEDKPIQKK EEKKKEIQSI SERNSPLGLP INLKEGLESL
SNIDLSDVQV HYNSDKPQDV GALAFTQGNN IHIAPGQEKL LPHEGWHTVQ QKQGRVQPTM
QMKTGTLVNE DAGLEKEADA MGSRAERESS GNKTLQFKGY SKLNLPDYNE NVIQKKDNAS
KIGKKVIYNQ QTKNYKIINS KDGYEKDWSE TPPKCMQIVY SKTLKYALCN NTGEIASYLT
SGWYTDPLWA QGIKVSSYDV SLQDSLNKQM KLSAKPQTQK NGKWVNAETD QVKKYLDPSN
FNDGVSKYQF LDLSASADIS EKEMTKFLSG KGVLSGHAKT YLDAAKKYNV SEVYLAAHSA
LETGNGTSEL AKGVKVEGVK VYNMYGINAT DKDPVGEGSK YAYKMKWTSI DKAIDGGAEW
ISKNYINSSS HSQNTLYKMR WNPASPGEHQ YATDIAWAVN QTSSLKKMYD SFPSASLKFD
IPVYK