Gene Ccel_2213 details


Gene Information

Locus tag: Ccel_2213
Symbol:
ID: 7310901
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium cellulolyticum H10
Kingdom: Bacteria
Replicon accession: NC_011898
Strand:
Start bp: 2585235
End bp: 2586275
Gene length: 1041 bp
Protein length: 346 aa
Translation table: 11
GC content: 40%
IMG OID: 643609145
Product: hypothetical protein
Protein accession: YP_002506535
Protein GI: 220929626
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG4632] Exopolysaccharide biosynthesis protein related to N-acetylglucosamine-1-phosphodiester alpha-N-acetylglucosaminidase
TIGRFAM ID:
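
The coordinate and length fields above are mutually consistent, which the following minimal Python sketch illustrates. It assumes the start and end positions are 1-based and inclusive, and that the gene length counts the terminating stop codon; the function names are hypothetical and not part of any database API.

```python
# Coordinates copied from the record above.
start_bp = 2585235
end_bp = 2586275


def gene_length(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS whose length includes the stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1  # drop the stop codon


gene_len = gene_length(start_bp, end_bp)
print(gene_len)                  # 1041
print(protein_length(gene_len))  # 346
```

Both values match the Gene length and Protein length fields of the record.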


Plasmid Coverage information

Num covering plasmid clones: 30
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAATTACG ATACTATATC TTTAAACGAA AAGCAAGTCA AGACCATACA AAAAAAGTCT 
GCCAAAAAGA AGAAAAAGAA AAAAGGCAGA CTCAGGAGTC TTTTGGGTTT TTTAATATTT
GAGTTTTTCT TTATGTCAAT AACAACACCG CTGCTTATAT TCTACGGGCC CTTTGAGAAC
GTAAAAAGAA CCGCTACAGG CATGGTCTGG AACTCAATGA CCAAGCAGGT TATAGCCAAG
ACCTTCTTGT CTGATAAGGC TATTGCTAAA ATTCTGGGAG ACGGGTATGC AATCTCCAAT
ATAAATACTG AAGATATAAA AATGTTGGAT TTCAGGGTAA AGCATAATAA CAATCTGGAA
TATTTTGATG TTGAGAGCAG AAATTTCAAA GGCAAAATGA TTATAGTGGA TGATCCTACG
CGTATTAAAG TAGGGTATTC CAGCAAAATG CCGCGGTCTG GGGAAACTAC CAGCAGTATT
GCAAGGCGAA ACGGGGCAGT TGCTGCCATT AACGGAGGAG GCTTCATTGA CAAAGGGTGG
GCAGGAACTG GAGGAGTAGC AATTGGTTTT GTAATAAGCA ACGGCAAATA CATTAGCGGA
AAGCTGACTA ACAACTATAC AAAAAGGGAT ACTATTGCAT TTACAAAAGA TGGTATGTTA
ATTGTAGGTA AACATTCCCA AGCAGAACTA GCTAAATATA ATATTAAAGA GGGAATAAGC
TTCGGCCCGC CTTTAATTGT TAACGGCAAG CCTACTATCA ACAAGGGTGA CGGAGGCTGG
GGCATATCCC CAAGAACTGC AATAGGTCAA AAAGAAGATG GCTCAGTAAT GCTTCTTGTT
ATTGATGGAA GAAGCCTAAA GTCCTTTGGA GCAACTTTAA AAGAGGTTCA GGATATTATG
CTGGAGCACG GAGCAGTCAA TGCTGCAAAC CTTGATGGAG GTTCATCGGC TACCATGTAC
TATGACGGAA AAGTTGTAAA TACTCCGTCT GATGCGTTAG GAGAAAGAAC AGTAGCTACG
GCATTTGTTG TAATGCCTTG A
 
Protein sequence
MNYDTISLNE KQVKTIQKKS AKKKKKKKGR LRSLLGFLIF EFFFMSITTP LLIFYGPFEN 
VKRTATGMVW NSMTKQVIAK TFLSDKAIAK ILGDGYAISN INTEDIKMLD FRVKHNNNLE
YFDVESRNFK GKMIIVDDPT RIKVGYSSKM PRSGETTSSI ARRNGAVAAI NGGGFIDKGW
AGTGGVAIGF VISNGKYISG KLTNNYTKRD TIAFTKDGML IVGKHSQAEL AKYNIKEGIS
FGPPLIVNGK PTINKGDGGW GISPRTAIGQ KEDGSVMLLV IDGRSLKSFG ATLKEVQDIM
LEHGAVNAAN LDGGSSATMY YDGKVVNTPS DALGERTVAT AFVVMP
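
As a spot check that the protein sequence corresponds to the gene sequence, the sketch below translates the first and last rows of the gene sequence. It is a minimal illustration, not the database's own method. It assumes the amino-acid assignments of the standard genetic code, which translation table 11 shares (table 11 differs in the start codons it permits); `translate` is a hypothetical helper name.

```python
from itertools import product

# Codons in NCBI order (bases T, C, A, G; third position varies fastest),
# paired with the standard amino-acid assignments. '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1, ignoring whitespace and any trailing partial codon."""
    dna = "".join(dna.split()).upper()
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First 30 bases and last 21 bases of the gene sequence, copied from the record.
first_row_start = "ATGAATTACG ATACTATATC TTTAAACGAA"
last_row = "GCATTTGTTG TAATGCCTTG A"

print(translate(first_row_start))  # MNYDTISLNE
print(translate(last_row))         # AFVVMP*
```

The two outputs agree with the first ten and last six residues of the protein sequence, with the final TGA codon read as the stop.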