Gene Ccel_3098 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCcel_3098 
Symbol 
ID7311695 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium cellulolyticum H10 
KingdomBacteria 
Replicon accessionNC_011898 
Strand
Start bp3634475 
End bp3635626 
Gene Length1152 bp 
Protein Length383 aa 
Translation table11 
GC content36% 
IMG OID643610002 
Productexopolysaccharide biosynthesis protein 
Protein accessionYP_002507370 
Protein GI220930461 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG4632] Exopolysaccharide biosynthesis protein related to N-acetylglucosamine-1-phosphodiester alpha-N-acetylglucosaminidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000562756 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAAAA ATAATCAGAA ACTGATTATA AATATTGGTT TGGTTTTAGT ATTGTCCTTC 
ATTTTAATTT ATGCAGGACA ATTTTTATTT TCCCCAAATA ATAATACTCC GGGAAAGGCT
GACAACACCT TGAATTCACC GGATACATCG TCAAATTCAG TACAGCCTTT ACCTGTTCAA
TACAAGAGTA CAACAGAAAC CATAAACGGG TATAAGCAGG AAATATATAT GCTGGAGTTT
GATCCAAGAG ACGAAAGAGT TGAATTCAAG CCAGCATTGT CATTTGATAA TATTTTCGGA
TTTGAAAAAT TATCTGATAT TTGTAAAAGG AATGGGGCAT ATGCAGCGGT TAATGGAGGA
TTTTTTTACC AATTTGGTGA TCCGGCAGGG ATGGTTGCTA TAGACGGCCA GATGCTCACG
ACATCAACGG GATTGAGTCC TGTACTTATT TTAGATAAAA TGGGTGCGAG ATTTGAAACC
TTTTATTCCA ATATTTTTTT GGAATCTAAA GGTAATAGAG TTAAGATAAA TGAGATGAAC
AGGGTAGGTA AAAATGATGA TATAATTTTA TATATTGACA AATTCGGAAA TACAAACAGA
GCTGAAGTAA AAAGTACATC ACTTATAGTT GATAACAATA AAATAATTTC TATAATTGAA
AGTACAAAAG AAGTTAACAT AAAAAAAGGT ATGTATGTCA TCAGCTTTTA CGGCGATAAA
TCATCGCTGC CTGACAAAAT TGGTTTAAAA ACGGGTGATA AAGTAAATAT TAGGATAGAA
CCGTATTTAG GTTATAATTA CCAGGCTTAT GAATGCGGGA GTATGCTTGT AAAAAACGGG
AAATCAGTAG TGCCGGAACG TGACAAATGG GCGGGAACTT TAGGTAACCG TGACCCTAGG
ACGGTTATTG GTATAAAAAC AAACGGCAAG ATAGTACTAG TGGTTGCCGA TGGTCGCCAG
CCGGGATATA GTGAAGGAAT GACGGGTAAA GAAATGGGTG AATTCCTAGT GAAAATAGGT
GTGAGGGATG CGGCAATGCT AGACGGCGGA GCCACTTCAC AGATGATAAT AAATGGCAGA
ATCCAGAACA GACCGTCCTA TGAAGGGATT GAGAGGCCAG TAGCTGGATG TTTTATAGTT
AAGATCAAAT AA
 
Protein sequence
MKKNNQKLII NIGLVLVLSF ILIYAGQFLF SPNNNTPGKA DNTLNSPDTS SNSVQPLPVQ 
YKSTTETING YKQEIYMLEF DPRDERVEFK PALSFDNIFG FEKLSDICKR NGAYAAVNGG
FFYQFGDPAG MVAIDGQMLT TSTGLSPVLI LDKMGARFET FYSNIFLESK GNRVKINEMN
RVGKNDDIIL YIDKFGNTNR AEVKSTSLIV DNNKIISIIE STKEVNIKKG MYVISFYGDK
SSLPDKIGLK TGDKVNIRIE PYLGYNYQAY ECGSMLVKNG KSVVPERDKW AGTLGNRDPR
TVIGIKTNGK IVLVVADGRQ PGYSEGMTGK EMGEFLVKIG VRDAAMLDGG ATSQMIINGR
IQNRPSYEGI ERPVAGCFIV KIK