Gene Ccel_3003 details


Gene Information

Locus tag: Ccel_3003
Symbol:
ID: 7312435
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium cellulolyticum H10
Kingdom: Bacteria
Replicon accession: NC_011898
Strand:
Start bp: 3555006
End bp: 3556196
Gene Length: 1191 bp
Protein Length: 396 aa
Translation table: 11
GC content: 42%
IMG OID: 643609907
Product: glycosidase PH1107-related
Protein accession: YP_002507277
Protein GI: 220930368
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2152] Predicted glycosylase
TIGRFAM ID:
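
The length fields in this record follow from its coordinates. As a minimal sketch, assuming Start bp and End bp are 1-based inclusive positions (the record does not state the convention) and that the final codon is a stop codon not counted in the protein length, the arithmetic can be checked as follows. The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length from 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Coordinates taken from the record for Ccel_3003.
length = gene_length_bp(3555006, 3556196)
print(length, protein_length_aa(length))  # prints: 1191 396
```

Both results match the Gene Length and Protein Length fields of the record.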


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.87538
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGTTTTTG ATGAGAGATT GAATTATTTT ATCCAAAAAT ATAATGGATT GATTAAAAGA 
AAAAACTCCG TTCAAGAAGG CGGAAACGGC ATATACGATA AATACATATA TCCTGTAATA
ACCAAGGACC ACACACCTCT TTTCTGGAGA TATGACCTTG ACAGGAATAC TAATCCCTAT
CTTATGGAAC GCCTCGGCGT CAATTGTGCT TTCAACCCGG GAGCTATTGA GCTTAACGGG
AAAATATACC TTGTAGTACG TATAGAAGGG AACGATAGAA AATCCTTCTT TGCAGTGGCT
GAAAGTGAAA GTGGCATTGA TAACTTCACT TTCTGGGATT ACCCTGTGGT AATGCCGGAA
ACAAAAAACC CTGATATCAA TATATATGAC ATGAGGCTTG TAAAACATGA GGACGGTTGG
ATTTACGGTC TTTTCTGTAC AGAGAGAAAA GACCCGGAGG CTGCCTGCGG CGATACCTTC
AGTGCAGTAG CGGCATGCGG CATTGCTAGA ACTAAAGATT TAAAATTATG GGAGCGTTTG
CCTGATTTGA AAACAGCATC ACCTCAACAG AGAAATGTGG TCCTTCACCC AGAGTTCATA
AAGGGAAAAT ATGCCCTTTA CACACGTCCT CAGGACGGCT TTATCGAAAT AGGAAAGGGC
GGCGGAATAG GATTCGGTTA TACCGACAGT ATGGAAAATG CTGTTATATA TGAAGAAATA
CTGATGGAAT GCAAGGAATA CCACACTATT AAAGAGGTCA AAAACGGCCA AGGGCCCACC
CCTTTGAAGA CTGAAAAAGG CTGGCTTCAC ATTGCCCATG GCGTAAGAAA TACGGCTGCC
GGACTAAGAT ATGTTGTATA TGCCTTTCTT TCCGACCTTG AACATCCCGA AATTGTAACC
CATCGTCCGG GTGGCTTTTT AATCGCTCCT GAAGGTGAAG AACGAATCGG GGACGTTTCA
AATGTGGTCT TCTGTAACGG TGTTGTAGCC AGACAAAACG GAGATGTTCT TATTTACTAT
GCTTCCTCCG ATACACGTTG CCATGTAGCT TCTACCACTA TAGACAAGCT TCTGGACTAT
GTCATTAATA CCCCGGAGGA CCCTCTAAGA TCCTATGCAT GTGTGCAGCA GAGAATTGAC
CTGATTTCAA GAAATTTGAG GCTGATTAAG AATAACACTG TTTCTGTGTA A
 
Protein sequence
MVFDERLNYF IQKYNGLIKR KNSVQEGGNG IYDKYIYPVI TKDHTPLFWR YDLDRNTNPY 
LMERLGVNCA FNPGAIELNG KIYLVVRIEG NDRKSFFAVA ESESGIDNFT FWDYPVVMPE
TKNPDINIYD MRLVKHEDGW IYGLFCTERK DPEAACGDTF SAVAACGIAR TKDLKLWERL
PDLKTASPQQ RNVVLHPEFI KGKYALYTRP QDGFIEIGKG GGIGFGYTDS MENAVIYEEI
LMECKEYHTI KEVKNGQGPT PLKTEKGWLH IAHGVRNTAA GLRYVVYAFL SDLEHPEIVT
HRPGGFLIAP EGEERIGDVS NVVFCNGVVA RQNGDVLIYY ASSDTRCHVA STTIDKLLDY
VINTPEDPLR SYACVQQRID LISRNLRLIK NNTVSV
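
The sequences in this record are printed in space-separated blocks of ten characters. Below is a minimal sketch of how such text could be cleaned before further analysis, assuming whitespace is the only formatting to strip. The helper names `clean_sequence` and `gc_fraction` are hypothetical and not part of any database tool. The GC content reported in the record is presumably rounded, so the sketch does not assert a value for the full gene.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and upper-case it."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in an already cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence, copied from the record.
first_line = "ATGGTTTTTG ATGAGAGATT GAATTATTTT ATCCAAAAAT ATAATGGATT GATTAAAAGA"
seq = clean_sequence(first_line)
print(len(seq), seq[:3])  # prints: 60 ATG
```

Passing the full gene sequence through `clean_sequence` and then `gc_fraction` would give a value to compare against the GC content field.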