Gene Ccel_1472 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCcel_1472 
Symbol 
ID7310241 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium cellulolyticum H10 
KingdomBacteria 
Replicon accessionNC_011898 
Strand
Start bp1787108 
End bp1788382 
Gene Length1275 bp 
Protein Length424 aa 
Translation table11 
GC content37% 
IMG OID643608398 
Productaminodeoxychorismate lyase 
Protein accessionYP_002505806 
Protein GI220928897 
COG category[R] General function prediction only 
COG ID[COG1559] Predicted periplasmic solute-binding protein 
TIGRFAM ID[TIGR00247] conserved hypothetical protein, YceG family 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.120948 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGATAGAA GATTAAAAGT ATCATTGACG GGGATACTCA CCGTGATTAT TGGGGTTCTG 
CTATCGGTAC CTAACATCAT GAATTATTTC AAAACCCTGT TAAGAGGCTT TGATGCTAAA
TTCGGTCTGC TTAATGCAGT GCTGACGGCA GTTGGGCTTA CATTGTTTCT TTTTGGTTTA
TTTATAATAT CAGCCGGATT AAGGAAGCTA TCCTTCTGGC TGACCATCCT TATTGTTTTT
GTATTTATTT TCACTGTTGG GGCCGTTATT ACTTTTAGAA ATACGGTTTC CACGGATGTA
TCAGAGACAG TAACGGAGGA AATCAAAATA AAGGCTGATT CCGAAGGTGC AAAGATGATT
GATATACCTA TGGGTTCAGA TACTAAGACT ATTGCAGGCA TACTCACAAA TGAGGGCATT
ATCAACAAAC CGCAGATTTT CACAGTTGTA TCAAAAATAA ACGGTTTTGA TGGAAAGTAT
CAGGCTGGCA CACATATTTT GAAGCCGGGT CTGGAATTCA ATTCTATTAT GACAATTCTT
ACAGGGAAGC CTGAAAGCAA AAAGGTTACA ATACCTGAGG GCTTGAGCTA CAGACAGATT
GTCAATACGT TTGTTAAAAA AGAACTTGCA ACCACAGACA AGTTTGATTA TGCAATGAAG
TATGAAAAAT ACGATTACGA TTTTGTGAAA AACATGAAAA GTAGTAACAA TCGTGAATTT
CAGCTAGAAG GATATTTATT TCCCGATACA TACGAATTTG CCATGAATGC CAGTGAAAAG
ACAATAGTAA GTATAATGCT TGAAAACTTT AATAACAAGA TAACAAAAGA GCATTATAAA
CGTGCCAAGG AATTAGGTAT GTCGATGGAC GAAATTATTA CTCTTGCTTC CATTATTGAA
AGAGAGGCAA ATAATACTAA GGACAGAAGG CTGGTATCGG CAGTATTCCA TAGACGTTTA
AAAAGCAGGG ATTTGAATAG GTTGCAGTCC TGTGCTACCA TACAGTATGT TTTTCTAAAT
AAAGAAGGAA AAGTGCATGA AAAGCTTACT TACGAGGATA CTAAAATTAT AAGTCCATAT
AATACGTATA TTCATCCGGG TCTTCCACCG GGACCAATCT GTTCACCGGG CATGGATTCC
ATAAACGCAG CATTATACCC CGATGAAGAT ACAGACTACA TGTTCTTTAT CGCAGGGCCG
GAAGGTTCTA CTAAGTTCTC CAAGACATAT CAGGAGCATT TAAAGGCAAT GAAGCAATAT
GGATTGGCAA AATAA
 
Protein sequence
MDRRLKVSLT GILTVIIGVL LSVPNIMNYF KTLLRGFDAK FGLLNAVLTA VGLTLFLFGL 
FIISAGLRKL SFWLTILIVF VFIFTVGAVI TFRNTVSTDV SETVTEEIKI KADSEGAKMI
DIPMGSDTKT IAGILTNEGI INKPQIFTVV SKINGFDGKY QAGTHILKPG LEFNSIMTIL
TGKPESKKVT IPEGLSYRQI VNTFVKKELA TTDKFDYAMK YEKYDYDFVK NMKSSNNREF
QLEGYLFPDT YEFAMNASEK TIVSIMLENF NNKITKEHYK RAKELGMSMD EIITLASIIE
REANNTKDRR LVSAVFHRRL KSRDLNRLQS CATIQYVFLN KEGKVHEKLT YEDTKIISPY
NTYIHPGLPP GPICSPGMDS INAALYPDED TDYMFFIAGP EGSTKFSKTY QEHLKAMKQY
GLAK