Gene Ccel_1039 details

Gene Information

Locus tag: Ccel_1039
Symbol:
ID: 7309861
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium cellulolyticum H10
Kingdom: Bacteria
Replicon accession: NC_011898
Strand:
Start bp: 1294109
End bp: 1295347
Gene Length: 1239 bp
Protein Length: 412 aa
Translation table: 11
GC content: 36%
IMG OID: 643607966
Product: Pyridoxal-dependent decarboxylase
Protein accession: YP_002505381
Protein GI: 220928472
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0076] Glutamate decarboxylase and related PLP-dependent proteins
TIGRFAM ID:
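
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 1294109
end_bp = 1295347
gene_length_bp = 1239
protein_length_aa = 412

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and does not encode a residue.
codon_count = span_bp // 3
expected_protein_length = codon_count - 1

print(span_bp, codon_count, expected_protein_length)
```

Under these assumptions the span equals the reported gene length, divides evenly into codons, and gives the reported protein length.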


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGAAGATC AGAGATGGAA TTTTCCCAGA AATGGGTTGA GCATGGACCA AATTAAGGAA 
TATATGAACC CTGGCAGAAA TTATGATTGC TTAGACAATG GCAAGGTGTT TCTCGGATAT
CCGCAGACTA CTCCCCACCC TATTGCTATT AAGACGTACA AGAATTATCT GCAATATAAC
GATAACCATG TGGGGACATT TAGTAACAAT AACACTGACT TGAATATATC CAGAAAAATG
GAGAAACAGT TTATCGAAAT GTTAGGTGAT TTGTATGGAG ATATTGAAGC AGATGGTTAT
GTTACATCAG GGGGTACAGA GGGTAATATT ATGGGAATAT GGGTAGGAAA ATATTACCTT
GGCGGAGGGG AGACAGATAA CCTTTGCCTG ATAAAAACAT ACCTTACACA TCAATCAATT
GATAAGGCTT GCAGCTTAAA CAATATTACA AACATAATTG AAATTCCATA TAACCAAAAT
TTTGAGATGG ATACCAATTT ACTTAGAAAT GAAATTGATT TTCAGATTGA ATCCGGTAAA
AACAGAATTA TAATTGTTGC TACAGTAGGC TATACAATGA CGGGGACAAG TGATCCCATT
GATGAGATTG ATAAAATAAT ACAGGATTAT TCCAGAAATA AAGATGTCAG TTTTTATCTT
CACGTTGATG CAGCTATTGG AGGGCTTGTA TATCCGTTTT GCAAAAAGGA AGATTTTGCA
TTCCAATATC CTAGTGTTAA GTCTTTAACT GTTGATCCTC ATAAAATGGG ATATGTACCT
TTTTCAGCCG GTGTTTTCCT TTGCAGACGC AATTTGCAGG ATTGTGTTGC AATCCCTATA
AAGTATGCAA AAACCGTTAT GGATAAAACA TTGGTCAGTT CAAGAAGTGC GGCAGCAGCA
GCAGCATGCT GGACAACCTT CAACTATTTG GGCATAGCGG GATTTGAAAA AAAAATAAAA
AAGCTTATTT CTATCAAGGA GTATTTAGTT GAAAAAGTCT TGGCTGATAA ATTGGCTGTA
TTAATTTCCG ATCCGGGGAC TAATATGGTA TGCCTGTACT TTGATTCTCT TGCTCAGGGA
TTACTGCCTG AATGGATTGA AAAAAAGTAT ACCTTGGACG GATTTTTGTT GAAATGTAAA
GACGAAATGA TTATATGCTA CAAGGTATAT ATCATGCCTC ATGTTACTAA AAGAGCTATT
CTTCAGTTTG TTGATGACAT TCGAGCGTTA GCCTGCTAA
 
Protein sequence
MEDQRWNFPR NGLSMDQIKE YMNPGRNYDC LDNGKVFLGY PQTTPHPIAI KTYKNYLQYN 
DNHVGTFSNN NTDLNISRKM EKQFIEMLGD LYGDIEADGY VTSGGTEGNI MGIWVGKYYL
GGGETDNLCL IKTYLTHQSI DKACSLNNIT NIIEIPYNQN FEMDTNLLRN EIDFQIESGK
NRIIIVATVG YTMTGTSDPI DEIDKIIQDY SRNKDVSFYL HVDAAIGGLV YPFCKKEDFA
FQYPSVKSLT VDPHKMGYVP FSAGVFLCRR NLQDCVAIPI KYAKTVMDKT LVSSRSAAAA
AACWTTFNYL GIAGFEKKIK KLISIKEYLV EKVLADKLAV LISDPGTNMV CLYFDSLAQG
LLPEWIEKKY TLDGFLLKCK DEMIICYKVY IMPHVTKRAI LQFVDDIRAL AC
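
The gene and protein sequences above are related by translation table 11, which assigns the same amino acids to sense codons as the standard genetic code. The sketch below shows one way to strip the 10-base grouping, compute a GC fraction, and translate a DNA string. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical and chosen only for illustration; the short strings used at the end are the first and last codons of the gene sequence shown above.

```python
# Codon table in the NCBI layout: bases ordered T, C, A, G at each position.
# Sense-codon assignments are shared by the standard code and table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(dna: str) -> float:
    """Return the fraction of G and C bases in a cleaned DNA string."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First six codons and last three codons of the gene sequence.
head = clean("ATGGAAGATC AGAGATGG")
tail = clean("GCCTGCTAA")
print(translate(head), translate(tail), gc_fraction(clean("ATGGAAGATC")))
```

Passing the full gene sequence through `clean` and `translate` should reproduce the protein sequence listed above, and `gc_fraction` on the same input can be compared against the reported GC content.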