Gene Ccel_1618 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCcel_1618 
Symbol 
ID7310372 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium cellulolyticum H10 
KingdomBacteria 
Replicon accessionNC_011898 
Strand
Start bp1954778 
End bp1955830 
Gene Length1053 bp 
Protein Length350 aa 
Translation table11 
GC content32% 
IMG OID643608547 
Productbifunctional 3-deoxy-7-phosphoheptulonate synthase/chorismate mutase 
Protein accessionYP_002505950 
Protein GI220929041 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1605] Chorismate mutase
[COG2876] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase 
TIGRFAM ID[TIGR01361] phospho-2-dehydro-3-deoxyheptonate aldolase
[TIGR01801] chorismate mutase domain of gram positive AroA protein 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTAAAA GTTTAAAAGC TATAAGACAA GAAATTGATA ATATTAATGA TTCTATCCTT 
GAAATGCTCA ATAAAAGAAC AGAATTAATA AAGGAAATAA CAGATTTAAA AGATCAAAAC
GGCTCTGAAT ATTTTGATCC TGAACGTGAA ACAGAGATGA TGAAAAAGGT TCTAAGCAAA
AATAGCGGTC CTTTATATAA CGAGCTTATA AGGGAGGTTT TTAGTGCTAT TTTTTCTACA
TCACTAAAAT TTATGGGCAT AAGCCGTCAA AAAAAACTGT TGGTAAGTTC AAGTTCGAAT
GCATGTTTTA AGAGTATTAA TGAAATGTTT GGATTAGGGA ATAATGAACC GGTTATAATT
GCTGGACCAT GTGCTGTTGA AACGCCAGAA TACCTTGAAA CAATAGCAAA GCACTTAAGA
GATAAAAATA TCAGATTTAT AAGAGCAGGT GCCTATAAGC CAAGATCATC ACCATATGAC
TTTCAAGGAT TAAAGGAAAA TGGTTTAAAA ATACTACAAG ACGTTTCTAA ACGCTATGGA
CTCTTTAGTA TTACGGAAGT TGTTGACACA AGGGACGTAA ACTTAGTAAC ACAGTACGTA
GATATACTTC AAATTGGTGC AAGAAATATG CAAAATTTTG AACTACTAAA AGAGGTAGGT
AAAACTAATC ACCCAGTATT ACTAAAAAGA GGTATTAGTG CAACTATCCA AGAATTAATG
CTTGCGGCAG AGTATATTGC ATTAAAAGGA AATAATAAGA TAATTTTATG TGAGCGTGGA
ATTAGAACTT ATGAAACAAA AACAAGGAAT ACACTTGATA TTTCTTCAAT ACCTATCATT
AAAAAAGAAA CACACTTGCC TATTGTAGCT GACATAAGTC ATTCACTTGG AAGAAAAGAT
ATTGTTAATA ATATTGCAAA AGCTGTTCTT GCAGCAGGTG CTGATGGCAT TATGGTAGAG
GTGCACCCAA TTCCTGAACT TGCTCTTTCA GATAGTAAAC AACAGCTTAA TTTGAGTGAA
TTTGACGATA TGCTTGATTT TATAAAAAGA TAA
 
Protein sequence
MSKSLKAIRQ EIDNINDSIL EMLNKRTELI KEITDLKDQN GSEYFDPERE TEMMKKVLSK 
NSGPLYNELI REVFSAIFST SLKFMGISRQ KKLLVSSSSN ACFKSINEMF GLGNNEPVII
AGPCAVETPE YLETIAKHLR DKNIRFIRAG AYKPRSSPYD FQGLKENGLK ILQDVSKRYG
LFSITEVVDT RDVNLVTQYV DILQIGARNM QNFELLKEVG KTNHPVLLKR GISATIQELM
LAAEYIALKG NNKIILCERG IRTYETKTRN TLDISSIPII KKETHLPIVA DISHSLGRKD
IVNNIAKAVL AAGADGIMVE VHPIPELALS DSKQQLNLSE FDDMLDFIKR