Gene Cthe_0732 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_0732 
Symbol 
ID4810350 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp889176 
End bp890360 
Gene Length1185 bp 
Protein Length394 aa 
Translation table11 
GC content43% 
IMG OID640106149 
Productchorismate synthase 
Protein accessionYP_001037160 
Protein GI125973250 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0082] Chorismate synthase 
TIGRFAM ID[TIGR00033] chorismate synthase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0075518 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTAGGAA ATACATTCGG CAGAATATTC AGGGTTACAA CCTGTGGAGA ATCTTATGCA 
GGTGCTTTTC GCAAAAATCT TCAAATACCA AAAGAGTTGT TTGGGGGACT AATAGCAATC
GTGGACGGTG TTCCGCCCGG AATAAAGCTC ACTGCTGATT TCGTGCAGGA GGAGCTTGAT
AAAAGAAGAC CGGGAAAAAC TCCTTTGGAT ACACCAAGAA AAGAAAGGGA CAAAGTATAT
ATTTTTTCCG GAGTAATGGA AGATGATATT ACAACCGGTG CTCCTGTCGG GATGATTATA
CCCAATGACG TTATTGAGGA TGAGCACATT AACAAGCATA AGAGCTACAA AGAGGTTGTA
AGACCGGGAC AGGCAGGATA TACCTTTTTT AAGAAGTACG GACAATTTGC AGACAATATA
GGTGCAGGAA GAGCTTCCGG AAGAGAAACG GCTGCCCGTG TTGCCGCCGG AGCCGTGGCA
AAGGCGGTGT TAGATACCAT GGGTATAGAT GTAATTGCTT TTGTAACTGA AATACACGGA
ATTAAAGCCC AGGAAAATAT TACTTATGAA ATGGCCAAAG CCAATTATCG CAAAAATGAA
ATAAACTGCC CGGACCTTGA AAAAGCAAAA GAAATGATTG AAGAACTTAA AAGGATAAAG
GAAGAAGGAG ATTCTGTAGG CGGAGTGGTG GAAATAATTG CAAGAGGTGT TCCGGCAGGT
TTGGGAGAAC CCGTGTTTGA CAAGCTTCAG GCCACACTTG CCCACGCCTT AATGTCCATT
GGAGCCATAA AAGGGATAGA GTTTGGCGAA GGATTCGGTC ATACAAAATT AAAGGGTTCG
GAATCAAACG ATGTTCCTTA TTACGATGAA GCCTCAGGCC GTGTAAGATT TAAAACCAAC
AGAGCGGGAG GTATACTGGG CGGAATTTCC AACGGCGAGG ATATCAGAAT CAGAGTTGCG
GTTAAGCCGA CGCCTACTAT TTCAATACCT CAGAAAACAG TAAACATGTA CACTCTTGAG
AATGTTGAAG TAGAGTTTAA CACAAGAAAC GATCCTTCAA TATGTCCAAG AATTTATCCT
GTATGTGAAG CTATGGTCAG AATTGCTCTT TTGGATGCTT TATATATTGC AAAAGGCTAT
AGGGCAATCA GCAGCAACAT AGATCCCCGT TGGGACCGTT TATAA
 
Protein sequence
MVGNTFGRIF RVTTCGESYA GAFRKNLQIP KELFGGLIAI VDGVPPGIKL TADFVQEELD 
KRRPGKTPLD TPRKERDKVY IFSGVMEDDI TTGAPVGMII PNDVIEDEHI NKHKSYKEVV
RPGQAGYTFF KKYGQFADNI GAGRASGRET AARVAAGAVA KAVLDTMGID VIAFVTEIHG
IKAQENITYE MAKANYRKNE INCPDLEKAK EMIEELKRIK EEGDSVGGVV EIIARGVPAG
LGEPVFDKLQ ATLAHALMSI GAIKGIEFGE GFGHTKLKGS ESNDVPYYDE ASGRVRFKTN
RAGGILGGIS NGEDIRIRVA VKPTPTISIP QKTVNMYTLE NVEVEFNTRN DPSICPRIYP
VCEAMVRIAL LDALYIAKGY RAISSNIDPR WDRL