Gene Ccel_2427 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCcel_2427 
Symbol 
ID7311099 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium cellulolyticum H10 
KingdomBacteria 
Replicon accessionNC_011898 
Strand
Start bp2927215 
End bp2928393 
Gene Length1179 bp 
Protein Length392 aa 
Translation table11 
GC content43% 
IMG OID643609358 
Productchorismate synthase 
Protein accessionYP_002506737 
Protein GI220929828 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0082] Chorismate synthase 
TIGRFAM ID[TIGR00033] chorismate synthase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00147882 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTAGGAA ATACATTTGG CAGATTATTC AGGGTTACAA CATGCGGAGA ATCCTATTCG 
GGAGCTTTTA GGAAGAATTC AGATATTCCG CCTGAATTGT GGGGCGGACT GGCAGTAATA
GTTGACGGTG TTCCCGCCGG ACTGAAAGTA ACTTCACAGA TTATACAGGA GGAACTGGAT
AAAAGAAAAC CTGGACAATC CCGGCTCCAC ACACCCAGAA CTGAAGCGGA TAAGGTCTAT
ATATTTTCAG GTGTAATGCA GGATGACAGA ACAACAGGAG CACCTGTATG TATGCTGATT
CCCAGCAGTG ATATTGGGGA TTATCATATA GAACAGCACA AGGGTAATAA GGACTTGTTA
AGACCCGGAC AGGCAGCGTA TACCTATTAT AAAAAATATG GGGAACATTC TGACTATCTT
GGAGCAGGAA GGGCCTCGGC GCGTGAAACA GTTGCAAGGG TGGCAGGAGG AGCAATTGCT
AAAATAATTC TCGATAGTAT GGGGATTGAT GTAATAGCGT TTACGATTGA GTCTCATGGA
ATAAAAGCAG GACCGTTTTC ATATGAAACG GCCAAACAGA ATTACAGAGC TAATGATATA
AACTGTCCCG ATTTGGATAT TGCAAAGCAT ATGATTGATG ACTTGCTTCA AGTAAAAAAG
GAGGGGGATT CCTGCGGAGG TGCAATAGAG ATAATAGCAA AGGGAGTACC TGCGGGACTT
GGAGAGCCTG TATTCGATAA GTTAAGTGCC ACAATTGCAC ATGGAATTAT GTCTATAGGC
GGTGTAAAAG GAATTGAGAT AGGAGATGGC TTTGGAGTAA CATCCAAAAA GGGCTCAGAA
TGTAATGACA CGCCTTATTA CGATGAGGAA ACAAGACGTA TCAGATTTAA AACAAACAGA
GCGGGCGGTA TGCTTGGAGG AATATCAAAC GGTGAAGAAA TTAGAATTCG TGTTGCTGTC
AAACCGACAC CAACTATTTT AAAGGATCAG CTGACAGTGA ATGTATCAAC TCTTGAACCG
GTTACCCATA AATTTGCGTC CAGAAGCGAC CCTTCGCTTG TACCGAGAAT ATACCCTATT
TGTGAAGCTA TGGTTAGAAT GGCACTGGTA GATAGTATAC TAATGGCTTC AGGTAGCAGG
AGCATAACAG ATATGGATAA CAGGTGGGAT AAGCTATGA
 
Protein sequence
MLGNTFGRLF RVTTCGESYS GAFRKNSDIP PELWGGLAVI VDGVPAGLKV TSQIIQEELD 
KRKPGQSRLH TPRTEADKVY IFSGVMQDDR TTGAPVCMLI PSSDIGDYHI EQHKGNKDLL
RPGQAAYTYY KKYGEHSDYL GAGRASARET VARVAGGAIA KIILDSMGID VIAFTIESHG
IKAGPFSYET AKQNYRANDI NCPDLDIAKH MIDDLLQVKK EGDSCGGAIE IIAKGVPAGL
GEPVFDKLSA TIAHGIMSIG GVKGIEIGDG FGVTSKKGSE CNDTPYYDEE TRRIRFKTNR
AGGMLGGISN GEEIRIRVAV KPTPTILKDQ LTVNVSTLEP VTHKFASRSD PSLVPRIYPI
CEAMVRMALV DSILMASGSR SITDMDNRWD KL