Gene Ccel_3218 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCcel_3218 
Symbol 
ID7311800 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium cellulolyticum H10 
KingdomBacteria 
Replicon accessionNC_011898 
Strand
Start bp3753825 
End bp3755009 
Gene Length1185 bp 
Protein Length394 aa 
Translation table11 
GC content44% 
IMG OID643610120 
Producttryptophan synthase subunit beta 
Protein accessionYP_002507488 
Protein GI220930579 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0133] Tryptophan synthase beta chain 
TIGRFAM ID[TIGR00263] tryptophan synthase, beta subunit 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATAAAAG GCAGATATGG AAAGCATGGG GGGCAATATA TCCCTGAAAT ATTAATGAAC 
ACCATAAATG AACTTGAAGA GAGTTATAAC TATTATAGAA ATGATTTTGA CTTCAACAGG
GAGCTTAATA CCTTATTGAA GGAATACGCA GGAAGGCCCT CCCTGTTATA TTTCGCAAAA
AAAATGACAG AGGATTTGGG TGGTGCAAAA ATATATCTAA AGCGTGAAGA CCTGAATCAC
ACAGGTTCCC ACAAGATAAA CAATGTTCTG GGACAGGTGC TTCTGGCAAA GAAAATGGGC
AAAAAGCGTG TCATAGCCGA GACAGGTGCT GGACAGCACG GTGTGGCTAC TGCAACCGCT
GCGGCACTTA TGGGTCTTGA TTGCGAAATT TTTATGGGTC TGGAGGATAC TAAGCGTCAG
GCGTTAAATG TTTTCAGAAT GGAGCTGCTG GGTGCAAAAG TCCACCCGGT TACAAGCGGA
ACACAAACCT TGAAGGACGC AGTTAATGAG ACTTTCCGTG AGTGGGCTTC AAGAATGGAT
GACACCGCCT ATGTACTGGG TTCTGTTATG GGGCCTCATC CATTTCCTAC GATTGTAAGA
GATTTCCAGA GTGTTATTGG TAAGGAAGTC AGGGAACAGA TGTTGGAGAA AGAAGGCAGG
CTCCCGGATG TTGCCATGGC TTGCGTTGGC GGCGGCAGTA ATGCTATGGG ACTTTTTTAT
GACTTTATAG GCGACAAATC CGTTGAGCTG ATAGGATGTG AAGCCGCCGG AAAAGGTGTA
GATACTGAAT TGCATGCAGC TACTATAGCA AAAGGACAGC TTGGAATATT CCACGGTATG
AAATCGTATT TTTGTCAGGA CGAATACGGA CAAATTGCTC CCGTTTACTC TATTTCGGCA
GGCTTGGATT ACCCCGGAAT AGGTCCCGAA CATGCAAACC TCCATGACAC GAGCCGTGCT
AAATATGTCC CCATAACTGA TGCAGAGGCG GTTACAGCCT TTGAATATCT TTCACGTACC
GAAGGTATCA TTCCGGCAAT TGAAAGCTCC CATGCAGTTG CACATGCCAT GAAAATTGCA
CCTAAAATGG AAAATGACAA AATAATAGTT ATTTGCCTTT CAGGAAGAGG AGATAAGGAT
GTTGCTGCTA TTGCAAAATA TATGGGGGTG AATATTGATG AGTAA
 
Protein sequence
MIKGRYGKHG GQYIPEILMN TINELEESYN YYRNDFDFNR ELNTLLKEYA GRPSLLYFAK 
KMTEDLGGAK IYLKREDLNH TGSHKINNVL GQVLLAKKMG KKRVIAETGA GQHGVATATA
AALMGLDCEI FMGLEDTKRQ ALNVFRMELL GAKVHPVTSG TQTLKDAVNE TFREWASRMD
DTAYVLGSVM GPHPFPTIVR DFQSVIGKEV REQMLEKEGR LPDVAMACVG GGSNAMGLFY
DFIGDKSVEL IGCEAAGKGV DTELHAATIA KGQLGIFHGM KSYFCQDEYG QIAPVYSISA
GLDYPGIGPE HANLHDTSRA KYVPITDAEA VTAFEYLSRT EGIIPAIESS HAVAHAMKIA
PKMENDKIIV ICLSGRGDKD VAAIAKYMGV NIDE