Gene Cphy_3843 details


Gene Information

Locus tag: Cphy_3843
Symbol:
ID: 5744795
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium phytofermentans ISDg
Kingdom: Bacteria
Replicon accession: NC_010001
Strand:
Start bp: 4707224
End bp: 4708408
Gene Length: 1185 bp
Protein Length: 394 aa
Translation table: 11
GC content: 42%
IMG OID: 641294955
Product: tryptophan synthase subunit beta
Protein accession: YP_001560929
Protein GI: 160881961
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0133] Tryptophan synthase beta chain
TIGRFAM ID: [TIGR00263] tryptophan synthase, beta subunit
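
The length fields in this record can be cross-checked against the coordinates. The following is a minimal Python sketch, assuming the start and end positions are 1-based and inclusive and that the gene length counts the terminal stop codon while the protein length does not. The function names are illustrative only and are not part of any database tool.

```python
# Coordinates copied from the record above.
START_BP = 4707224
END_BP = 4708408


def gene_length(start: int, end: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1185 394
```

Both values agree with the Gene Length and Protein Length fields listed in the record.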


Plasmid Coverage information

Num covering plasmid clones: 37
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAGAAG GAAGATTTCA TCAATATGGC GGTCAATATG TACCAGAAAC ATTAATGAAT 
GCGGTGTTAG AGGTAGAAAA GGCATACGAG TATTTTAAAA AGGATCCTGA TTTTTGTAAG
GAACTAGAGA CCTTATACCA TGAATATGCA GGAAGGCCAT CGTTGTTATA TTACGCTAAG
AAAATGACCG AGGATCTTTC TGGGGCTAAA ATTTACTTAA AGCGAGAGGA TTTAAATCAT
ACGGGTTCTC ACAAAATTAA TAATGTACTA GGTCAGGTAT TATTGGCAAA AAAAATGGGT
AAGACACGTG TCATAGCAGA AACTGGAGCC GGCCAACATG GTGTGGCGAC AGCAACAGCC
GCAGCACTTA TGGGACTGGA ATGTGAAATC TTTATGGGAA AAGAGGACAC AGACCGACAG
GTACTGAATG TCTATCGAAT GGAACTATTG GGAGCTAAGG TGCATCCAGT AACCTCAGGA
ACTATGACTC TTAAGGATGC AGTAAACGAA ACGATGCGTG AGTGGACGAA GAGGGTAGAG
GATACTCATT ATGTTTTAGG GTCTGTTATG GGACCTCATC CTTTCCCAAC AATTGTTCGA
GATTTTCAGA AAGTGATTGG TAAGGAAATC AAAGCTCAAC TACAGGAAGT GGAAGGAAAA
CTTCCAGATG CAATCGTTGC CTGTGTTGGT GGAGGGAGTA ATGCTATGGG AGCATTTTAT
GAATTCCTAA ATGATCCTAG TGTAGCTTTA TATGGTTGTG AGGCAGCAGG ACTTGGTGTA
AATCATCCTA AAAATGCAGC TACCATCGCA AATGGAACAG AAGGTATTTT CCATGGAATG
AAATCTTATT TCTGCCAGGA TGAATATGGT CAAATTGCTC CTGTTTACTC TATTTCTGCG
GGTCTTGATT ACCCTGGAAT CGGACCGGAG CATGCTATGT TACATGATAC CAATCGGGCA
ACTTATGTAC CAGTTACGGA CGATGAAGCG GTGGAGGCAT TTGAATATCT TTCAAGAACA
GAAGGAATTA TACCTGCAAT AGAGAGTGCT CATGCTGTTG CATACGCAAA GAAGTTAGCG
CCAACGATGG GGAAAGACAG TATCCTTGTG ATAAATATCT CAGGACGTGG AGATAAGGAT
GTTGCTGCGA TTGCTAGATA TAGGGGGGTG AAATTATATG ACTAG
 
Protein sequence
MKEGRFHQYG GQYVPETLMN AVLEVEKAYE YFKKDPDFCK ELETLYHEYA GRPSLLYYAK 
KMTEDLSGAK IYLKREDLNH TGSHKINNVL GQVLLAKKMG KTRVIAETGA GQHGVATATA
AALMGLECEI FMGKEDTDRQ VLNVYRMELL GAKVHPVTSG TMTLKDAVNE TMREWTKRVE
DTHYVLGSVM GPHPFPTIVR DFQKVIGKEI KAQLQEVEGK LPDAIVACVG GGSNAMGAFY
EFLNDPSVAL YGCEAAGLGV NHPKNAATIA NGTEGIFHGM KSYFCQDEYG QIAPVYSISA
GLDYPGIGPE HAMLHDTNRA TYVPVTDDEA VEAFEYLSRT EGIIPAIESA HAVAYAKKLA
PTMGKDSILV INISGRGDKD VAAIARYRGV KLYD
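
The relationship between the two sequences above can be illustrated with a short translation check. This is a minimal Python sketch, not the database's own pipeline: it builds the codon table in the conventional TCAG ordering and translates only the first 60-base row of the gene sequence. It assumes the amino acid assignments of translation table 11 match the standard code for elongation codons (the tables differ in permitted start codons, which this sketch does not model). The names `CODON_TABLE` and `translate` are illustrative.

```python
import itertools

# Codon table in TCAG order; amino acid string follows the standard genetic code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(itertools.product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1, ignoring whitespace and stopping at the first stop codon."""
    cleaned = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(cleaned) - len(cleaned) % 3, 3):
        aa = CODON_TABLE[cleaned[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First row of the gene sequence and first two blocks of the protein sequence above.
FIRST_ROW_DNA = "ATGAAAGAAG GAAGATTTCA TCAATATGGC GGTCAATATG TACCAGAAAC ATTAATGAAT"
FIRST_20_AA = "MKEGRFHQYG GQYVPETLMN"

peptide = translate(FIRST_ROW_DNA)
print(peptide == FIRST_20_AA.replace(" ", ""))  # True
```

Passing the full 1185-base gene sequence to the same function should reproduce the full protein sequence under the same assumptions.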