Gene Cthe_0683 details


Gene Information

Locus tag: Cthe_0683
Symbol:
ID: 4810301
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 840619
End bp: 841932
Gene Length: 1314 bp
Protein Length: 437 aa
Translation table: 11
GC content: 42%
IMG OID: 640106100
Product: diaminopimelate decarboxylase
Protein accession: YP_001037111
Protein GI: 125973201
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0019] Diaminopimelate decarboxylase
TIGRFAM ID: [TIGR01048] diaminopimelate decarboxylase
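
The length fields in this record follow from the coordinates. As a minimal sketch, assuming the start and end positions are 1-based and inclusive (an assumption that matches the listed gene length) and that the final codon is a stop codon, the arithmetic can be checked as below. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(840619, 841932)
print(length)                  # 1314
print(protein_length(length))  # 437
```

Both values agree with the Gene Length and Protein Length fields above.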


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00000143238
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTTTGTTT CAAAAGCTTT AAAGGTTAAC AGTAAAAATC ACCTGGAAAT CGGCGGATGT 
GATTGCGTTG ATCTTGTAAA CAATTTCGGT ACTCCCTTGT ATGTAATGGA TGAAAGTCTT
ATAAGGGAAA ATTGCCGTAT ATATAAAAAT GCACTGGACA AGTATTATAA CGGAAACGGA
CTGGTACTTT ACGCCAGCAA GGCTTTCTGT ACAATGGCAA TGTGCAAAAT TGTCCAGCAG
GAAGGCCTGG GTCTTGACGT GGTATCGGGC GGAGAGCTGT ACACCGCGAT TAAAGCGGGA
TTTCCCATGG AAAAGGTGTA TTTTCACGGA AACAATAAAA CCATTGACGA ACTGGAGCTG
GCGATTGACA ACAATGTAAG AAGAATAGTA GTGGATAATA GGCAGGAACT TTTGCATGTA
AACAGAATTG CAGCAGAAAA AGGCAAGACA GTAAACATTT CTTTCAGAAT AAAACCCGGA
ATTGATGCTC ATACTCATGA CTTTATCCGG ACAGGTCAGA TTGACTCAAA ATTTGGTGTT
GCCCTTGAAA ACGGTGAGGC AATGGAAATA ATAGGCGAAG CGGTGAAACT GAGCAATGTG
AAGGTGGTTG GACTTCATTG CCACATAGGC TCTCAAATTT TCGAGCTTGC TCCTTTTGAG
GAAGCTGCAA GGGTGATGCT TACCTTTATT GCAAAAATAA AGGAAGAGCT GGGTATAGAA
ATTGAGGAGC TGAACCTTGG AGGAGGCTTT GGGATAAAAT ATACCCAGGA TGACGACCCG
ATAGAGTATG ACCGTTATAT AAAATCAGTA TCGGAAGTTG TGAAAAGTGT GTGCGAAGAC
AAGGGAATAA AGCTTCCGTT TATAGTTATA GAGCCGGGAA GGTCCATTGT TGCATCTGCG
GGAATAACGC TCTACAGAAT TGGCACTATA AAAGATATCA AGGGTGTCAG AAAATATATC
GCCGTTGACG GCGGAATGAC CGACAACCCA AGATATGCCC TCTATCAGTC AAAATATGAA
GGTGTTATTG CCAATAAAGC TGATGCGGCA AAAACAGAAA AGGTTACAAT TGCGGGCAAG
TGCTGTGAAT CCGGGGACCT GCTTGGCAAG GACGTATTGC TTCCCGAAGC GGAGGAAGGG
GATATTTTGG CAATACTTGC TACCGGTGCA TACAACTATT CCATGTCCAG TAACTACAAC
CGTATTCCAA GACCTGCGGT GGTTCTTGTA AAAGACGGTA AAGCGCGGGT TATTGTTAAA
AGGGAAGACT ATAACGATAT AATAAGAAAC GATATTATCC CTGAAGATCT GTAA
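
The GC content reported for this gene can be recomputed from the sequence above. The following is a minimal sketch, assuming the sequence is pasted as plain text in the 10-base, space-separated blocks shown here; `gc_percent` is a hypothetical helper name.

```python
def gc_percent(dna: str) -> float:
    """Percentage of G and C bases, ignoring spaces and line breaks."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Small illustration on a toy sequence (not the gene itself):
print(gc_percent("ATGC GCAT"))  # 50.0
```

Passing the full gene sequence to `gc_percent` and rounding the result gives a value to compare against the GC content field.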
 
Protein sequence
MFVSKALKVN SKNHLEIGGC DCVDLVNNFG TPLYVMDESL IRENCRIYKN ALDKYYNGNG 
LVLYASKAFC TMAMCKIVQQ EGLGLDVVSG GELYTAIKAG FPMEKVYFHG NNKTIDELEL
AIDNNVRRIV VDNRQELLHV NRIAAEKGKT VNISFRIKPG IDAHTHDFIR TGQIDSKFGV
ALENGEAMEI IGEAVKLSNV KVVGLHCHIG SQIFELAPFE EAARVMLTFI AKIKEELGIE
IEELNLGGGF GIKYTQDDDP IEYDRYIKSV SEVVKSVCED KGIKLPFIVI EPGRSIVASA
GITLYRIGTI KDIKGVRKYI AVDGGMTDNP RYALYQSKYE GVIANKADAA KTEKVTIAGK
CCESGDLLGK DVLLPEAEEG DILAILATGA YNYSMSSNYN RIPRPAVVLV KDGKARVIVK
REDYNDIIRN DIIPEDL
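
The protein sequence can be cross-checked against the gene sequence by translating it. The sketch below assumes the gene sequence is given 5' to 3' in the coding orientation and in frame from the first base. It uses the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code (the two differ in the start codons they permit). `translate` is a hypothetical helper written for illustration.

```python
from itertools import product

# Codons enumerated in T, C, A, G order, paired with the standard amino acid string.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence:
print(translate("ATGTTTGTTT CAAAAGCTTT AAAGGTTAAC"))  # MFVSKALKVN
```

The first ten residues match the start of the protein sequence above; running the same function over the full gene sequence allows a complete comparison.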