Gene Cthe_1960 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_1960 
Symbol 
ID4810743 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp2335339 
End bp2336421 
Gene Length1083 bp 
Protein Length360 aa 
Translation table11 
GC content46% 
IMG OID640107376 
Productpeptidoglycan binding domain-containing protein 
Protein accessionYP_001038371 
Protein GI125974461 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG3409] Putative peptidoglycan-binding domain-containing protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAAAAGA AGAAACCTGC TTTGTTTGCG GTCATTCTGC CTCTCTTAAT CGCCTGTATA 
ATGATTAATT CTTCAATGGT ATTTGCATCT TCAGGGATTT TGAAGGAGGG AATGAGCGGA
AGCCAGGTTA CATCACTGCA GAGGGATCTT AACACGCTGG GGTATCTTGA TGTAACTCCT
ACAGGTTATT ATGGCAGTCT TACAACAGCA GCAGTTAAGA AGCTTCAGAG AAATTACGGA
CTTAAAGAGG ACGGCATTGC GGGGCCTGAC ACTCTCTCGC TTATCAAAAG GCTGATAAAC
GAAAGGACTG CTTCAAGGTC TTCCGGCGGC ACAACGTTGA AAGAGGGTAT GAGCGGGAGC
AGTGTGACAG CTTTGCAGAA GGACTTGAAA GCTTTGGGCT ATCTGAGCGT GGATCCAACG
GGTTACTATG GAAGCCTTAC AAAAGAAGCG GTAAAGAAAC TTCAGGCAAA GCACGGTCTT
GAGCAGGACG GAATTGCAGG ACCGAAGACC TTGGCATTGA TTGACAGGCT TATGGGAAGA
AGCGGTAGTT CTGCTTCACA ATCCGCAGCT ACGGCATCCA GGGGAGGGCT CGATAAGACC
AATTACCTTT ATTCCTGGTT CGGTAATGCG GAAAACATTT TCAAGATAGG CGATACAGCA
CAGGTATATG ACATTAGGAC TGGGCGCACA TTTAATATAA AGAGGACTTA TGGCTATAAC
CATGCAGACT GTGAGACTTT AACCGCTAAA GACACGGAAA TAATGCTCAG TATCTACGGC
GGAAGCTGGA GTTGGGAAAG AAGACCGATA ATTGTTATTG TCAACGGGAG AAAAATGGCG
GCTTCGATGG CGGGAATGCC TCATGCAGGA GTTGACAGTG CGCCGGCTAA TACATATGTA
AAATCGAGAA GCGGAGGATA TGGCGCAGGA GACAATCTCG ACTCCGTTAA AAACAACAAC
ATGAACGGAG TGTTTGACGT TCACTTTTTA AACAGCAAGA CTCATGGAAC CAACAGAGTG
GATGAAAATC ATCAGAAGGC GGTCAGGGAA GCGGCAGAGT GGGCTGCAAA GAATAAGTTT
TAG
 
Protein sequence
MQKKKPALFA VILPLLIACI MINSSMVFAS SGILKEGMSG SQVTSLQRDL NTLGYLDVTP 
TGYYGSLTTA AVKKLQRNYG LKEDGIAGPD TLSLIKRLIN ERTASRSSGG TTLKEGMSGS
SVTALQKDLK ALGYLSVDPT GYYGSLTKEA VKKLQAKHGL EQDGIAGPKT LALIDRLMGR
SGSSASQSAA TASRGGLDKT NYLYSWFGNA ENIFKIGDTA QVYDIRTGRT FNIKRTYGYN
HADCETLTAK DTEIMLSIYG GSWSWERRPI IVIVNGRKMA ASMAGMPHAG VDSAPANTYV
KSRSGGYGAG DNLDSVKNNN MNGVFDVHFL NSKTHGTNRV DENHQKAVRE AAEWAAKNKF