Gene Cthe_0427 details


Gene Information

Locus tag: Cthe_0427
Symbol:
ID: 4808430
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 536692
End bp: 537861
Gene Length: 1170 bp
Protein Length: 389 aa
Translation table: 11
GC content: 39%
IMG OID: 640105841
Product: serine phosphatase
Protein accession: YP_001036858
Protein GI: 125972948
COG category:
COG ID:
TIGRFAM ID:
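
The start, end, and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive, and that the final codon of the gene is a stop codon that is not counted in the protein length. The variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 536692
end_bp = 537861

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and contributes no residue.
protein_length = gene_length // 3 - 1

print(gene_length, "bp")
print(protein_length, "aa")
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields.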


Plasmid Coverage information

Num covering plasmid clones: 52
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAATGATT TATGCGTTGA TTTAGGATAT AAAAGCCTTA ACAAATTTGG GGAGCAGCTG 
TGTGGCGACA TGATACAGGT TGTAAAAGAT GATGACACTA CAATTCTGGT TCTGGCCGAC
GGTTTGGGAA GTGGTGTCAA GGCCAATATT TTATCCACCC TTACCTCAAA GATTATTTCA
ACGATGATTG CAGCGCATAT GGGTATTGAA GAATGTGTCA ATACGATTAT GTCAACTCTT
CCGGTTTGCA AGGTCAGAGG AATTGCCTAT TCAACATTTA CCATAATAAA AATTACCAAC
AACACCTACG CAGAAATAAT TCAGTATGAC AATCCTCTGG TAATACTTTT GCGGAACGGT
AAAAAATATG ATTATCCTAC ACAGACAAAA ATAATATCCG GCAAAAAAAT CGTTGAATCA
AAAATAAGGC TGAATTGTGA TGATGTGTTT GTTGTGATGA GTGACGGGGC AATTTATGCG
GGAGTCGGCC AGACTTTAAA TTACGGCTGG CAAAGGGAGA ATATTATTGA GTTTATTGAG
TCTCATTATG ACAAAAGCCT TTCTGCCAAT GCTCTTACAT CTCTTTTGAT TGATACTTGC
AACAACCTGT ATGCAAACAT GCCCGGAGAT GATACAACCA TTGCAGCAAT TAAGATTAGA
AAAAGACAAG TAGTCAATCT GATGTTTGGT CCGCCGCAGA ATCCTGAAGA TGTCCATAAT
ATGATGTCTC TGTTTTTTGC AAAACAGGGA AGACATATTG TATGTGGCGG TACCACATCA
ACGCTTGCAG CGAAGTTTTT GGGCAAGGAG CTTGAAACGA CCATTGATTA TATTGACCCG
AGAATTCCGC CCATTGCCAG GATTGAAGGA GTTGATCTTG TGACAGAGGG CGTGTTGACA
ATAAGCCGGG TTCTGGAATA TGCAAAGGAT TATATTGGGA AAAACATTCT TTATAACGAG
TGGCACAGCA AAAATGACGG TGCTTCGATA ATAGCAAGAA TGCTTTTCGA GGAAGCAACG
GACATCAATT TCTATGTTGG AAAGGCTATT AATCCTGCCC ACCAGAATCC CAATCTTCCC
ATAGGATTTA ATATTAAAAT GCAGTTGGTG GAAGAGCTTT CAAAGATACT TAAGCAAATG
GGCAAAACAA TAAATCTTAG CTATTTTTGA
 
Protein sequence
MNDLCVDLGY KSLNKFGEQL CGDMIQVVKD DDTTILVLAD GLGSGVKANI LSTLTSKIIS 
TMIAAHMGIE ECVNTIMSTL PVCKVRGIAY STFTIIKITN NTYAEIIQYD NPLVILLRNG
KKYDYPTQTK IISGKKIVES KIRLNCDDVF VVMSDGAIYA GVGQTLNYGW QRENIIEFIE
SHYDKSLSAN ALTSLLIDTC NNLYANMPGD DTTIAAIKIR KRQVVNLMFG PPQNPEDVHN
MMSLFFAKQG RHIVCGGTTS TLAAKFLGKE LETTIDYIDP RIPPIARIEG VDLVTEGVLT
ISRVLEYAKD YIGKNILYNE WHSKNDGASI IARMLFEEAT DINFYVGKAI NPAHQNPNLP
IGFNIKMQLV EELSKILKQM GKTINLSYF
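
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The following is a minimal sketch, not the database's own method. It assumes the amino acid assignments of NCBI translation table 11, which for sense and stop codons match the standard code; alternative start codons are not handled. The names `CODON_TABLE` and `translate` are hypothetical helpers introduced for illustration.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; first base varies slowest).
# Assumption: amino acid assignments of translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(blocked_dna):
    """Translate a DNA string, ignoring the spaces and line breaks
    used to split the sequence into blocks of ten bases."""
    dna = "".join(blocked_dna.split()).upper()
    protein = []
    # Walk complete codons only; stop at the first stop codon.
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence, copied from the record.
first_line = "ATGAATGATT TATGCGTTGA TTTAGGATAT AAAAGCCTTA ACAAATTTGG GGAGCAGCTG"
last_line = "GGCAAAACAA TAAATCTTAG CTATTTTTGA"

print(translate(first_line))
print(translate(last_line))
```

Because the gene length is a multiple of three and each line of the gene sequence holds 60 bases, every line starts on a codon boundary, so individual lines can be translated on their own and compared with the matching stretch of the protein sequence.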