Gene Cthe_0574 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_0574 
Symbol 
ID4808249 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp701454 
End bp703565 
Gene Length2112 bp 
Protein Length703 aa 
Translation table11 
GC content41% 
IMG OID640105988 
Productserine/threonine protein kinase 
Protein accessionYP_001037003 
Protein GI125973093 
COG category[K] Transcription
[L] Replication, recombination and repair
[R] General function prediction only
[S] Function unknown
[T] Signal transduction mechanisms 
COG ID[COG0515] Serine/threonine protein kinase
[COG2815] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.153609 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTTGGTC AAATTTTAGG AAATAGGTAT GAATTGATTG AGAAAATCGG CGGGGGAGGA 
ATGGCCGATG TATATAAAGC AAGGTGTAAA TTGTTAAACA GGTTTGTGGC AATTAAGATT
TTAAAACCCG AATTTATAAA TGATGAGGAA TTTCTCAAAA GGTTTACAAT AGAGGCTCAG
GCCGCTGCAA GCCTGTCACA TCCGAACATT GTCTCCATAT ATGACGTAGG CCAGGAGAAT
GATATACATT ATATTGTTAT GGAGTATGTA AACGGACAGA CTTTAAAGGA GTACCTTGAC
GAGAATGGCG CTCTTTACTG GAAAGATGCT GTAAATATTG CGATTCAAAT ATGTCAGGCT
ATTGAACATG CACACAAAAA TCATGTTGTT CACAGAGATA TAAAGCCTCA CAATATTTTG
CTTACAAAAG ACGGAATGCT GAAGGTTACC GATTTCGGTA TAGCAAGAGC CGTAAGTTCA
TCCACTATAA CCATGGCGGG AAATGCAATA GGCTCGGTGC ATTATTTTTC ACCGGAACAG
GCCCGGGGAG GATTTACCGA TGAAAAATCG GACCTTTATT CATTGGGAAT AGTACTGTAC
GAACTTTTGA CCGGAAGAGT TCCCTTTGAC GGAGAATCTC CTGTGGCTGT TGCAATAAAG
CATATTCAGG ATGAACCGGA AGAGCCAATA AATATTAAAG AGGATATACC CACGGGTGTT
AACAGTATTG TTATGCGGGC CATTCAAAAG GATCAGGCAT TAAGATATCA ATCCGCGTCC
GAGTTGCTGA ATGATTTGTA CAAAGTTTTG AAACAGCCCG ACGCCCAGTT TGCAAAAGCC
AGAACCACGG AGGACAGTCC CACGGTAAGA ATTCCGTCAA TCAAGAAGAA AGAACTTGTT
CTGGAGATGG ACACATCGGG AAAAGCAGGT GATGATGCAG TGAAAAAGAA GAAAAAAGAC
AACAAAACCA CTGTATGGGC GGTGGTAACA TCAATTCTGG TGATTTTCGT TCTGGGCGCC
TTGATGGTAA AGGGCTTGGG GCCTGTTGTT TATTCAATGT TCAACAAACC GGAGGATTTT
ATTGTTGAAG ATTACACAAA TCAAAATTTC TATGAGGTAA AAGGCAAATT GTCTCAATAC
AATATTGAGG CAATAGAGAT AAGGAAGCAT GATGATCAAA TACCAAAAGA TAGAATAATT
TCTCAGGATA AAGCCGTTGG GGAAAGAATA AAACCCGGTG AATTTGCAAA GATTGAGTTT
GTGGTGAGTG ACGGTCCGTT GCTTGTAAAA ATACCGGACC TCAGAAGGAT GGAATACAGG
CAGGCGGTGA TAGAGCTCAG ACAATTGGGA CTTGAAGCAA ATGTAATCGA TGAGTACAGT
GATGTCGTTT CAAAAGGTGT TGTAATCAGG ACGGAACCGG ATATAAACGC GGAGGTAAAA
CCGGGTACGG TTGTGAATGT TTATAAGAGT TTGGGTCCGG AGATAAAATA CAGCCTGGTT
CCGAACTTGA TAGGAAAGAC AAAAAGCGAG GCATTGAACC TTTTGGTTGG AGCCAAGCTT
ACCATGGGTA AAATATACCC TGAAGACATG ACTTATGCCA GAGATAAAAT AGTAAGGCAG
GAACCTGCAG CCGGAACTGA AGTTGAAGAA GGAACTCCGG TGAATATATA CCTTGAGGAT
TATAATCCTG ATCAGAAATA TGTTACCCGT CTCATTGAAC TTGACAATCC GGACAATTAC
GGAGAAAATA TAAAGTTTTT GGTAAATATA ACAAGATCGG ATACAAAGAG GGTTGAGACC
TTATACAGTG AAGTACGGAA AAAAAGTGAT TTCCCGATAA CGATTTCCAT ACCGGTACCA
AACGGCGGAA GTACTTTGGT GAGGGTATAT CTCGATAATA AGAACTACAT GGAGTTTACA
GAAGAGTTTA ATAAACGCAG TAATGAAACT AATACCGGTA ATACTGCCAA CAATAACGGT
AATTCCAACA ATAACGGTAG CGGTAGCGGT ACCGATAATA TTAATGATAC CAATAATACT
GACAATGCCA ACAACGATAA CGAAAGAACG GAAGAGTCCG GTGAAACAAA CCATGCAGAA
GCTGCAGGAT AG
 
Protein sequence
MVGQILGNRY ELIEKIGGGG MADVYKARCK LLNRFVAIKI LKPEFINDEE FLKRFTIEAQ 
AAASLSHPNI VSIYDVGQEN DIHYIVMEYV NGQTLKEYLD ENGALYWKDA VNIAIQICQA
IEHAHKNHVV HRDIKPHNIL LTKDGMLKVT DFGIARAVSS STITMAGNAI GSVHYFSPEQ
ARGGFTDEKS DLYSLGIVLY ELLTGRVPFD GESPVAVAIK HIQDEPEEPI NIKEDIPTGV
NSIVMRAIQK DQALRYQSAS ELLNDLYKVL KQPDAQFAKA RTTEDSPTVR IPSIKKKELV
LEMDTSGKAG DDAVKKKKKD NKTTVWAVVT SILVIFVLGA LMVKGLGPVV YSMFNKPEDF
IVEDYTNQNF YEVKGKLSQY NIEAIEIRKH DDQIPKDRII SQDKAVGERI KPGEFAKIEF
VVSDGPLLVK IPDLRRMEYR QAVIELRQLG LEANVIDEYS DVVSKGVVIR TEPDINAEVK
PGTVVNVYKS LGPEIKYSLV PNLIGKTKSE ALNLLVGAKL TMGKIYPEDM TYARDKIVRQ
EPAAGTEVEE GTPVNIYLED YNPDQKYVTR LIELDNPDNY GENIKFLVNI TRSDTKRVET
LYSEVRKKSD FPITISIPVP NGGSTLVRVY LDNKNYMEFT EEFNKRSNET NTGNTANNNG
NSNNNGSGSG TDNINDTNNT DNANNDNERT EESGETNHAE AAG