Gene Cthe_1205 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_1205 
Symbol 
ID4809897 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp1436762 
End bp1438690 
Gene Length1929 bp 
Protein Length642 aa 
Translation table11 
GC content41% 
IMG OID640106628 
Productputative serine protein kinase, PrkA 
Protein accessionYP_001037630 
Protein GI125973720 
COG category[T] Signal transduction mechanisms 
COG ID[COG2766] Putative Ser protein kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones35 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAAAGAC TGGATTTTTC AAGTATCATA CGCAAGGACA GGGAAGAACA CAAGAGCAAA 
AAGTTTGAAG GAACTTTCCT TGAGTACCTT GAAATTGTGA AAGAAAACCC CGAGATTACG
ATGCTTGCTC ACCAAAGAAT GTATAAGCTT ATCACGGAAC CCGGTGTGAC CGTAATAAGA
ACCGAGGAAA ATCCCAGACT TCGCAGGATA TATGGAAATG ATATTATAAA GAAATATAAA
TTTTTTGAGG ATGAATTTTT CGGAATAGAT AAGACAATCA TGAAAATCGT AAGGTATTTT
CATTCGGCTG CCATGGCGGG AGAAGAAGCA AGACAGGTAC TTTACCTGGT GGGACCGGTC
GGTGCGGGAA AATCATCATT GATGGAAGCT TTAAAACGCG CATTGGAAAT GAGTCCTCCG
ATTTATGCAT TAAAGGGATG CCCAATGAGG GAGGAGCCTT TGCATCTTGT GCCAAAACAT
TTGAGAAAAG AATTTGAAGA AATATTGAAT GTGAGGATAG AAGGAGACCT GTGCCCCATA
TGCCGTTACA GACTTAAAAA CGAATATAAT GGGGAATATG AGAGATTTCC CGTTGAGACG
GTGGATTTTT CCATCAGGTC AAGGAAAGGT ATCGGTGTGG TACCTCCCGT GGATCCCAAC
AACCAGGACA CATCGGTTCT TATAGGCAGC GTGGACATAT CCAAGATAGA CTTGTATCCG
GAAGATGACC CCAGAGTTCT TTCGCTGAAC GGTGCGTTCA ATGTCGGAAA CCGCGGTATT
GTTGAGTTCA TTGAAGTGTT TAAAAATGAG ACCGAGTATT TGCACACAAT GATTACCGCA
ACCCAGGAAA AGTCGATTCC GTCTCCCGGA AAAGGTTCAA TGATATACTT TGACGGTATA
ATTCTGGCCC ATTCTAATGA GGCGGAATGG AACAAGTTCA AGTCCGATCA CACAAACGAA
GCAATACTTG ACCGAATTGT AAAAATCGAA GTTCCGTATT GTCTGGAGCT TAACGAAGAG
ATAAAAATTT ATGAAAAAAT ATTGAGAAAG AGCAAGTTCG ATGCCCACAT AGCACCTCAC
ACCATTGAAA TTGCTGCAAT GTTTGCAATA CTGACAAGGC TTGCCCCGTC AAACAAGGTG
GACCCCATGA CCAAGCTTAA GATATACAAC GGTGAGGAAA TAATTGAAAA GGGTATGACA
AAGAAAGTTG ATATTTTTGA ACTTAAAGAA GAGGCTCCGA GGGAAGGAAT GACGGGTATT
TCCACGAGAT TTATCATGAA AGTGCTGGAT ACGACGCTTT CAGAATCGGA ACACAACTGC
ATCAACCCCA TATCTGTTAT GGAGACACTG GTTAAATCAA TAAAGGAGCT GTCAATCAGC
GAGGAAGAAA GGGAAAGATA TTTAAGGTTT GTGCAGGACA GTATAAGAAA AGAATACAAC
AAGATTTTGG AAAGAGAAAT TACAAAGGCG TTTATTCATG GCTACAAGGA ACAGGCGGAA
AGCCTCTTTA ACAACTATCT TGACCATGCC GAGGCCTTTG TAAACAAGTC AAAGATAAAG
GACAGGAATA CGGGAGAAGA GCTTGAGCCG GACGAAAAAT TCCTAAGATC CATAGAGGAG
CAGATAGGAA TTACGGATAC TGCGGCAAAG GGATTTAGAG CGGATGTTAC GGCATATATG
TTCTATGTTT TGAGAAACGG CGGAAAGCTT GACTATACCA GCTATGAGCC TTTGAAGGAA
GCTATTGAAA AGAAACTTAC ATCATCGGTC AGAGAACTTA GCAGGGTTAT TACACAGGCA
AAGGTAAGGG ACAAAGAGCA GAGCGAAAAA TACGATACAA TGGTTCAGGA GATGAAAAAT
AACGGATACT GCGACCATTG CTGCAATGTT ATATTGAAAT ATGCGGCTAA CAACTTGTGG
AAGGATTAA
 
Protein sequence
MERLDFSSII RKDREEHKSK KFEGTFLEYL EIVKENPEIT MLAHQRMYKL ITEPGVTVIR 
TEENPRLRRI YGNDIIKKYK FFEDEFFGID KTIMKIVRYF HSAAMAGEEA RQVLYLVGPV
GAGKSSLMEA LKRALEMSPP IYALKGCPMR EEPLHLVPKH LRKEFEEILN VRIEGDLCPI
CRYRLKNEYN GEYERFPVET VDFSIRSRKG IGVVPPVDPN NQDTSVLIGS VDISKIDLYP
EDDPRVLSLN GAFNVGNRGI VEFIEVFKNE TEYLHTMITA TQEKSIPSPG KGSMIYFDGI
ILAHSNEAEW NKFKSDHTNE AILDRIVKIE VPYCLELNEE IKIYEKILRK SKFDAHIAPH
TIEIAAMFAI LTRLAPSNKV DPMTKLKIYN GEEIIEKGMT KKVDIFELKE EAPREGMTGI
STRFIMKVLD TTLSESEHNC INPISVMETL VKSIKELSIS EEERERYLRF VQDSIRKEYN
KILEREITKA FIHGYKEQAE SLFNNYLDHA EAFVNKSKIK DRNTGEELEP DEKFLRSIEE
QIGITDTAAK GFRADVTAYM FYVLRNGGKL DYTSYEPLKE AIEKKLTSSV RELSRVITQA
KVRDKEQSEK YDTMVQEMKN NGYCDHCCNV ILKYAANNLW KD