Gene Cthe_1021 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_1021 
Symbol 
ID4811315 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp1222325 
End bp1223803 
Gene Length1479 bp 
Protein Length492 aa 
Translation table11 
GC content41% 
IMG OID640106439 
Productstage IV sporulation protein A 
Protein accessionYP_001037446 
Protein GI125973536 
COG category 
COG ID 
TIGRFAM ID[TIGR02836] stage IV sporulation protein A 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.143257 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGGAAAAAT ATAATATATA CCAGCAAATT GCAGAAAGAA CGCAGGGAGA TATATATATA 
GGCGTTGTTG GCCCGGTGAG AACAGGAAAG TCCACATTCA TAAAAAGATT TATGGACTTG
TTGGTTATAC CGAATATAGA AAATGAATTT TCAAGGGCAA GGGCAAAAGA CGAACTTCCG
CAAAGTGCTT CGGGACGCAC AATAATGACC ACCGAACCGA AATTTGTCCC AAATGAAGCC
ATTAAAATTG AACTTGACGA AAATGTTCAC TTTAAGGTAA GACTGGTGGA TTGTGTCGGA
TACATGGTAA AAGGCGCTAT TGGCCACATG GAAAATGACA TGCCCCGGAT GGTGTCCACA
CCTTGGTTTG ATGAGCAGAT TCCATTTGTT CAGGCTGCTG AAATCGGAAC CAAAAAAGTT
ATCACTGATC ATTCCACCAT TGGATTTGTA GTTACAACCG ACGGCAGTAT AACCGACATC
GCCAGGGAAG ACTACGTTGA AGCCGAAGAA AGAGTTGTAA AAGAATTAAA AGAGATTAAC
AAGCCATTTG TAATACTTTT GAATTCGATA AATCCGTCGA ATCCTGAAAC AGAAAGTTTA
AGGCAGGAAC TTGAGGCAAA GTACAATGTT CCGGTGATAG GTGTTAACTG CGCACAGCTT
CGAATCGAAG ATTTGAACAA CATAATGGAA AGGGTGCTGC TTGAGTTCCC GATAAATGAA
ATTGGGGTTA ACATACCGAA ATGGATTGAG TCCCTTGATG ACAACCACTG GCTTAAAGTT
GACATAATTA ATGCTGTGAA AGAAGCTTTC AGAGGAATCA CCAGGATCCG GGAAATTAGG
GGCAGTGTAA ACAGGTTTGA TGAATTTGAA TTTATAAAAC GGGCTTATAT TGATCACATA
AACCTTGGTT CGGGAACTGC TTATGTGGAG ATTAATGAGC AGGACGGACT GTTTTATCGT
ATATTGAGCG AAATGACCGG GCTTGAAATT GACGGCGAAC ACAGGCTTAT TTCACTTATG
ACGGAGCTTG CAAGGATAAA AAAAGAATAT GATAAAGTTC AATATGCCCT TCATGAGGTT
AAACTTAAAG GATATGGCAT AGTATCTCCC CAGATAGAAG AAATGTCTCT TGAAGAACCC
GAAATAATAA AACAGGGAAG CCGCTTCGGG GTTAAACTTA GGGCAAGTGC GCCTTCAATA
CATATGATAA GGGCGGATAT TGAAACAGAA ATAGCGCCTT TGGTGGGAAC GGAAAAACAG
TCCGAAGAAC TGGTCAGCTA TCTTTTAAAG GAATTTGAAA ATGAACCGGA AAAATTGTGG
CAGTCGAACA TCTTTGGAAA GTCATTGCAT GAACTGGTGA GTGAAGGGCT TCAAAACAAG
CTTTACAGGA TGCCTGAAGA CGCACAGCTG AAACTTCAGG AGACACTGCA AAAAATAATT
AACGAGGGAA GCGGAGGACT TATTTGTATT ATTTTGTAA
 
Protein sequence
MEKYNIYQQI AERTQGDIYI GVVGPVRTGK STFIKRFMDL LVIPNIENEF SRARAKDELP 
QSASGRTIMT TEPKFVPNEA IKIELDENVH FKVRLVDCVG YMVKGAIGHM ENDMPRMVST
PWFDEQIPFV QAAEIGTKKV ITDHSTIGFV VTTDGSITDI AREDYVEAEE RVVKELKEIN
KPFVILLNSI NPSNPETESL RQELEAKYNV PVIGVNCAQL RIEDLNNIME RVLLEFPINE
IGVNIPKWIE SLDDNHWLKV DIINAVKEAF RGITRIREIR GSVNRFDEFE FIKRAYIDHI
NLGSGTAYVE INEQDGLFYR ILSEMTGLEI DGEHRLISLM TELARIKKEY DKVQYALHEV
KLKGYGIVSP QIEEMSLEEP EIIKQGSRFG VKLRASAPSI HMIRADIETE IAPLVGTEKQ
SEELVSYLLK EFENEPEKLW QSNIFGKSLH ELVSEGLQNK LYRMPEDAQL KLQETLQKII
NEGSGGLICI IL