Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_1021 |
Symbol | |
ID | 4811315 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 1222325 |
End bp | 1223803 |
Gene Length | 1479 bp |
Protein Length | 492 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 640106439 |
Product | stage IV sporulation protein A |
Protein accession | YP_001037446 |
Protein GI | 125973536 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR02836] stage IV sporulation protein A |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.143257 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGAAAAAT ATAATATATA CCAGCAAATT GCAGAAAGAA CGCAGGGAGA TATATATATA GGCGTTGTTG GCCCGGTGAG AACAGGAAAG TCCACATTCA TAAAAAGATT TATGGACTTG TTGGTTATAC CGAATATAGA AAATGAATTT TCAAGGGCAA GGGCAAAAGA CGAACTTCCG CAAAGTGCTT CGGGACGCAC AATAATGACC ACCGAACCGA AATTTGTCCC AAATGAAGCC ATTAAAATTG AACTTGACGA AAATGTTCAC TTTAAGGTAA GACTGGTGGA TTGTGTCGGA TACATGGTAA AAGGCGCTAT TGGCCACATG GAAAATGACA TGCCCCGGAT GGTGTCCACA CCTTGGTTTG ATGAGCAGAT TCCATTTGTT CAGGCTGCTG AAATCGGAAC CAAAAAAGTT ATCACTGATC ATTCCACCAT TGGATTTGTA GTTACAACCG ACGGCAGTAT AACCGACATC GCCAGGGAAG ACTACGTTGA AGCCGAAGAA AGAGTTGTAA AAGAATTAAA AGAGATTAAC AAGCCATTTG TAATACTTTT GAATTCGATA AATCCGTCGA ATCCTGAAAC AGAAAGTTTA AGGCAGGAAC TTGAGGCAAA GTACAATGTT CCGGTGATAG GTGTTAACTG CGCACAGCTT CGAATCGAAG ATTTGAACAA CATAATGGAA AGGGTGCTGC TTGAGTTCCC GATAAATGAA ATTGGGGTTA ACATACCGAA ATGGATTGAG TCCCTTGATG ACAACCACTG GCTTAAAGTT GACATAATTA ATGCTGTGAA AGAAGCTTTC AGAGGAATCA CCAGGATCCG GGAAATTAGG GGCAGTGTAA ACAGGTTTGA TGAATTTGAA TTTATAAAAC GGGCTTATAT TGATCACATA AACCTTGGTT CGGGAACTGC TTATGTGGAG ATTAATGAGC AGGACGGACT GTTTTATCGT ATATTGAGCG AAATGACCGG GCTTGAAATT GACGGCGAAC ACAGGCTTAT TTCACTTATG ACGGAGCTTG CAAGGATAAA AAAAGAATAT GATAAAGTTC AATATGCCCT TCATGAGGTT AAACTTAAAG GATATGGCAT AGTATCTCCC CAGATAGAAG AAATGTCTCT TGAAGAACCC GAAATAATAA AACAGGGAAG CCGCTTCGGG GTTAAACTTA GGGCAAGTGC GCCTTCAATA CATATGATAA GGGCGGATAT TGAAACAGAA ATAGCGCCTT TGGTGGGAAC GGAAAAACAG TCCGAAGAAC TGGTCAGCTA TCTTTTAAAG GAATTTGAAA ATGAACCGGA AAAATTGTGG CAGTCGAACA TCTTTGGAAA GTCATTGCAT GAACTGGTGA GTGAAGGGCT TCAAAACAAG CTTTACAGGA TGCCTGAAGA CGCACAGCTG AAACTTCAGG AGACACTGCA AAAAATAATT AACGAGGGAA GCGGAGGACT TATTTGTATT ATTTTGTAA
|
Protein sequence | MEKYNIYQQI AERTQGDIYI GVVGPVRTGK STFIKRFMDL LVIPNIENEF SRARAKDELP QSASGRTIMT TEPKFVPNEA IKIELDENVH FKVRLVDCVG YMVKGAIGHM ENDMPRMVST PWFDEQIPFV QAAEIGTKKV ITDHSTIGFV VTTDGSITDI AREDYVEAEE RVVKELKEIN KPFVILLNSI NPSNPETESL RQELEAKYNV PVIGVNCAQL RIEDLNNIME RVLLEFPINE IGVNIPKWIE SLDDNHWLKV DIINAVKEAF RGITRIREIR GSVNRFDEFE FIKRAYIDHI NLGSGTAYVE INEQDGLFYR ILSEMTGLEI DGEHRLISLM TELARIKKEY DKVQYALHEV KLKGYGIVSP QIEEMSLEEP EIIKQGSRFG VKLRASAPSI HMIRADIETE IAPLVGTEKQ SEELVSYLLK EFENEPEKLW QSNIFGKSLH ELVSEGLQNK LYRMPEDAQL KLQETLQKII NEGSGGLICI IL
|
| |