Gene Information |
Locus tag | Cthe_0845 |
Symbol | |
ID | 4810463 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 1021420 |
End bp | 1022430 |
Gene Length | 1011 bp |
Protein Length | 336 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 640106262 |
Product | stage III sporulation protein spoIIIAA |
Protein accession | YP_001037273 |
Protein GI | 125973363 |
COG category | [S] Function unknown |
COG ID | [COG3854] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | [TIGR02858] stage III sporulation protein AA |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.000346466 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTTACAA AAGAAGATTT TGTTTTCAGC GTTGAAAAAG TACAGAACTT TGAAAAGGAC ATTTTGAGAA TGATATCTCC ACAAGTGAGA GATGTTTTAA AGAAGATTCC GCTAAATGAA CTTATAACCA CCGAAGAGAT AAGACTAAGA GCGAACAAGC CTTTAATGAT TCAAAATGAA AAAGGAAGCT TTTTTGTCAA TTTGGAAGGA AGACTTACAG CAAACAGGAT GAATCTTTTT TATGTAAGCC AGGAGCAGAT AGCAAAAACG CTGGAACTTA TAAGTGAAAA CTCCATTTAT GCTTTTCAGG ATGAAATAAG AAACGGTTTT TTGACCATAA GGGGAGGCCA TAGGGTCGGT ATTGTCGGAC GAGTTGTTTT AAACGGAGAT ACCGTAAAGA ACATCAAGGA TGTTTCCGGG CTTAATATAA GAATATCCAG GGAAATAACC GGATGTTCCT CGAAAGTTTT AAAATATATT ATCAGCAGTG AAAAGCAAGT TTACAACACT TTGATAGTAT CTCCTCCCCA ATGCGGGAAA ACAACCTTGT TAAGAGACAT AACGAGGGCT ATCAGCGACG GTGTTGAAGA AATGGGCTTT AAAGGAGTTA AAGTGGGAGT TGTAGATGAA CGTTCAGAAA TTGCAGCATG TTACAAGGGG GTGCCCCAGA ACAGGGTAGG AACAAGGACC GATGTGCTTG ATGCGTGCCC CAAACAAATT GGCATGATAA TGATGCTCAG ATCCATGTCG CCGGATGTGA TTGTTACGGA TGAAATAGGA AACAAGGGAG ACAAAGATGC TTTGATTCAG GTGCTTAATG CAGGGGTGAA AGTGATATCC ACGGCGCACG GGTACAATAT TTCGGAATTA AAAAGCCGCA AAGAAGTCTT GAGCCTGATA GAAGAAAAGA TGTTTGAAAG GTATATTGTT TTGAGCGCGA GAAAAGGCCC CGGTACAGTG GAAGAGATAA TTGACGGGAC CGATATGAGT ATTTTGTACA AAGGAGAATG A
|
Protein sequence | MVTKEDFVFS VEKVQNFEKD ILRMISPQVR DVLKKIPLNE LITTEEIRLR ANKPLMIQNE KGSFFVNLEG RLTANRMNLF YVSQEQIAKT LELISENSIY AFQDEIRNGF LTIRGGHRVG IVGRVVLNGD TVKNIKDVSG LNIRISREIT GCSSKVLKYI ISSEKQVYNT LIVSPPQCGK TTLLRDITRA ISDGVEEMGF KGVKVGVVDE RSEIAACYKG VPQNRVGTRT DVLDACPKQI GMIMMLRSMS PDVIVTDEIG NKGDKDALIQ VLNAGVKVIS TAHGYNISEL KSRKEVLSLI EEKMFERYIV LSARKGPGTV EEIIDGTDMS ILYKGE
|
| |
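The length fields in this record can be cross-checked against the coordinates and sequences. The sketch below is one way to do that, assuming that Start bp and End bp are 1-based inclusive coordinates and that Protein Length excludes the stop codon; neither convention is stated in the record itself. The function names are hypothetical helpers, not part of any database tool.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a closed interval, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons in an unspliced CDS, minus one for the stop codon (assumed)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


def clean_sequence(text: str) -> str:
    """Remove the spaces that split the sequence into blocks of 10 and upper-case it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


if __name__ == "__main__":
    gene_length = span_length(1021420, 1022430)
    print("gene length (bp):", gene_length)
    print("expected protein length (aa):", expected_protein_length(gene_length))
```

With the coordinates above, the computed values agree with the Gene Length (1011 bp) and Protein Length (336 aa) fields. Passing the Gene sequence value to `gc_fraction` gives a figure to compare against the GC content field.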