Gene Information |
Locus tag | Cthe_1328 |
Symbol | |
ID | 4809468 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 1611804 |
End bp | 1613081 |
Gene Length | 1278 bp |
Protein Length | 425 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 640106752 |
Product | stage II sporulation P |
Protein accession | YP_001037753 |
Protein GI | 125973843 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR02867] stage II sporulation protein P |
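The coordinate and length fields above are consistent with one another, and the arithmetic can be sketched as follows. This is a minimal check, not the database's own method: it assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values copied from the Gene Information table.
START_BP = 1611804
END_BP = 1613081

length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1278 425
```

Because the gene is reported as unspliced, no intron correction is needed before dividing by three.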
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.944507 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGAGAGTAT ACGGTTTGAG AAAAAGAAAG TTTGATTATG GCAAGTTGTT TAAGATTGCC CTGATAATAA TACTTTCAAT CGGGGCAATA AAGATTGGGG CAATTGCCGG AGACGTGCTC TACAAGTCTG ATAAAAAGAT TATTGGAAAG ATTGAAGTTG AAACTTTAAG AGCCACTCTC AATGCTTCAC TTCCTATAAT TGACACCATT TACAACAGCG GCAATATCAG TTTTTCGATT TCGGGACAGA TAAAAGAAAT TATTAATCTG GTTTTCTATT TTGATCTCAG TAATCCTGTT ACAATTTTTG GGGCAGAGTC TCCAATATTC TATAGTTACT ATATGAATGA GTACCAAAAA CAGCTGGCCC AAAATCAAAA CTGTGAACCT TACTTCTATA TGGCGGATTT GGATACGCCG GATGACAATA GCGACAAGGT CAAAAATCCT GACAATAATG CTCCTGAACC CACATATCCG GCCAGCAGCA TAAGTTATGA GGTAAATGAG TTGGACAGAA CCGGAACTCC TGAGAATGCT ACCACTGTTA CGGCGGACAA AATTGCAATC AACAGTCATG AAGTTGACTA TGAAATTGAT GTTGAAAAGC TTCTTAACGA ACCTTTAAAC ATCAGCTTTG ACAAAAAGGG TCCCAAAGTT CTTATATACC ATACACATAC CACGGAAGGA TTTATTAAAG ACCTAAGCGA GCTGGATAAA AGTGGTATTC CAAGCAGAAC CACCGATAAC AGATACAATG TAGTAAGAGT TGGGGAGGAA CTGGCTCAGA CATTAAGGAA AAAATACGGT ATTGAAGTGA TTCATAACGC CACTGTTCAC AATCATCCCT CGGACACAGG AGCTTATGGT AGATCCCTTA ATACTGCGGC CAACATTTTA AAAAGTTATC CTTCAATAAA AATAGTCCTG GATATTCACA GGGACGGGCT GGGCGAAGGT AAACTTAGAG TGGCGACCAA GATTAATAAC AAGGATGCGG CAAAAATAAT GTTTGTGGTG GGAACCGACG GGACAGGGCT TGAGCATCCT AACTGGCGGG AGAATTTGAA ATTGGCAATC AAGCTTCAGC AAAAGCTTAA TGAAAAGTAT CCCGGTATCA CAAGACCGAT TTATATAAGC CGCAACCGCT ACAACCAGCA CCTTACCAAC GGTTCTTTGA TTGTTGAAAT CGGAGGGGAT GGCAATACAA TAAATGAATG TTTGGAGAGT ACGAAATATC TTGCCGAGGT TTTAAACGAT GTCATTAATA ATAAATAA
Protein sequence | MRVYGLRKRK FDYGKLFKIA LIIILSIGAI KIGAIAGDVL YKSDKKIIGK IEVETLRATL NASLPIIDTI YNSGNISFSI SGQIKEIINL VFYFDLSNPV TIFGAESPIF YSYYMNEYQK QLAQNQNCEP YFYMADLDTP DDNSDKVKNP DNNAPEPTYP ASSISYEVNE LDRTGTPENA TTVTADKIAI NSHEVDYEID VEKLLNEPLN ISFDKKGPKV LIYHTHTTEG FIKDLSELDK SGIPSRTTDN RYNVVRVGEE LAQTLRKKYG IEVIHNATVH NHPSDTGAYG RSLNTAANIL KSYPSIKIVL DIHRDGLGEG KLRVATKINN KDAAKIMFVV GTDGTGLEHP NWRENLKLAI KLQQKLNEKY PGITRPIYIS RNRYNQHLTN GSLIVEIGGD GNTINECLES TKYLAEVLND VINNK
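The sequences above are printed in space-separated blocks of ten, so they need cleaning before any computation. The following is a minimal sketch, under stated assumptions, of how the GC content and the protein translation could be re-derived from the gene sequence. The helper names are hypothetical. The codon table is built from the NCBI amino-acid string for translation table 11, whose amino-acid assignments match the standard code; the sketch does not handle alternative start codons or ambiguous bases such as N.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (NCBI translation table 11).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-character blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First 21 bases of the gene sequence, as printed in the record.
prefix = "ATGAGAGTAT ACGGTTTGAG A"
print(translate(prefix))  # MRVYGLR
```

Passing the full gene sequence to `gc_percent` and `translate` would allow a comparison against the GC content and protein sequence fields of the record.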