Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_1072 |
Symbol | |
ID | 4811370 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 1279894 |
End bp | 1281090 |
Gene Length | 1197 bp |
Protein Length | 398 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 640106494 |
Product | putative stage IV sporulation YqfD |
Protein accession | YP_001037497 |
Protein GI | 125973587 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR02876] sporulation protein YqfD |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.0129367 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTGATAT TTAGGCTATG GAATTATATA AGAGGATATG TTATTATATT TGTTGAAGGA TATTTCCTGG AGAAGTTTGT GAATATATGT ACCAGAAGAC AAATTTTGCT GTGGGATATC CAAAGGGACA GAAACAGCAA AATGACGCTC AAAGTCAGCA TCCGGGGCTT TAAGATGTTA AAGCCCGTGG CAAAAAAGAC GGGCTGCAGG GTAAAGATAC TTGAAAAAAG AGGCTTGCCT TTTTTGCTAA ACAGATACCG GCACAGGAAA ACTTTTTTAC TTGGTGCCGC AGTATTTGTT GTGTTATTTT ATATAATGAC ATCCTTTGTG TGGAGTGTTG AAGTTGTCGG TAATAAAAAG ATTGAAACGG ACGGTATTTT AAAATGCCTT GAAAAATACG GGGTAAAGCC CGGAGTGCTT AAATACAGGA TAAACCCCGA GGAAGTTGCA AACGGTGTGA TTTTGGACAT AGACGGGCTT TCCTATGTGA ATGTGCTGGT AAGAGGTACA AAAGTAAAAG TGGAAGTGGC CGAGGGTGTC AAGCGTCCTT CGATTATACC TTTGAATGTG CCCTGCGATA TTGTGGCCAA GAAGGACGGC GTAATAAAGT CCGTCATTGT CAAGATTGGC CAGGCGCAGG TCAAGGAGGG AGACACGGTA AAAAAGGGAC AGCTTCTTGT ATCGGGAAGC ATACCGATAA AGGGAGCTGA AGACAACCCA AAAAGAGTGC ATGCGATGGC GGAAGTTCTT GCCAGGACAT GGTATGAAGG AAGGCAGCCG GTAGAGCTTA AAGCCGTTGA AAAAATAAGG ACCGGCAGAA AAAAGGACAA TGTAACTTTG GTTTTGTTTT CGAAAAAAAT TAATTTGTTT CATAAAGAGA TAGATTTTAA AGATTTTGAA AAGGTGGAAA TAAAAAAGAA TCTTTCAATA GGTGAAGAAT TTGTTCTGCC CTTTGGGCTT GTTATTGAAA GATATTATGA AAATGATTTG GTGGAGGCCG ATATTTCTTT GGAAGATGCA AAAGAGAATG CCGCAGGCAT TGCATACAGG AAAGCCGCGG AAAATATCCC CGAAGGTGCC ACGATAGTTG ACAAAAGGGT TAATTTTATT GAGAATGAAA ATGGGGAAAT TATTGCGGAT GTTATTATAG AATGCCTGGA GGATATTGGA GTAGCAAAAG AGAATGGAGG AGAATGA
|
Protein sequence | MLIFRLWNYI RGYVIIFVEG YFLEKFVNIC TRRQILLWDI QRDRNSKMTL KVSIRGFKML KPVAKKTGCR VKILEKRGLP FLLNRYRHRK TFLLGAAVFV VLFYIMTSFV WSVEVVGNKK IETDGILKCL EKYGVKPGVL KYRINPEEVA NGVILDIDGL SYVNVLVRGT KVKVEVAEGV KRPSIIPLNV PCDIVAKKDG VIKSVIVKIG QAQVKEGDTV KKGQLLVSGS IPIKGAEDNP KRVHAMAEVL ARTWYEGRQP VELKAVEKIR TGRKKDNVTL VLFSKKINLF HKEIDFKDFE KVEIKKNLSI GEEFVLPFGL VIERYYENDL VEADISLEDA KENAAGIAYR KAAENIPEGA TIVDKRVNFI ENENGEIIAD VIIECLEDIG VAKENGGE
|
| |