Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_1525 |
Symbol | |
ID | 4810563 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 1850184 |
End bp | 1852052 |
Gene Length | 1869 bp |
Protein Length | 622 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | 640106945 |
Product | hypothetical protein |
Protein accession | YP_001037946 |
Protein GI | 125974036 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAACTCAT TTAAATTGGC TGTTATGAGT TTTAGAAGAA ATATAAAAGC TTATGGAATG TATCTTATGG CAATGATTTT ATCAGTAGCC ACCTATTATA ATTTTGCATC TATGAGATTC AACCCTCAAT TCCGGGAGGC AAGAGATTTA ACTGTATATG TACAGAGTTC ATCAGTGGTT GCCTCCCTGC TTATGATATT GTTTCTGATA TTTTTCATTA TGTATTCCGG CAACTTCTTT CTGAACCAAA GGAAAAAGGA AATAGCAGTA TATGCTTTCA TGGGAATTGA TAACTATAAA ATTGCCTTTA TGTTTGCATC GGAAGGATTG TTGATGGGGA TAATGTCTTT GGTAATCGGC CTGTCGCTTG GAATTCTGTT CAGCAAATTG TTTCTGATGT TGCTTGCAAA GGTAGCTTTA CTGAATATGA GAATTAATTT CTTCATATCA GTAAAGGCTA TTGTAGAGAC TGTAGTTGCA TATTTGGTCA TTTTATTTAT TACATTCCTG AAAGGATATA TAGATGTTGT CAGGACAAAT TTGATTGATT TGATAAATAC GTTGAAAAAA TCGGAGGAGC TTCCTAAAAT TAATTATTTA AAAGGCATTG CCTCATTAAT GGTTATAGGT GCTGCATATT ATATTGCGGT AAATTATGGC AAGTTCGGGT TTGGAAAAGC CCTCTTATGG ACAGTGATTC TGGTCGTTAT AGGCACTTAC TGGCTGTTTG GTTCTCTTTT ATCAATGATT ATCAGGTACT TCATAAGCAG AAAAAAGTTT TTGTATAAAG GCACAAATAT TATAAGCTTT TCAAATATAG CCTTTAGGAT AAAGGGCAAC TATAGGGCCC TTGCAGCAGT AGCGGTATCG ATAACTGTGT GTATAACATC CTTTGGTACG GTTAGCTCTC TTAAGTATTT TGTAAATGAG AACCATAAGA TTGAGGTACC ATATACTGTT ACCTATATTT CCGAAAAACA GGAAGAAATA GAAAGAGTGG ATGAAATAAT AGGAAAATCG AATCATAACG TTAAGCTGAA AGAAAAGGCC AACTTTTTGT TTGTCCCTGA TTCACAGGTT GTAGTGGTGA AACTGTCCAC TTTTCAAAGG ATACTGACGG ATCTTAATGT TAAAGGGCGG GATAAAATTT TATCTAAAAT TGGACAGCTG AAGGAAGAAG CGGTATATGT AGAGAGACCC GGAGTCTTTA TGAGCCTGTT GGAAAAAAAT GATATAAAAA TAGGTGACAG GGTCTACAGA ATAAAAGCTC AGACAAAGAT TCCTTTGTTT GGAAGCGGAT TGCCTTTTCC TTGTGTTGTT GTCGGCGAGG AAGAATATGA AACATTAAAG TCTGAATTTG AAGAGAAACA GTTTAATGGA ATTATACTTG ACAATCCGGA AGACACAAAG GATTTGACTT TACAGCTGGC TCAAATACTG CCGGAGAATT CAAGACTATT CACCTATTTT ATAGCTGGCG CTGCAATGTA CGACTTAATT GGAATAGTAT ATTTTCTTGG AGCTTTCCTG TTTCTAGTGT TTGTATTTGC CACAGGCAGC ATAATATACT TTAAGATTTT GAGCGAATCT TTCAGAGATA AAGATAAATA CGAAATACTT AAGAAACTGG GGACAACGGA TGTTGAAATC AAAAAGTCCG TATCAAAACA GGTGGGTGTG TTTTTCCTGT TGCCGCTGAT AGTGGGGATA ATCCACAGCA CAGTTGCCAT TTCAGTATTA AGTGACCTTA TGAGTTATAG TTTGACAGTG CCGACAATTA TAAGTATTGG CGTATTTATA ATTGTATATG CGATATTCTA TGTCTTTACC GGAAGAAAAT ATGTTAATGT TGTAAGAAAT CAGGCTTGA
|
Protein sequence | MNSFKLAVMS FRRNIKAYGM YLMAMILSVA TYYNFASMRF NPQFREARDL TVYVQSSSVV ASLLMILFLI FFIMYSGNFF LNQRKKEIAV YAFMGIDNYK IAFMFASEGL LMGIMSLVIG LSLGILFSKL FLMLLAKVAL LNMRINFFIS VKAIVETVVA YLVILFITFL KGYIDVVRTN LIDLINTLKK SEELPKINYL KGIASLMVIG AAYYIAVNYG KFGFGKALLW TVILVVIGTY WLFGSLLSMI IRYFISRKKF LYKGTNIISF SNIAFRIKGN YRALAAVAVS ITVCITSFGT VSSLKYFVNE NHKIEVPYTV TYISEKQEEI ERVDEIIGKS NHNVKLKEKA NFLFVPDSQV VVVKLSTFQR ILTDLNVKGR DKILSKIGQL KEEAVYVERP GVFMSLLEKN DIKIGDRVYR IKAQTKIPLF GSGLPFPCVV VGEEEYETLK SEFEEKQFNG IILDNPEDTK DLTLQLAQIL PENSRLFTYF IAGAAMYDLI GIVYFLGAFL FLVFVFATGS IIYFKILSES FRDKDKYEIL KKLGTTDVEI KKSVSKQVGV FFLLPLIVGI IHSTVAISVL SDLMSYSLTV PTIISIGVFI IVYAIFYVFT GRKYVNVVRN QA
|
| |