Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_1413 |
Symbol | |
ID | 4809074 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 1732208 |
End bp | 1734124 |
Gene Length | 1917 bp |
Protein Length | 638 aa |
Translation table | 11 |
GC content | 32% |
IMG OID | 640106836 |
Product | hypothetical protein |
Protein accession | YP_001037837 |
Protein GI | 125973927 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATAGTTT TTGAATGGAA GAAAGTCCTG ATAAAACAAA AAGGATTGCT TTGTATAGGT ATTATGTTTC TTCTCAAAAT TGCCCTGTTA TTTTACCAGG GATATGATTC TAACAGCATT ATCAATAGCA ACGAAGAAGG TTATAAATAC TATATAAACC TCTATCAAGG TAAACTTACG GAGGAAAAAG AAAAGTCCAT TAAAGCTGAA TATGATAGTG TAACAAATGC ACAGGCTTAC CTAGAAGACT TGTCTCACAA AAAAAGAAAT GGAGAGATTG GTTTTAAGGA GTATGAAGAA AAATCTAAAA AGTATTATGA ATGCTTAAAA AACGCAGATG TTTTTAATTT GGTATATAAT CAATATTACT ACGCGAAAGA AGCTCCGGAT GTCAGATATA TTATTGATTG GAGGGGCTGG CAGACACTTC TGAGCCATGA CGCGCCGGAT GTATTGCTTA TAGTGTGCTT GCTAATTGTT ATGGTACCAT TGTTTTGCAA TGAATATGAA AGTGGCATGT ATTCATTGCT TGTATCCAGT GTTAGGGGAA AGTACAAGGT TGCAATCGTA AAGCTACTGA GCGCATTTGT TTTAAGTGCT GGCATAGTGA TTTTGTTTTC AGTAGCAGAA TATATTTGCG TGGATTTTAT GGTGGGACTT GATAATAGTA CATTTCCATT GCAAAGTTTG AAATTTTTTG AATATAGTGA CTGGTATGTT TCTTTAAGAC AAGCTTTTGT CATAATCGTA TTATTTCGCA TAGTTGGTGC GGTGCTGTTT ACAGCTTTTA TTTCAGTTGT AAGTGTAATA AGTAAAAAAA CAATTGTTGC TTTGTTTACT TGCAGTACAT TGGTGTTTTT ACCGTATATA GTATATGGAG GAACAACTAC ATTATATTAT CTTCCACTGC CGTCAGGGCT TCTTGTCGGA GCAGGGTATT TGTGGGGAGA TAATTATCTT TCGGCTATTA CTGAAGAAGG AGTGGATAGA ATAATATTAT TTCAAAAAAT TAGTAAAAAT ATGATTACTT TATTGATGTT AATGTTTGTA ATTGAAATTG TATTTCTTTT TTTGACTTGC ATTGTGAAAT ATTCAAGGCA TACTTTTCGC CTAAATAATT TCGGTAATAA AATACGTAAG TTCTCATGTG CTTTATGTGT GTTAACCATT CTGCTTTTAA TTCTTACAGG TTGCCGGACA GAAATGAGTG AAAAAGATAA TTTCACTTTC AATGCTTCGG AAGAATGGAG ATGTGTAAAA ACTGATGAGT ATGTAATATC TTTGGATCCG GAAAAAAATA TAATAACTGC GGAGAATCTT GATAAAGGAG AGCAAATTGT TTTGCCAAAG GATCCTTTCA GACAGGATAT ATATGAAACT GAAGACAGAT CATTAAAACG AGGATACAGA ATAAGGTCGA TATTTGTAAG AGACGGATGG TGCTATTATT TAAAAGAAAT ATTGCAAACT GATGGATTTC AAATATATGG TATTGATTTA AAAGATTTTA AAGAGGAATT GATTTATAAT GGTATCCAAG AGAATGATAA AAATTTTTTT GGAGCATTTT TCGATAGAAG ACAGGATCAA ACTAGTCTGC CGTCAGTTGA TTATTTTTTT CTCAATGCGA ATTATATATA TTATCTGCAA GGCAAAAGAC TGGTTAGAAT TGACAGAAAT ACAAATTCAG AAACAGTATT GGCATTGGAT GTGAAAGAAA GAAGTGCCGT TTATCATAAC GGGGATATTT ATTACATAGA TACTCTTAAC AGGCTTAGTG TGTATAAGGA AGAAGATGAA ACCGTCAATA AAATAGATTC TGTTTATACT GATCAGATCA GTATTGAGGG AAAGCGTATC AGGTATACTG ATTTGTTAAA TGATAAAAAT ATTGGATATT ATGATATAGA AACCTGA
|
Protein sequence | MIVFEWKKVL IKQKGLLCIG IMFLLKIALL FYQGYDSNSI INSNEEGYKY YINLYQGKLT EEKEKSIKAE YDSVTNAQAY LEDLSHKKRN GEIGFKEYEE KSKKYYECLK NADVFNLVYN QYYYAKEAPD VRYIIDWRGW QTLLSHDAPD VLLIVCLLIV MVPLFCNEYE SGMYSLLVSS VRGKYKVAIV KLLSAFVLSA GIVILFSVAE YICVDFMVGL DNSTFPLQSL KFFEYSDWYV SLRQAFVIIV LFRIVGAVLF TAFISVVSVI SKKTIVALFT CSTLVFLPYI VYGGTTTLYY LPLPSGLLVG AGYLWGDNYL SAITEEGVDR IILFQKISKN MITLLMLMFV IEIVFLFLTC IVKYSRHTFR LNNFGNKIRK FSCALCVLTI LLLILTGCRT EMSEKDNFTF NASEEWRCVK TDEYVISLDP EKNIITAENL DKGEQIVLPK DPFRQDIYET EDRSLKRGYR IRSIFVRDGW CYYLKEILQT DGFQIYGIDL KDFKEELIYN GIQENDKNFF GAFFDRRQDQ TSLPSVDYFF LNANYIYYLQ GKRLVRIDRN TNSETVLALD VKERSAVYHN GDIYYIDTLN RLSVYKEEDE TVNKIDSVYT DQISIEGKRI RYTDLLNDKN IGYYDIET
|
| |