Gene Information |
Locus tag | Cthe_1931 |
Symbol | |
ID | 4810789 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 2303858 |
End bp | 2305423 |
Gene Length | 1566 bp |
Protein Length | 521 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 640107347 |
Product | S-layer-like domain-containing protein |
Protein accession | YP_001038342 |
Protein GI | 125974432 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
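The coordinate and length fields in the table above are related by simple arithmetic. The sketch below checks that relationship under two assumptions that the record does not state explicitly: that Start bp and End bp are 1-based inclusive positions, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the stop codon


# Values copied from the Gene Information table for Cthe_1931.
START_BP = 2303858
END_BP = 2305423

length_bp = gene_length_bp(START_BP, END_BP)
length_aa = protein_length_aa(length_bp)
print(length_bp, "bp;", length_aa, "aa")
```

With the values in this record, the computed lengths agree with the Gene Length (1566 bp) and Protein Length (521 aa) fields. The strand ("-") does not affect the length calculation, only the orientation of the sequence on the replicon.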
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.000000922492 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATAAAAA GAAAGTTGTC ATTAAAAAGG GGATTGGCAT TTATCACGGC AACATTTATA ATTGTTCTTG TAACAGGTGT TTTCCGTGTA TATGCAAGAG AAGGGGACAG CGGATACGAA GGAGGCATCT CCAGCGGGGA AGCACCGGGA AAGACATCGT TCGAGTATAA AGAGGTGTGC TTTATAACCG GTGAACCCAT AGTATTTGAA GGAACTCTGA CCATAAGAAA AACGTTGAGG CAGGACAAAA CAACAGGAAA AAATGTGATT ACGGCAAATT ACACATACAA CCTCAGGAAT CTTGAAAAAG ACGCCACACT GACAAGAGTT CTGTCATATA CCACAACTCT TACCGAAAAG GAAAACGGCC AGGTAATTGA AGAAACGGGA TTTGGCGGAA GATGCACCGA AGTGGTGAGA ATCGGTTCCA CCACTTACAC CCTGGAAAAT TATGATTTTA CAAAGACCAA TATAAAAGAC AAAAAACCGG CGGTGGATTA TTTTGCCGGG AATCTGTGGG GGAAAAAGAC ATACCGGACC GGCACCGGTG CAAACAGCGG GGAAGTTACC GTTGAAATAT CGGGAGATTA TTATGGATAC AATCAGATTT GGGGAACCGT TGAAGCCCAG GTTTTAAATT ATGTAATAGA AAGTCAGAAA AGAAGCGGTG AAGTACTGGA CCGCTGGGGC GGAACTGCAA CTGTAAGCAT TTCGTCCACA ACAACCAAAA AAATTGACTA TGTTGAAAAC AAACCGGATG TTATAAGCTT TGAAGGAGGT TTTGTAGAGA GTCAGTACAA CAACAGCATA CTTCAGTATA CGGCAAAACT TCCCGAGTTT GACCATCAGG GAGTATCCAC CGACAGAATG GTTGAGACAA AAGGAAGCCT TATGATTGAA AGCTTTCCCA CCAGCCGAAG ACTTTTGGTG CCGGAACTGA GTCACCTGAG AGGACATTGG GCTGAAAATG ATATCAAGGC GCTGTACAGC CTGGAAATCT TCAAAGAAAA TCCGTCCGGT TTCAACCCCC AGGAAGTCAT GACGAGAGCC GAGTTTACCG AAGCCATTGT GCTGGCCGCG GGCGAAGTTC CGAAAGATCC GCTGCTTGTT GAGTCAAGAA CCACGAAAAA GACTTCCAAT ACCAAAGAAG AGATAACTTC CCCTTTTATC GACGTGCCAA CAGGAAGCAA ATATTTTGAA AGCATAAACA ATGCGTATAA AAGGGGAATG ATAAGCGGAA GAGGGGACGG AACTTTTGCA CCGGATGATT ATCTTACCAC GGCGGATGCC ATTACCATAT TGGTAAAAGC TTTAGGTCTT GAAGGACTTG CTCCTTCAAA CGGAGCGGTA ACGGTGTTCA GGGACAGCGA CGACATACCG AGATATGCGA AAGGTCCGGT TTATGTTGCC CATAGAATCG GTCTTGTGAT GGGTGACGAC AAGGGTTATC TGAGACCCAA CGAGTATCTT ACCAAGGCCA GGGCGGCGGT AATTATCAAT AATTTTATTG ACTATATGAG AAATGATTTA AGAAAGGATT ACAGGGAGAG AATAGTAAAT TATTAA
|
Protein sequence | MIKRKLSLKR GLAFITATFI IVLVTGVFRV YAREGDSGYE GGISSGEAPG KTSFEYKEVC FITGEPIVFE GTLTIRKTLR QDKTTGKNVI TANYTYNLRN LEKDATLTRV LSYTTTLTEK ENGQVIEETG FGGRCTEVVR IGSTTYTLEN YDFTKTNIKD KKPAVDYFAG NLWGKKTYRT GTGANSGEVT VEISGDYYGY NQIWGTVEAQ VLNYVIESQK RSGEVLDRWG GTATVSISST TTKKIDYVEN KPDVISFEGG FVESQYNNSI LQYTAKLPEF DHQGVSTDRM VETKGSLMIE SFPTSRRLLV PELSHLRGHW AENDIKALYS LEIFKENPSG FNPQEVMTRA EFTEAIVLAA GEVPKDPLLV ESRTTKKTSN TKEEITSPFI DVPTGSKYFE SINNAYKRGM ISGRGDGTFA PDDYLTTADA ITILVKALGL EGLAPSNGAV TVFRDSDDIP RYAKGPVYVA HRIGLVMGDD KGYLRPNEYL TKARAAVIIN NFIDYMRNDL RKDYRERIVN Y
|
| |
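The gene and protein sequences above can be cross-checked by translating the gene sequence with translation table 11. The following is a minimal sketch, not the database's own method: it builds the codon table from the standard NCBI ordering of bases (T, C, A, G), whose amino acid assignments table 11 shares with the standard code, and it assumes the gene sequence is shown 5' to 3' on the coding strand, as the ATG start and TAA stop suggest. Table 11 also allows alternative start codons, which this sketch does not handle; that does not matter here because the gene starts with ATG. The name `translate` is illustrative only.

```python
from itertools import product

# Codons in NCBI order (first base varies slowest, third fastest) and the
# amino acids assigned to them; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in this record) is ignored.
    """
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 and last 18 bases of the gene sequence, copied from the record.
first_bases = "ATGATAAAAA GAAAGTTGTC ATTAAAAAGG"
last_bases = "AGAATAGTAAAT TATTAA"

print(translate(first_bases))  # MIKRKLSLKR
print(translate(last_bases))   # RIVNY
```

The two fragments translate to the first ten and last five residues of the protein sequence listed above. Pasting the full gene sequence into `translate` would allow the whole protein to be compared in the same way.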