Gene Information |
Locus tag | Cthe_1307 |
Symbol | |
ID | 4809560 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 1586303 |
End bp | 1588198 |
Gene Length | 1896 bp |
Protein Length | 631 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 640106731 |
Product | cellulosome anchoring protein, cohesin region |
Protein accession | YP_001037732 |
Protein GI | 125973822 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
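
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 1586303
end_bp = 1588198
gene_length_bp = 1896
protein_length_aa = 631

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# An unspliced CDS should be a whole number of codons.
# Assumption: the last codon is the stop codon and is not translated.
codons, remainder = divmod(span, 3)
expected_protein_length = codons - 1

print(span, remainder, expected_protein_length)  # 1896 0 631
```

Under these assumptions the reported gene length (1896 bp) and protein length (631 aa) agree with the start and end coordinates.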
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGGAAGA AAAAAAGATT AATATCATTA CTGCTTGCGG TTTTTATCGC CGTTGCATGT CTGCCGGCGG GAATTGCAAG GGCAGATAAA GCCTCGAGCA TTGAGCTTAA GTTTGACCGC AATAAGGGAG AAGTTGGAGA TATACTTATT GGTACCGTAA GGATAAACAA TATCAAGAAT TTCGCAGGAT TTCAGGTAAA CATTGTATAT GATCCAAAAG TCTTAATGGC TGTTGACCCT GAAACGGGGA AAGAATTTAC TTCTTCAACA TTTCCGCCAG GACGCACTGT ACTGAAAAAC AATGCTTACG GCCCAATACA GATTGCGGAC AATGATCCGG AAAAAGGGAT ACTGAACTTC GCGCTTGCAT ATTCATATAT TGCGGGATAC AAAGAAACAG GAGTAGCGGA GGAAAGCGGC ATAATTGCGA AAATTGGATT TAAAATACTC CAGAAAAAGA GCACTGCCGT AAAATTCCAG GATACATTAA GCATGCCCGG AGCTATTTCG GGAACACAGC TGTTTGACTG GGACGGAGAA GTTATTACCG GATATGAGGT AATACAGCCG GATGTGCTGA GTTTGGGTGA CGAGCCTTAT GAGACACCGG GAACGGATAT TCCGATATCC GACAATCCGG CAGCAACTCC GTCATCCACG CCGTCAGTTA CTCCTTCACC GGAAGTTAAA CCGACTCAGA CGCCTTCGCC TGCAGAAAAT TCTGCAAAAG TGGAGCTTGA ACCTGTGTTG GATAATGCAA CAGGAGAAGC AAAGGCGGCA ATAGATGAAG AAAAATTAAA CAAGGCTCTT GATGAAGCGA AAAAATCGGA AGATGACAAA CTTGTGGAAC TTAACATAAA GAAGGTTGAA AATGCCGATG CTTACATACA ACAGCTTCCG GCGAAATTCC TGATAAAAAG TGACGCCGAA TATAAGCTGA GAATAGCTAC AGAGCAGGGA ATTATAGAAG TACCGGCCAA CATGCTGAAT ACTGCGGATA TTTCAAAGCT TGTAAAAAAT GACTCCGTTG TTGAATTCGT CATAAGAAAA GTAAAAGTCG ATGAACTTGG TGCAGAGCTC AAAGAGAAGA TAGGCAACAG GCCGGTGATT GACATAAGCG TGGTTGTTGA CGGCAAAAAA GTTGAATGGA GCAATTACAA AGCCAAGGTT AAAATATCAA TTCCTTACAA GCCTGATGCA AAAGAGCTGG AGAACCACGA GCATATTGTT GTACTCCATA TTGATGACGC CGGCAAGGCA GTTTCCGTAC CCAGCGGAAA ATATGAACCT TCTTTGGGCG TCGTTACGTT TGAGACGAAT CATTTAAGCA AGTATGCGGT TTCATATGTT TACAAGACTT TCGCGGATAT TGGTTCATAT GCCTGGGCTA AAAAGCAGAT AGAGGTTTTG GCTTCCAAAG GAGTAATTAA CGGTACATCC GATACCACTT TTACGCCCCA GGCAGACATA ACAAGGGCGG ATTTCATGAT ACTTCTTGTA AAGGCACTGG GATTGACTGC CGAGGTTACT TCCAATTTTG ATGATGTGTC CGAAAAAGAC TACTATTATG AATACGTGGG AATTGCAAAA GAGCTTGGAA TTACGACAGG AGTCGGAAAC AACAAGTTCA ATCCGAAAGC CAAAATTACA AGACAGGATA TGATGGTACT TACAACAAAT GCTCTCAGGA TTGCAGGAAA AATATCGAGC ACAGGAACCC GCGCTGATGT TGAAAGATTT TCGGACAAGG ACCAGATAGC TTCATATGCG GTTGAAGGCG TTGCAACCTT GGTAAAAGAA 
GGTATTGTAG TGGGAAGCGG CGATATTATA AATCCAAGGG GAAATGCTTC AAGAGCCGAA CTTGCAGCAA TCATATACAA GATTTACTAC AAGTAA
|
Protein sequence | MRKKKRLISL LLAVFIAVAC LPAGIARADK ASSIELKFDR NKGEVGDILI GTVRINNIKN FAGFQVNIVY DPKVLMAVDP ETGKEFTSST FPPGRTVLKN NAYGPIQIAD NDPEKGILNF ALAYSYIAGY KETGVAEESG IIAKIGFKIL QKKSTAVKFQ DTLSMPGAIS GTQLFDWDGE VITGYEVIQP DVLSLGDEPY ETPGTDIPIS DNPAATPSST PSVTPSPEVK PTQTPSPAEN SAKVELEPVL DNATGEAKAA IDEEKLNKAL DEAKKSEDDK LVELNIKKVE NADAYIQQLP AKFLIKSDAE YKLRIATEQG IIEVPANMLN TADISKLVKN DSVVEFVIRK VKVDELGAEL KEKIGNRPVI DISVVVDGKK VEWSNYKAKV KISIPYKPDA KELENHEHIV VLHIDDAGKA VSVPSGKYEP SLGVVTFETN HLSKYAVSYV YKTFADIGSY AWAKKQIEVL ASKGVINGTS DTTFTPQADI TRADFMILLV KALGLTAEVT SNFDDVSEKD YYYEYVGIAK ELGITTGVGN NKFNPKAKIT RQDMMVLTTN ALRIAGKISS TGTRADVERF SDKDQIASYA VEGVATLVKE GIVVGSGDII NPRGNASRAE LAAIIYKIYY K
|
| |
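
The sequences above are displayed in space-separated groups of 10 characters. The sketch below shows one way to strip that grouping, compute a GC fraction, and translate a nucleotide sequence. It is an illustration under stated assumptions, not the method used to generate this record: the helper names (`clean`, `gc_fraction`, `translate`) are hypothetical, and the codon table is the standard amino-acid assignment, which translation table 11 shares (table 11 differs in which codons may act as starts). Only the first three groups of the gene sequence are used here.

```python
# Standard codon-to-amino-acid assignment in NCBI "TCAG" order.
# Translation table 11 uses the same assignments for elongation codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display grouping."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three 10-base groups of the gene sequence in this record.
prefix = "ATGAGGAAGA AAAAAAGATT AATATCATTA"
print(translate(prefix))  # MRKKKRLISL
print(gc_fraction(prefix))
```

The translated prefix matches the first group of the protein sequence (MRKKKRLISL). Passing the full gene sequence to `gc_fraction` would give a value to compare with the GC content reported for this gene.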