Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_0404 |
Symbol | |
ID | 4808407 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 504225 |
End bp | 505793 |
Gene Length | 1569 bp |
Protein Length | 522 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 640105818 |
Product | type 3a, cellulose-binding |
Protein accession | YP_001036835 |
Protein GI | 125972925 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.0894635 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATCTTG GAGTGGTAAT AAAAATAAAA AGGAAGAAGG CCATAATTGT TACGGAAACC GGCGAATTTA AAGCTGTAAA TGCCAGAAAC GGTATGTTTT TGGGACAAAA GATTTTATTT GATCAGCAAG ATGTTATTGA AAATAACAGA AATGGCATTG GTCTTGCATA TTCTGCAGCT ATAGCGGGAA TGGTTGCTGT TTTTGTATTC ATGTTTACAT ATTTCGGCTT GCATAATTTT AATGGCACTT TTGCATATGT TGACGTGGAT ATAAATCCAA GTGTCGAATT TGCGGTAAAC AGGGACGGTA TTGTTGTAAA TGCCGAACCG CTTAATGATG ATGGGAGAAA AGTACTGGAA GAGTTGATAT ATAAAGATGC TTTGCTGGAA GATGTGATTT TGGATCTGGT TGACAAGTCG AGAAAGTACG GATTTATAGA AGATAATGAT AGGAAGAATA TCATATTGAT TTCGGCAGCG TTAAACAGTG ATGAGCAGGA ACAAAGAAAT GACTTTGAAA AGAAGCTGGT TGACAATTTA ATGCCGGAAC TTGAGAATTT GGATGTAAAT ATTGAAATGA GGTTTGTCAT TGCCTCAAAA GAGCAAAGGA AGAAGGCACA GGAAAACAAA GTGTCCATGG GTAAGTATAT GATTTATGAA ATGGCGAGAC GGCAAGGTGA AAAACTGACT TTGGAGTCAA TTATGTCAGA AACATTGGAA AATTTACTTT TGGGTCAGGA CTTTGGTGTA ATTGAAACTG AGAAAACACC TGTGAATACA CCGGTTAAAT CTACTGCTAC TCCGACGAAG GCGCTGGCTG CCGAGATTAC TCCCACAAAG ACACCGGAAC AGGTTGTGAT GACGCCTGCA AATACGCCGG CTAAGCCTAC AGCTGCTCCA ACAAAGGCAC CGGCTGCTGT GGCTGTGACC TCGGCAAAAA CACCGGAAAG AGCTACGACA GTGCCTGTGA ATACACCGGT TAAACCTACG GATGCTCCGA CAAAATCACC GGCCACTGCC ACAGCAACTG CAACCAGGGC ACCTGTAAAA GCTACAGCAA CACCTGCGAA GACACTCAAA CCATCAGACA CTCCTGTAAA GACCCCGGAT GGTGAGCAGA GTGTCAAAGT GAGGTTCTAC AACAATAACA CTTTGTCTGA AACCGGTGTA ATTTACATGA GAATAAATGT TATTAACACC GGAAATGCAC CTTTGGACCT TTCGGATTTA AAACTAAGAT ATTATTACAC TATTGACAGT GAGAGTGAAC AGAGATTCAA CTGTGATTGG TCGTCCATTG GAGCTCACAA TGTAACGGGA AGTTTCGGAA AGGTAAATCC ATCTCGAAAC GGAGCGGATA CTTATGTTGA AATAGGATTT ACAAAAGAAG CTGGAATGCT TCAACCGGGC GAAAGCGTTG AACTTAATGC GCGCTTTTCA AAAACTGACA ATACACAGTA TAATAAAGCA GATGATTATT CATTTAATTC CCATTATTAC GAATATGTAG ACTGGGACAG AATTACAGCG TATATTTCCG GCATTTTAAA ATGGGGAAGA GAACCATGA
|
Protein sequence | MNLGVVIKIK RKKAIIVTET GEFKAVNARN GMFLGQKILF DQQDVIENNR NGIGLAYSAA IAGMVAVFVF MFTYFGLHNF NGTFAYVDVD INPSVEFAVN RDGIVVNAEP LNDDGRKVLE ELIYKDALLE DVILDLVDKS RKYGFIEDND RKNIILISAA LNSDEQEQRN DFEKKLVDNL MPELENLDVN IEMRFVIASK EQRKKAQENK VSMGKYMIYE MARRQGEKLT LESIMSETLE NLLLGQDFGV IETEKTPVNT PVKSTATPTK ALAAEITPTK TPEQVVMTPA NTPAKPTAAP TKAPAAVAVT SAKTPERATT VPVNTPVKPT DAPTKSPATA TATATRAPVK ATATPAKTLK PSDTPVKTPD GEQSVKVRFY NNNTLSETGV IYMRINVINT GNAPLDLSDL KLRYYYTIDS ESEQRFNCDW SSIGAHNVTG SFGKVNPSRN GADTYVEIGF TKEAGMLQPG ESVELNARFS KTDNTQYNKA DDYSFNSHYY EYVDWDRITA YISGILKWGR EP
|
| |