Gene Information |
Locus tag | Tneu_0542 |
Symbol | |
ID | 6165759 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermoproteus neutrophilus V24Sta |
Kingdom | Archaea |
Replicon accession | NC_010525 |
Strand | + |
Start bp | 494837 |
End bp | 495925 |
Gene Length | 1089 bp |
Protein Length | 362 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 641667695 |
Product | cellulase |
Protein accession | YP_001793931 |
Protein GI | 171185012 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1363] Cellulase M and related proteins |
TIGRFAM ID | |
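The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the terminal stop codon.
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(494837, 495925)
    print(length, "bp")                    # Gene Length field
    print(protein_length(length), "aa")    # Protein Length field
```

Under these assumptions the record is internally consistent: the coordinates give 1089 bp, and 1089 bp corresponds to 363 codons, that is 362 residues plus a stop codon.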
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.192278 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.00117454 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGGAGGACT TCGTCGCTCT CCTAAAGACC CTATCCGAGG CGAGAGGCCC CTCCGGCTTC GAGGACGAGG TGAGGGAGAT CGTGATAAAG GAGATGGAGC CGTACGTAGA CGAGGTGTTG GTGGACAGGT GGGGGAACGT AATAGGGGTC AAGAGGGGGG CCTCCGAGGT CCGGGCCATG GTGGCGGCCC ACATGGACGA GATCGGGCTG GTTGTAGACC ACGTCGAGAA GGAGGGCTTT CTAAGGTTTA GGCCGATCGG CGGCTGGAAC GAGGTGACGC TGCTCGGCCA GCGGGTGTGG GTGAGGACTC AAGATGGGAG GTGGGTCAGG GGGGTCGTAG GCGTTACGCC GCCGCATGTG ACCCCCTCCG GCCACGAGAG GGAGGCCCCG GAGATGAAAG ACCTCTACAT AGACGTGGGG GCTAGAAGCA GGGAGGAGGC CGAGAAGATG GGCATCTCCG TCGGCTCCGT GGCCGTCCTC GAGAGGGAGC TGGCCGTCTT AAACGGGAGG GTTGCGACGG GCAAGGCCTT CGACGACAGG GTGGGCCTCG CCGTTATGTT GTACACCCTG CGGCAACTTG GCGACCTCCC CGTGACCCTA TACGCCGTCG CCACGGTGCA GGAGGAGGTG GGCCTCCGGG GGGCCCAGAT AGCGGCGGAT CGGATAGCCC CCCACTACGC GGTGGCCCTA GACACCACCA TAGCCGCCGA CGTGCCGGGT GTAGGCGAGA GGCTACACGT GACTAAGCTG GGCGCGGGGC CCGCCATAAA GGTAATCGAC GGCGGCCGCG GCGGCCTCTT CATAGCGCAC CCCGGGCTGA GGGACCACAT CGTGAAAATC GCCAGGGAGG CCGGCATCCC CCACCAGCTT GAGGTGCTAT ACGGCGGCAC CACAGACGCC ATGGCCATAG CCTTTAGGCG GGAGGGCGTG CCCGCCGCCG CCATCTCCAT ACCCACGCGC TACGTCCACT CGCCGGTGGA GCTGGTGGAT CTGTCAGACG CGTTGAACGC GTCGCGGCTA CTCAAGCAGG TGCTTGAGAA AACGACGCCG GCGGCGGTGG AGAAGTTCCT GGAGAGGAGG GTGAAGTGA
|
Protein sequence | MEDFVALLKT LSEARGPSGF EDEVREIVIK EMEPYVDEVL VDRWGNVIGV KRGASEVRAM VAAHMDEIGL VVDHVEKEGF LRFRPIGGWN EVTLLGQRVW VRTQDGRWVR GVVGVTPPHV TPSGHEREAP EMKDLYIDVG ARSREEAEKM GISVGSVAVL ERELAVLNGR VATGKAFDDR VGLAVMLYTL RQLGDLPVTL YAVATVQEEV GLRGAQIAAD RIAPHYAVAL DTTIAADVPG VGERLHVTKL GAGPAIKVID GGRGGLFIAH PGLRDHIVKI AREAGIPHQL EVLYGGTTDA MAIAFRREGV PAAAISIPTR YVHSPVELVD LSDALNASRL LKQVLEKTTP AAVEKFLERR VK
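The relationship between the gene sequence, the protein sequence and the GC content field can be illustrated with a short script. This is only a sketch, not the database's own pipeline. It assumes that translation table 11 assigns amino acids exactly as the standard code does and that an alternative start codon such as GTG is read as methionine at the first position. The helper names (`clean`, `gc_fraction`, `translate_cds`) are hypothetical, and `START_CODONS` lists only a subset of the start codons that table 11 permits.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard code; table 11 uses the same assignments).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Assumption: a subset of the start codons permitted by translation table 11.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(seq: str) -> str:
    """Remove the spaces between the 10-letter blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in s) / len(s)


def translate_cds(seq: str) -> str:
    """Translate a CDS, reading a recognized start codon as M and stopping at the first stop codon."""
    s = clean(seq)
    if len(s) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    residues = []
    for pos in range(0, len(s), 3):
        codon = s[pos:pos + 3]
        aa = CODON_TABLE[codon]
        if pos == 0 and codon in START_CODONS:
            aa = "M"  # initiator codon is translated as methionine
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    # First three 10-letter blocks of the gene sequence shown above.
    print(translate_cds("GTGGAGGACT TCGTCGCTCT CCTAAAGACC"))  # MEDFVALLKT
```

The printed residues match the first block of the protein sequence, which begins with M even though the gene starts with GTG. Passing the complete Gene sequence field to `translate_cds` and `gc_fraction` would allow the Protein sequence and GC content fields to be checked in the same way.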