Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Pcal_0842 |
Symbol | |
ID | 4908818 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum calidifontis JCM 11548 |
Kingdom | Archaea |
Replicon accession | NC_009073 |
Strand | + |
Start bp | 803372 |
End bp | 804460 |
Gene Length | 1089 bp |
Protein Length | 362 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 640124591 |
Product | cellulase |
Protein accession | YP_001055734 |
Protein GI | 126459456 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1363] Cellulase M and related proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACGACT TCTTTGAACT GCTGAAAAAA TTGGCGGAGG CTAGGGGCCC CTCGGGCTTT GAGGACGAGG TGAGGGAACT CGTGGCAAGG GAAATGGAGC CGTTTGTCGA CGAAGTCGTG GTAGACCGGT GGGGCAACGT AATCGGCGTC AAGAGGGGCT CCACCAACTA CAGGGCCATG GTGGCCGCCC ACATAGACGA AATTGGGCTA GTGGTAGACC ACATAGAGAA GGAGGGCTTT CTCCGCGTGA GAGGCATCGG CGGGTGGAAC GAGGTCACCC TAGTGGGCCA GAGAGTGTGG GTGAGGACTA GAGACGGCAA GTGGATACGC GGCGTCGTCG GCGTCACTCC TCCGCACATT ACGCCGTCTG GCAAAGAGCG CGAGGCCCCC GAGATGAAGG ACTTGTTCAT AGATATAGGG GCCAGAGACA GAGAAGAGGC AGAGAAGCTG GGGGTCACCA TAGGCTCTGT CGCCGTGTTA GACAGAGACG TGGTCAAGCT TCAGAACGAC GTTGTGGCTG GCAAGGCGTT TGACGACAGA GTCGGCGTCG CCGTCATGTT GTACGCGCTG AGGATGCTCA AGGAGACTCC CACGACTGTA TACACCGTGG CCACTGTGCA AGAGGAGGTG GGGCTGAGGG GCGCGCAGAT AGCGGCGGAG AAGGTGTCCC CCCACTACGC CATTGCTCTC GATACCACCA TTGCAGCAGA TGTGCCAGGG GTGCCAGAGC GGCAACACAT AGTAAAAGTG GGGAAGGGGC CCGCGATTAA GGTCATCGAC GGCGGCAGAG GAGGGCTGTT CATAGCCCAC CCGCCTCTGC GCAACCACAT TATCAAAGTT GCAGAGGAGC TGGGCATCCC CTTCCAACTG GAAGTCCTCT ACGGCGGAAC CACAGACGCC ATGGCTATAG CGTTTAGGCG AGAGGGCGTG CCCACTGCGG CAATCTCCGT GCCAACGCGC TACGTCCACT CCCCCGTGGA GGTTCTGAGC CTCAGCGACG CCGTGAATGC AGCGCGGCTA TTAGCCGCCG TGTTAGAAAA AACTAAGCCT AATATTATAG AGATGTTCCT TGATAAGAAA ATCAAGTAA
|
Protein sequence | MDDFFELLKK LAEARGPSGF EDEVRELVAR EMEPFVDEVV VDRWGNVIGV KRGSTNYRAM VAAHIDEIGL VVDHIEKEGF LRVRGIGGWN EVTLVGQRVW VRTRDGKWIR GVVGVTPPHI TPSGKEREAP EMKDLFIDIG ARDREEAEKL GVTIGSVAVL DRDVVKLQND VVAGKAFDDR VGVAVMLYAL RMLKETPTTV YTVATVQEEV GLRGAQIAAE KVSPHYAIAL DTTIAADVPG VPERQHIVKV GKGPAIKVID GGRGGLFIAH PPLRNHIIKV AEELGIPFQL EVLYGGTTDA MAIAFRREGV PTAAISVPTR YVHSPVEVLS LSDAVNAARL LAAVLEKTKP NIIEMFLDKK IK
|
| |