Gene Pcal_0842 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPcal_0842 
Symbol 
ID4908818 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum calidifontis JCM 11548 
KingdomArchaea 
Replicon accessionNC_009073 
Strand
Start bp803372 
End bp804460 
Gene Length1089 bp 
Protein Length362 aa 
Translation table11 
GC content58% 
IMG OID640124591 
Productcellulase 
Protein accessionYP_001055734 
Protein GI126459456 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1363] Cellulase M and related proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACGACT TCTTTGAACT GCTGAAAAAA TTGGCGGAGG CTAGGGGCCC CTCGGGCTTT 
GAGGACGAGG TGAGGGAACT CGTGGCAAGG GAAATGGAGC CGTTTGTCGA CGAAGTCGTG
GTAGACCGGT GGGGCAACGT AATCGGCGTC AAGAGGGGCT CCACCAACTA CAGGGCCATG
GTGGCCGCCC ACATAGACGA AATTGGGCTA GTGGTAGACC ACATAGAGAA GGAGGGCTTT
CTCCGCGTGA GAGGCATCGG CGGGTGGAAC GAGGTCACCC TAGTGGGCCA GAGAGTGTGG
GTGAGGACTA GAGACGGCAA GTGGATACGC GGCGTCGTCG GCGTCACTCC TCCGCACATT
ACGCCGTCTG GCAAAGAGCG CGAGGCCCCC GAGATGAAGG ACTTGTTCAT AGATATAGGG
GCCAGAGACA GAGAAGAGGC AGAGAAGCTG GGGGTCACCA TAGGCTCTGT CGCCGTGTTA
GACAGAGACG TGGTCAAGCT TCAGAACGAC GTTGTGGCTG GCAAGGCGTT TGACGACAGA
GTCGGCGTCG CCGTCATGTT GTACGCGCTG AGGATGCTCA AGGAGACTCC CACGACTGTA
TACACCGTGG CCACTGTGCA AGAGGAGGTG GGGCTGAGGG GCGCGCAGAT AGCGGCGGAG
AAGGTGTCCC CCCACTACGC CATTGCTCTC GATACCACCA TTGCAGCAGA TGTGCCAGGG
GTGCCAGAGC GGCAACACAT AGTAAAAGTG GGGAAGGGGC CCGCGATTAA GGTCATCGAC
GGCGGCAGAG GAGGGCTGTT CATAGCCCAC CCGCCTCTGC GCAACCACAT TATCAAAGTT
GCAGAGGAGC TGGGCATCCC CTTCCAACTG GAAGTCCTCT ACGGCGGAAC CACAGACGCC
ATGGCTATAG CGTTTAGGCG AGAGGGCGTG CCCACTGCGG CAATCTCCGT GCCAACGCGC
TACGTCCACT CCCCCGTGGA GGTTCTGAGC CTCAGCGACG CCGTGAATGC AGCGCGGCTA
TTAGCCGCCG TGTTAGAAAA AACTAAGCCT AATATTATAG AGATGTTCCT TGATAAGAAA
ATCAAGTAA
 
Protein sequence
MDDFFELLKK LAEARGPSGF EDEVRELVAR EMEPFVDEVV VDRWGNVIGV KRGSTNYRAM 
VAAHIDEIGL VVDHIEKEGF LRVRGIGGWN EVTLVGQRVW VRTRDGKWIR GVVGVTPPHI
TPSGKEREAP EMKDLFIDIG ARDREEAEKL GVTIGSVAVL DRDVVKLQND VVAGKAFDDR
VGVAVMLYAL RMLKETPTTV YTVATVQEEV GLRGAQIAAE KVSPHYAIAL DTTIAADVPG
VPERQHIVKV GKGPAIKVID GGRGGLFIAH PPLRNHIIKV AEELGIPFQL EVLYGGTTDA
MAIAFRREGV PTAAISVPTR YVHSPVEVLS LSDAVNAARL LAAVLEKTKP NIIEMFLDKK
IK