Gene Pcal_1616 details


Gene Information

Locus tag: Pcal_1616
Symbol:
ID: 4908148
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pyrobaculum calidifontis JCM 11548
Kingdom: Archaea
Replicon accession: NC_009073
Strand:
Start bp: 1505971
End bp: 1508991
Gene Length: 3021 bp
Protein Length: 1006 aa
Translation table: 11
GC content: 56%
IMG OID: 640125363
Product: glycoside hydrolase family protein
Protein accession: YP_001056499
Protein GI: 126460221
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1449] Alpha-amylase/alpha-mannosidase
        [COG4945] Membrane-anchored protein predicted to be involved in regulation of amylopullulanase
TIGRFAM ID:
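
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the Start bp and End bp values are 1-based and inclusive, and that the gene length counts one terminal stop codon. The function names are hypothetical and are not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1505971
END_BP = 1508991
GENE_LENGTH_BP = 3021
PROTEIN_LENGTH_AA = 1006


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in a single stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1  # drop the stop codon


if __name__ == "__main__":
    # Both comparisons hold for this record.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)               # True
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)  # True
```

Under those assumptions, 1508991 - 1505971 + 1 = 3021 bp, and 3021 / 3 = 1007 codons, i.e. 1006 residues plus one stop codon, which agrees with the listed protein length.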


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 33
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGATCC TCGCCCTCCT CTTGTCGGCC CTCGTACTGG CCCAGACCGT CAACGTCGTG 
GTGATATTGC ACAACCACCA GCCCTGGTAC ATAGACTTGG AGAGGGGGGA GCTCGTGCTC
CCCTGGGTGA GAATGCACGC GGTTGGCAAC TACCTAAAGG TGCCTCTCTT GATAAATCAG
ACGGGGGTGT CCGTGGCCTT CACTCTCTCC GGTAGCTTGA TAGAACAGTT GAACTGGTAT
GCCAATGGCA CATATCTCGA TGCCAGGTTT AAAATCTCGG AGAAGTTGGC AAGGGGAGAG
GCCTTGACGG CGGAGGAGAA GTACTCCATG TTAAAAGTGC CAGGGGGCTT CTTCGACATC
AATTGGCAGA ATATCCTCTA CAAGAATCCG CGCTACACCG CGCTCCTGGG GATTAGAAAC
GACGCCTTTA ACAAGTGCCC GCCCGGAGAT GTTACATGTG TAGTGTCGAA ATTCAGCGAC
CAAGACTTCG TCGACTTGGC CACGTTGTTC AACCTAATGT GGATAGACCC CTACATAGCT
AGGCAGAGGC CAGACATATG GGCTCTTAGG AATAAGACAA GTTTCACGAG GGATGACTTG
GCCAAGGTTC TACAATTCCA CATAGAGTTA ATAAAAGAGG TTTTGCCCCT CTACAAGAGG
CTCGCAGAAC AAGGCCGCAT TGAGCTTGTC CCAGTGCCGT ACTCCCACCC GCTGATGCCG
CTCTTGGCGG ACATGGGGGC AGTAGACGAC TTGAGGCTAC ACATACAGCT CTCCAACGGC
CTCTTCAGAA GGTACTTAGG CGCCTCGCCC CTGGGCGTCT GGCCGCCTGA ACAGGCGGTA
AACGACGAAG TTTTGAGACT GTTCGCAGAA GAGGGCTATC TGTGGACTGT GACCGACGAA
GACGTGCTTA AGATGACTAT GCCTGGCAAG AGCCACTTCC AGCTCTACTA CGCCGACTAC
GGGGGGAGAC GCATCTACGT CTTCTTTAGA GACAAGACTC TCTCTGACAA CATTGGGTTT
AGGTACTCAT CCATGAGTCC GCAGGCGGCG CTCGCGGACT TTGTAAACTA CCTAAGGCGG
GTGCCGCGCG GCGACTGCAA CGTAGTCGTA ATAGCGCTAG ATGGGGAAAA CCCCTGGGAG
AACTACCCCA ACTTCGGCGA CGACTTCTTG CTACAGTTCT TCGGCGGATT GGCCCAGCTT
GAGAAAAATG GCACAATTAA GCTTTGGAAG CCCACAGACT TCGTAAAAGC GTGTGGAGAT
AAGGCAGAGC CGCTCCCGCA GCGGGAGTTC CAGTACTACA ACCTCGGCGT AGACATATCT
TTCTACAACT CGATACGGGA GTTGCCCACG CGGACCGTGT TGGGGAAAAT CGCCGAGGGC
TCTTGGTCCT CGGGGGGTAG CCTGGCCGTG TGGATTGGAG ACCCCGACGA AAATGCGTGG
TGGATGTGGC TTAAGAAGGC GCGGGAGGAC GTCGGCGCGG CCAAGAGCTG GGATGTGCTA
TTCCCACTGC TTGTTGCTGA GGCAAGCGAC TGGCCCTTCT GGTACGGCGG AGACATGGGA
TCTCCGCAGA CGTTTGACCC AGTGGCAAAG GCGGCCTTGA GGGCGTATTA CCAGAGAGCC
GGCTTGGAGC CGCCGGCGTA TTTATACACC ACGGCCTACC CGGCGGGCAT CCCCAGGGAA
GACAAAGTGG CTGGGCAGGG GCATGGAAGC GTAAAGGCGC AAGACGCCAC TATTTACGTT
AACACAACCC ACGTGTGGGT GTCCGGAGGC AGATGCGGCG TCGTTTACAT CTCTAACCCC
AATGTGCCAA GGTCGCCGTA TGTGCCCCGT GGAGCAGTGT TCGGACTGCG CGGGGAGAGG
CTGGACATAT ACGCAGACAT GGCTCTTGAC ACGTGTAATG GGACTGTGTA TCTCGCAGAT
GGGGGCAAAT TCGTCGCCGT GGGCAACAAC GCTCTTCTAA GCCTAATAGG CGCAAAGCCA
GGCGGCAAAC TGTACGTAGA GTTCAACGGC TTCGTCTACG TCTTAGGAAT ACCTGAGACA
ACCACATCAG CTAGGCTAGT AATGTCGGCA GAGGACCCGC CTGGAGACGA CTTTGGCCCC
GGGAGCTATA ACTATCCCAA AAACCCCGCC TTTAGGCCAG GCGTGTTCGA CCTACTCGGC
ATGGAGGTGT ACGACCTCGG AGACAAGCTC AGGTTCGTTT TCAGAGTCAG AGAGCTTGGA
GGCAACCCCT GGGGCGGGCC AGCCGGCTTT TCTCTCCAGT TCTTCCACGT GTACATAAAT
AGGGGACGCG GCACGAGAAA CGACACCCTC GGCCTTAGAG TCGCGCTCTG TAGAGACGCG
GCGTGGGACG CGGCCTTGCT CATAGGGCCT GGGTGGAGCG GGGGGAACCG GATAGTGTTT
GCAGACGGCT CATTCATAGA CGACGCCATT TCCATTAGGG CTGGGCCCAA TAACACCGTG
GTGGCAGATG TGCCAAAGAA GTACATAGGT GACTTTGACC CCAAGTGGAG GCTCACGGTG
TTTTTAACCT CGTGGGACGG CTACGGCCCA GACAATATCA GAAACTTCGG AGTTATGGCG
GACGAGTGGA CTGTGGGCGG CGCAGACCCC GCGGCCGTCC TCGCCGGCGT GGCGCCGAGA
GTGTTCGACC TACTAGCGCC AGCGGCCGAG GAGCAGAAAA AGGCCTTGTC CACCTACAAG
GTGAGTAGAC TGCCCAACGG CACCTACGTG GGGACGCCGG CGACTGTGTG TACATACGTC
TCCCCCAGCA GAGCAGAGAC GCAGACAACC ACAGTAGTAC AAACAGTGGT ACAATACGTG
ACGCAACAGA CTACGGAGAC CGTAACAAGA GAAACCACGT ATACCACGAC GTACATATCG
ACTACGACTA CTACGCTTGT AGAAAAAGTA GTAGACTGGG GCATAGCGGC AATTCCCGTG
GCCTTAATGG CCCTAATTAC TGTGGCGGCC ATCGCCTTGG CGCTGGCTTT AGCCAAGAGG
AGGCCGCGAC AATCTATATA A
 
Protein sequence
MKILALLLSA LVLAQTVNVV VILHNHQPWY IDLERGELVL PWVRMHAVGN YLKVPLLINQ 
TGVSVAFTLS GSLIEQLNWY ANGTYLDARF KISEKLARGE ALTAEEKYSM LKVPGGFFDI
NWQNILYKNP RYTALLGIRN DAFNKCPPGD VTCVVSKFSD QDFVDLATLF NLMWIDPYIA
RQRPDIWALR NKTSFTRDDL AKVLQFHIEL IKEVLPLYKR LAEQGRIELV PVPYSHPLMP
LLADMGAVDD LRLHIQLSNG LFRRYLGASP LGVWPPEQAV NDEVLRLFAE EGYLWTVTDE
DVLKMTMPGK SHFQLYYADY GGRRIYVFFR DKTLSDNIGF RYSSMSPQAA LADFVNYLRR
VPRGDCNVVV IALDGENPWE NYPNFGDDFL LQFFGGLAQL EKNGTIKLWK PTDFVKACGD
KAEPLPQREF QYYNLGVDIS FYNSIRELPT RTVLGKIAEG SWSSGGSLAV WIGDPDENAW
WMWLKKARED VGAAKSWDVL FPLLVAEASD WPFWYGGDMG SPQTFDPVAK AALRAYYQRA
GLEPPAYLYT TAYPAGIPRE DKVAGQGHGS VKAQDATIYV NTTHVWVSGG RCGVVYISNP
NVPRSPYVPR GAVFGLRGER LDIYADMALD TCNGTVYLAD GGKFVAVGNN ALLSLIGAKP
GGKLYVEFNG FVYVLGIPET TTSARLVMSA EDPPGDDFGP GSYNYPKNPA FRPGVFDLLG
MEVYDLGDKL RFVFRVRELG GNPWGGPAGF SLQFFHVYIN RGRGTRNDTL GLRVALCRDA
AWDAALLIGP GWSGGNRIVF ADGSFIDDAI SIRAGPNNTV VADVPKKYIG DFDPKWRLTV
FLTSWDGYGP DNIRNFGVMA DEWTVGGADP AAVLAGVAPR VFDLLAPAAE EQKKALSTYK
VSRLPNGTYV GTPATVCTYV SPSRAETQTT TVVQTVVQYV TQQTTETVTR ETTYTTTYIS
TTTTTLVEKV VDWGIAAIPV ALMALITVAA IALALALAKR RPRQSI
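
As a rough illustration of how the gene sequence relates to the protein sequence, the sketch below translates the first and last blocks of the gene sequence. It is a minimal example rather than the pipeline used to produce this record: the `translate` helper and `CODON_TABLE` are hypothetical names, and the table hard-codes the standard codon assignments, which translation table 11 shares for elongation (table 11 differs only in the set of permitted start codons).

```python
# Standard codon assignments in NCBI ordering (bases T, C, A, G).
# Translation table 11 uses the same codon-to-residue mapping.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string; whitespace between blocks is ignored."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


if __name__ == "__main__":
    # First 30 nt of the gene sequence above.
    print(translate("ATGAAGATCC TCGCCCTCCT CTTGTCGGCC"))  # MKILALLLSA
    # Last 21 nt of the gene sequence above, including the TAA stop codon.
    print(translate("AGGCCGCGAC AATCTATATA A"))           # RPRQSI*
```

The two outputs match the first ten and last six residues of the protein sequence listed above; the trailing `*` marks the stop codon, which is not part of the 1006-residue protein.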