Gene Pcal_0233 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPcal_0233 
Symbol 
ID4909383 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum calidifontis JCM 11548 
KingdomArchaea 
Replicon accessionNC_009073 
Strand
Start bp232172 
End bp233314 
Gene Length1143 bp 
Protein Length380 aa 
Translation table11 
GC content60% 
IMG OID640123985 
Producthypothetical protein 
Protein accessionYP_001055136 
Protein GI126458858 
COG category[R] General function prediction only 
COG ID[COG1373] Predicted ATPase (AAA+ superfamily) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.230022 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAAGTGGT GGCCCGAGCT CATTGACGAA ATCTCGCTGA AGCCCCCCGC GCTCCACTTC 
CTCTTCGGCC CTCGGCAGGT GGGCAAGACG ACTCTGCTAA AGTTGCTCGT GAAAAAGCTC
GTGGAGGGCG GGAGAGATCC CCGTACTATT TTCTACTACA CCTGCGAAAT GGCGGCTGAC
CACAGAGAGC TGGGGGAGGT CCTCGGCGAG ATTGTAAAAC TTAAGAAGAG GTGGGGCGTC
TCAAGCGCCT TGGTGCTCCT GGACGAGGTG ACTTACCCCA GGGAGTGGTA CAGGGCCCTC
AAGTTCTACC TCGACCAGGG GCACTTTCAA AACGACGTGA TAATTGCCAC AGGCTCGGTG
AGCATGTACG CCAAGAGGGA GGTGGAGACT TTCCCAGGCC GGAGGGGGTG GGGGCGGGAC
TACGTCATGT ACCCGCTCTC TTTTAAAAAA TTCGCCGAGG TGCAAGGCGT GCCGCCTGGG
GCAGACCCCA CGGCGTGGAG GACGGAGCTC GCCGAGGCGC TTGAGCTCTA CCTCAAATGC
GGCGGCTACC CCGCCGCCGT GGTAAACTGC GCCACCTCAG GCAACCCCGG GGGCGCCGCC
GACGTGGTGA TCTCCTCCCT CGCCTTCGAC TTGGCGAGGC TTAAGAGGAG CGAGGCGTAC
GCGAAGCGCC TCCTCAAGGC GGTCTTAGAG ACGGCCCCTA GCCCCGTCTC TCTCAACGCC
TTGGCCAAGG AGGCAGAGCT CCGGTCGCAC AAGATAGCCT TCTCGTACCT CAACCTATTG
GAGTCCCTCT ACCTCCTGCG CCAGCTCTAC TACATTGACC CCTACCGCTT GGTCGAGGAC
TACAAGAAGC CTAGGAAAAT CCACCTATTA GACCCACTGG TCTACCAAGC CGCCGCCAAG
TGGACCGGGG CGAAGATTCC GCACGAGGCC GCGCTCTTAG AGGCAACCGT GGCCATGCAC
TTCGCCAGAA GCCACAGAGT GGGCTACTGG CGAGACGGCT TTGAAGTAGA CGTGGTAGTG
CCAGAGCTAG GCCTCGGGAT AGAGGTTAAG TGGGGCAAAA AGGCCGGGAT GAAGAGAGTG
GGGCAAATCG TGGCGAAGAC CCTAGACCTA GAGGAGCTGG CCCAGCTCCT TTATCAGCTC
TAA
 
Protein sequence
MKWWPELIDE ISLKPPALHF LFGPRQVGKT TLLKLLVKKL VEGGRDPRTI FYYTCEMAAD 
HRELGEVLGE IVKLKKRWGV SSALVLLDEV TYPREWYRAL KFYLDQGHFQ NDVIIATGSV
SMYAKREVET FPGRRGWGRD YVMYPLSFKK FAEVQGVPPG ADPTAWRTEL AEALELYLKC
GGYPAAVVNC ATSGNPGGAA DVVISSLAFD LARLKRSEAY AKRLLKAVLE TAPSPVSLNA
LAKEAELRSH KIAFSYLNLL ESLYLLRQLY YIDPYRLVED YKKPRKIHLL DPLVYQAAAK
WTGAKIPHEA ALLEATVAMH FARSHRVGYW RDGFEVDVVV PELGLGIEVK WGKKAGMKRV
GQIVAKTLDL EELAQLLYQL