Gene Pcal_0891 details

Gene Information

Locus tag: Pcal_0891
Symbol:
ID: 4909036
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pyrobaculum calidifontis JCM 11548
Kingdom: Archaea
Replicon accession: NC_009073
Strand:
Start bp: 848428
End bp: 849417
Gene Length: 990 bp
Protein Length: 329 aa
Translation table: 11
GC content: 62%
IMG OID: 640124640
Product: 3-deoxy-7-phosphoheptulonate synthase
Protein accession: YP_001055783
Protein GI: 126459505
COG category: [E] Amino acid transport and metabolism
COG ID: [COG2876] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase
TIGRFAM ID: [TIGR01361] phospho-2-dehydro-3-deoxyheptonate aldolase
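
The gene and protein lengths in this record can be cross-checked against the start and end coordinates. The sketch below assumes the coordinates are 1-based and inclusive, and that the CDS ends in a single stop codon; the function names are hypothetical and only serve this illustration.

```python
# Values copied from the record above.
START_BP = 848428
END_BP = 849417


def gene_length(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Number of codons in the CDS minus the terminal stop codon (assumed)."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 990
print(protein_length(length_bp))  # 329
```

Both results agree with the Gene Length (990 bp) and Protein Length (329 aa) fields.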


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 35
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTTTATA TAGTCGATGG GCCTGAGAGG GGGAAGGCCT TGAGGGAGGA GCTGGAGTCG 
AGGGGGGTAC CGGCTTGGTA TATAGAGCTC TGGGGGCACT ACATAGTTGC CACGCCGCCG
GGGGCGAGGA CCGAGGTCAA GACCCCCGTC AAGGCCGTCG TGGAGCTTAA GACCGACTAC
CAGCTAGTGT CGCGGGAGTG GAAGAGGGAT CCAACGCCCG TGTTTATAGG GGATAGAGAA
GTGCGGGAGG GCAAGGTCTT CATTATTGCG GGGCCTTGCT CAGTCGAGGG AGAGGAGCAG
ATTATCTCTA CGGCGCTGGC CGTCAAAGAG GCTGGGGCAC ATGCGCTGAG GGGAGGGGCC
TTCAAGCCGC GGACGAGCCC CTACGCCTTC CAAGGCCTGG GGGAGGCGGG GCTTAAGCTC
TTGGCTAAGG CTAGGGAGGC CACTGGGCTC CCGGTGACCA CCGAGCTGAT GGACCCAGAG
GACCTCCCGC TGGTGGCCAA GTACGCCGAC GCGATACAAG TGGGGGCCAG GAATATGCAG
AATTTTACGT TGTTAAAGAA GCTGGGGAGG GCGGGGAAGC CCATACTGCT CAAGAGGGGG
TTTGGCAACA CTGTGGAGGA GTGGCTACTG GCGGGGGAGT ACGTGGCTCT CCACGGGAAT
GGGGGCGTGG TGTTCGTGGA GAGGGGGATT AGGACGTTTG ACCGCACTCT GCGTTTTACC
CTGGACGTGG GGGCCATCGC CTACGTAAAA CAACACACTC ACTTGCCTGT GATAGGCGAC
CCGAGCCACC CCGCCGGCGA CCGGCGCTAC GTCATTCCGC TCGCCTTGGC CATACTGGCG
GCGGGGGCAG ACGGCCTAAT CGTCGAGGTG CACCCAGACC CAGACAAGGC GTGGAGCGAC
GCCAAACAAC AACTCACCTT TGACCAGTTT AGGGAGCTTA TGGCTAAGGC GAGGGAGCTG
GCCCGGGCTC TTGGGAAAGA GTTCCCGTAG
 
Protein sequence
MLYIVDGPER GKALREELES RGVPAWYIEL WGHYIVATPP GARTEVKTPV KAVVELKTDY 
QLVSREWKRD PTPVFIGDRE VREGKVFIIA GPCSVEGEEQ IISTALAVKE AGAHALRGGA
FKPRTSPYAF QGLGEAGLKL LAKAREATGL PVTTELMDPE DLPLVAKYAD AIQVGARNMQ
NFTLLKKLGR AGKPILLKRG FGNTVEEWLL AGEYVALHGN GGVVFVERGI RTFDRTLRFT
LDVGAIAYVK QHTHLPVIGD PSHPAGDRRY VIPLALAILA AGADGLIVEV HPDPDKAWSD
AKQQLTFDQF RELMAKAREL ARALGKEFP
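
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below is a minimal illustration, not the database's own pipeline. It assumes that translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code (the two differ only in which codons may act as start codons). The helper names `clean`, `gc_content`, and `translate` are hypothetical.

```python
# Standard codon assignments, written in the compact TCAG ordering.
# Assumption: table 11 shares these assignments for elongation codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to lay out the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three 10-base blocks of the gene sequence above.
first_blocks = "ATGCTTTATA TAGTCGATGG GCCTGAGAGG"
print(translate(first_blocks))  # MLYIVDGPER
```

The output matches the first ten residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_content` allows the same comparison against the complete protein sequence and the GC content field.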