Gene Pcal_1209 details


Gene Information

Locus tag: Pcal_1209
Symbol:
ID: 4908308
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pyrobaculum calidifontis JCM 11548
Kingdom: Archaea
Replicon accession: NC_009073
Strand:
Start bp: 1120865
End bp: 1122127
Gene length: 1263 bp
Protein length: 420 aa
Translation table: 11
GC content: 65%
IMG OID: 640124963
Product: anthranilate synthase, component I
Protein accession: YP_001056100
Protein GI: 126459822
COG category: [E] Amino acid transport and metabolism; [H] Coenzyme transport and metabolism
COG ID: [COG0147] Anthranilate/para-aminobenzoate synthases component I
TIGRFAM ID: [TIGR01820] anthranilate synthase component I, archaeal clade
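
The gene and protein lengths in the record follow from the start and end coordinates. The sketch below reproduces that arithmetic. It assumes the coordinates are 1-based and inclusive and that the final codon is a stop codon; both assumptions are consistent with the values listed here but are not stated by the record itself. The function names are illustrative only.

```python
# Coordinates copied from the record (assumed 1-based, inclusive).
START_BP = 1120865
END_BP = 1122127


def gene_length(start: int, end: int) -> int:
    """Number of bases spanned by inclusive coordinates start..end."""
    return end - start + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose last codon is assumed to be a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # prints: 1263 420
```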


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value: 0.689744
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGATCC CGCTGTCTAA GCTTCCGCCT CCCAGGGACC TCGCCCACGG CCTCTACCTG 
GGGGGCGAGG AGTTTGTGGC GCTCCTGGAG TCTGGGCAGG GGTTTGTGGA GAGGTCTAGG
TACACGCTGG TGGCTTGGGG TGTGGAGAGG GAGTACGTGG CCCACGGCGG GGAGCTGTAC
AACGTATTGG ACTCGGCCTA CCGGGGCCTT GCGAGGGGGG AGGGCCCCTT CGGCGGCGAG
GTGGCAATAG GCGTGGTGGC CTACGACGCC GCAGCCTACG TGGAGCCGGT GCTGCTTAAG
TACGGGAAGG TACGGCGCCC CGTGGCCTTC TTTGTGAAGC CCAGGGGGTT TGTGCTATAC
GACAAGGCCC TTGGGAGGGC GTATGTGTAT GGCGAAGTGC CGCCGATCCG CGGCGTGGGC
GACGGCGGAG AGCTAGAAGT GAGGGGGCCC GTGGGCCAGA CGGACTCTGC GTCTTTTAAG
AGGTGGGTGG CGGAGGCCAA GAGGAGGATT GAGGCTGGGG AGGCCTTCCA GGTGGTGCTC
TCCCGCTTTG TGGACTTCGC CGCGCGGGGC GACTTGTTCA AGCTGTACCA GTCGCTGGCG
GAGCTCAACC CCTCGCCGTA TATGTATTTC CTCAAGTGGA GAGACGTGGC GGTGTTGGGC
ACCTCGCCGG AGCTCTTGGT GAAGGTGCAG GGAGACAGGG CCGAGACGCA CCCCATCGCC
GGCACTAGGC CCAGGGGGGC CACCGAGGAG GAGGACATAG CCCTGGAGGA AGAGATGCTG
AGAGATGAGA AGGAGCTAGC CGAGCACTCC ATGTTGGTGG ACTTGGCGCG GAACGACCTG
GGCCGCGTCT GCCGGCCGGG CACCGTGCGC GTGGACGAGC TGTTTGCGGT GGAGAAGTAC
AGCAGAGTTC AACACATCGT GTCCCGCGTC TCGTGCGTCG TGGAGAAGAA GTACAGCCCA
GTGGACGTCC TACTCGCCGC GCACCCCGCC GGCACAGTCT CGGGGGCGCC GAAGGTGAGG
GCCATGGAGA TAATAGCGGA GCTCGAGGAA GAGCCGCGTT GGATATACGC AGGCGCCCTG
GGCTTCTTCT CCCCCGCCCT CTCCGAGTTC GCCATCGTCA TAAGGTCCGC CGTATTCCAC
GAGGGCCTCC TCCGCATTCA GGCAGGCGCC GGGGTGGTAT ACGACTCGAC GCCGGAGCGG
GAGTTCAACG AGACTGAGGC CAAGCTCAAG GCGCTGAGGG AGGCGCTGGG GCTATGGACC
TGA
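
The GC content reported in the record can be checked against the sequence by counting G and C bases. The sketch below shows the calculation on the first 60-base row of the gene sequence only, so its result is not expected to equal the whole-gene figure of 65%. The helper names are hypothetical; pasting the full listing into the same functions would give the whole-gene value.

```python
# First row of the gene sequence, copied with its 10-base blocks intact.
FIRST_ROW = "ATGAAGATCC CGCTGTCTAA GCTTCCGCCT CCCAGGGACC TCGCCCACGG CCTCTACCTG"


def clean(blocked: str) -> str:
    """Drop the spaces and line breaks used to lay out the sequence."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


excerpt = clean(FIRST_ROW)
print(len(excerpt), f"{gc_fraction(excerpt):.1%}")
```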
 
Protein sequence
MKIPLSKLPP PRDLAHGLYL GGEEFVALLE SGQGFVERSR YTLVAWGVER EYVAHGGELY 
NVLDSAYRGL ARGEGPFGGE VAIGVVAYDA AAYVEPVLLK YGKVRRPVAF FVKPRGFVLY
DKALGRAYVY GEVPPIRGVG DGGELEVRGP VGQTDSASFK RWVAEAKRRI EAGEAFQVVL
SRFVDFAARG DLFKLYQSLA ELNPSPYMYF LKWRDVAVLG TSPELLVKVQ GDRAETHPIA
GTRPRGATEE EDIALEEEML RDEKELAEHS MLVDLARNDL GRVCRPGTVR VDELFAVEKY
SRVQHIVSRV SCVVEKKYSP VDVLLAAHPA GTVSGAPKVR AMEIIAELEE EPRWIYAGAL
GFFSPALSEF AIVIRSAVFH EGLLRIQAGA GVVYDSTPER EFNETEAKLK ALREALGLWT
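
The protein sequence corresponds to a codon-by-codon translation of the gene sequence under translation table 11. The sketch below translates the first 60 bases of the gene and reproduces the first 20 residues of the protein. It is a minimal illustration, not the database's own pipeline: the codon table is built from the standard NCBI amino-acid ordering, which table 11 shares with table 1 for elongation codons, and alternative start codons are not handled (this gene begins with ATG, so that does not matter here). All names are hypothetical.

```python
from itertools import product

# Amino acids in NCBI order: first, second, third base each cycle T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# First row of the gene sequence from the record.
FIRST_ROW = "ATGAAGATCC CGCTGTCTAA GCTTCCGCCT CCCAGGGACC TCGCCCACGG CCTCTACCTG"


def translate(blocked: str) -> str:
    """Translate a blocked DNA listing, stopping at the first stop codon."""
    seq = "".join(blocked.split()).upper()
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)


print(translate(FIRST_ROW))  # prints: MKIPLSKLPPPRDLAHGLYL
```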