Gene Pisl_1914 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPisl_1914 
Symbol 
ID4617277 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum islandicum DSM 4184 
KingdomArchaea 
Replicon accessionNC_008701 
Strand
Start bp1728850 
End bp1730118 
Gene Length1269 bp 
Protein Length422 aa 
Translation table11 
GC content60% 
IMG OID639785005 
Productanthranilate synthase 
Protein accessionYP_931404 
Protein GI119873397 
COG category[E] Amino acid transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG0147] Anthranilate/para-aminobenzoate synthases component I 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0888242 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones53 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGATCC CGCTGTCTAA GCTCCCACCG CCGAGAGAGC TAGCCCACGG GCTGTATCAG 
TCGGGGGAGG AGTTCGTGGC TCTTCTAGAG TCGGGCCAGG GCTTCGCAGA GAGGGCGAGG
TTCACCCTCG TGGCGTGGGG GGTGGAGAGG GCGTACGTAT CCTCTGGGCC CGATCTACAA
CAAGTGCTCT ACTTAGCACA AAGGGAGTTG AAAGCGGATG GAGGGCCCTT CGGCGGCGAT
GTGTTAATCG GGGCTCTGAC CTACGAGGCA TCGTACTACA TGGAGCCTCT GTTGCTTAGG
TATAACAAGG TAGACCGGTC TATCCCGGCG GCGTTTCTGG TTAAGCCCAG GGGCTACATC
CTCTACGACA AGATGCTGGG GAGGGGCTAC CTGAGGGGGG AAATGCCGAA GGTTTCTGTG
GAACGGAGAG AGACCAAGGT GAGGGGGCCG GTGGCCATGA CCGACCCGAA CCGCTTCAAG
AGCTGGGTGG CAGAGGGGAG GGAGAGGATC GCAGCTGGGG AGATCCTCCA AGTGGTGCTC
TCCAGGTGGG TAGACTACAG AGCGGAGGGG GACCTCTTCC CTCTGTACAA GGCGCTGGCA
GAGGAGAACC CCTCGCCGTA TATGTACTTC GTTAAATACG GCGATATCCA CTTGATTGGG
ACGTCGCCTG AGCTGTTGGT AAAGGTGCAG AGCGGCCGCG TGGAGACCCA CCCAATCGCC
GGGACTAGGC CAAGGGGCGC CACCGAGGAG GAGGATCTAG CGCTGGAGGA AGATATGCTC
AGCGACGAGA AGGAGCTAGC TGAACACATC ATGCTCGTGG ATCTGGCTAG GAACGACATC
GGGAGGGTGT GCCAGCTGGG GTCTGTCAAG GTGGAAGAGC TGTTCGCCGT GGAGAAATAC
AGCAGAGTGC AACACATAGT GTCTAGGGTC ATGGGCGTCA TGGATAGAAG GTTCACCCCT
GTCGACGCCC TCTTGGCCAC CCACCCGGCG GGCACCGTGT CCGGCGCCCC CAAGGTAAGG
GCTATGGAGA TAATCGCCGA GCTTGAGGAC GAGCCTCGGA GGTTCTACGC AGGGGCCGTG
GGCTTCATCT CGCCGTCTCT CCTCGAGTTT GCCATAGTCA TAAGGACTAT AGTGGCCATG
GGCGACTCCC TCCGTATACA AGCCGGGGCG GGGGTTGTGT ATGACTCCAC GCCCGAGCGT
GAGTTTAGAG AGACCGAGTC TAAGCTTGCA GCGCTCAGAG CCGTCGTGGA GGGTGGGCCA
TGGACTTGA
 
Protein sequence
MKIPLSKLPP PRELAHGLYQ SGEEFVALLE SGQGFAERAR FTLVAWGVER AYVSSGPDLQ 
QVLYLAQREL KADGGPFGGD VLIGALTYEA SYYMEPLLLR YNKVDRSIPA AFLVKPRGYI
LYDKMLGRGY LRGEMPKVSV ERRETKVRGP VAMTDPNRFK SWVAEGRERI AAGEILQVVL
SRWVDYRAEG DLFPLYKALA EENPSPYMYF VKYGDIHLIG TSPELLVKVQ SGRVETHPIA
GTRPRGATEE EDLALEEDML SDEKELAEHI MLVDLARNDI GRVCQLGSVK VEELFAVEKY
SRVQHIVSRV MGVMDRRFTP VDALLATHPA GTVSGAPKVR AMEIIAELED EPRRFYAGAV
GFISPSLLEF AIVIRTIVAM GDSLRIQAGA GVVYDSTPER EFRETESKLA ALRAVVEGGP
WT