Gene Pisl_1789 details

Gene Information

Locus tag: Pisl_1789
Symbol:
ID: 4618206
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pyrobaculum islandicum DSM 4184
Kingdom: Archaea
Replicon accession: NC_008701
Strand:
Start bp: 1625015
End bp: 1625902
Gene Length: 888 bp
Protein Length: 295 aa
Translation table: 11
GC content: 64%
IMG OID: 639784873
Product: 3-dehydroquinate dehydratase, type I
Protein accession: YP_931281
Protein GI: 119873274
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0710] 3-dehydroquinate dehydratase
        [COG1605] Chorismate mutase
TIGRFAM ID: [TIGR01093] 3-dehydroquinate dehydratase, type I
            [TIGR01808] monofunctional chorismate mutase, high GC gram positive type
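
The coordinate and length fields above are mutually consistent, and the arithmetic can be sketched as follows. This is a minimal illustration, not part of the record: it assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count of an unspliced CDS, assuming the stop codon is included."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1  # subtract the stop codon


length = gene_length_bp(1625015, 1625902)
print(length, protein_length_aa(length))  # 888 295
```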


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 62
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGATATGCG GCGCAGTACC AGTTAGAAAG CCGACAGATA TATATAGAGC TCTGGACTCC 
CCTGTTTCCT GCCTCGAGCT AAGGCTCGAT TACCTAGAGA CCTCACTCGC CGAGGCGAAG
CCAGCGTTGG AGGAGGCGGT AGCGAGGAGG ACAGTCATAT TCACCGTCAG GAGGAGGGAG
GAGGGGGGCG TCTGGCGGGG CACAGAGGAG GAGAGGGCGG CCCTCTACCT AAAGCTCCTG
GAGCTGACGC CCCACTTCGT AGACGTGGAG GCCGCCGCGC CGGCGGCTGA GCAGGTGGCG
GCCGCCAAGG GGAGGACAAA GCTGATAGCC AGCAGACACG ACTTCGGCGG GACCCCGCCG
TATGAGACCC TCCTCTCCTG GGCCCGGGAG GCGGCAGCCT TGGGCGACGT GGTGAAGATA
GTCACCTACG CCAGAGAGCC CCGGGACGGC CTCGCCGTGC TCTCCCTAAT CGGCGCCGTG
GAGAAACCGA CGGTGGCCTT TGCCATGGGG CCGGCCGGGG CCTACACCAG GCTGGCGGCG
GCGGCCCTGG GGAGCCCCAT CATGTATGTG TCGCTGGGTG AGACGACGGC GCCTGGCCAG
ATATCGGTAG ACGCCTACTA CGCCGCGCTC CTAGGCATAG GGGCCGCCCC CCGGGGGGAG
GGTCTGCCGG CGCTGAGGGA GGCGCTGGAC TGGATAGACA GTGCCCTCAT GCACCTCCTC
AAGAGGAGGC TGGAAGTGTG CCGCGACATG GGGAAGATAA AAAAGGCCGC CGGTCTCCCT
ATATACGACG ACGTTAGAGA GACCCAGGTC TTGAAGAGGG CGGGCGACTT TAAACAGATC
TTCGAGCTGG TGGTGCAGAT GTGCAAAGCA GTGCAGCTAG TCGCCTAG
 
Protein sequence
MICGAVPVRK PTDIYRALDS PVSCLELRLD YLETSLAEAK PALEEAVARR TVIFTVRRRE 
EGGVWRGTEE ERAALYLKLL ELTPHFVDVE AAAPAAEQVA AAKGRTKLIA SRHDFGGTPP
YETLLSWARE AAALGDVVKI VTYAREPRDG LAVLSLIGAV EKPTVAFAMG PAGAYTRLAA
AALGSPIMYV SLGETTAPGQ ISVDAYYAAL LGIGAAPRGE GLPALREALD WIDSALMHLL
KRRLEVCRDM GKIKKAAGLP IYDDVRETQV LKRAGDFKQI FELVVQMCKA VQLVA
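
The gene sequence begins with TTG while the protein begins with M. That follows from translation table 11, which accepts several alternative start codons and reads an initiating codon as methionine. The sketch below illustrates the relationship between the two sequences. It is an assumption-laden example rather than the database's own method: `translate_cds` is a hypothetical helper, the codon assignments are those of NCBI table 11 (identical to the standard code for elongation), and the input is assumed to be an in-frame coding sequence containing only A, C, G and T.

```python
BASES = "TCAG"
# Amino acids for codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
# Start codons permitted by translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate an in-frame CDS; whitespace in the input is ignored."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for pos in range(0, len(seq), 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            protein.append("M")  # an initiating codon is read as methionine
            continue
        index = (16 * BASES.index(codon[0])
                 + 4 * BASES.index(codon[1])
                 + BASES.index(codon[2]))
        residue = AMINO_ACIDS[index]
        if residue == "*":  # stop codon ends translation
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
print(translate_cds("TTGATATGCG GCGCAGTACC AGTTAGAAAG"))  # MICGAVPVRK
```

Passing the full gene sequence to the same function would allow a comparison against the listed protein sequence.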