Gene Pisl_1934 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPisl_1934 
Symbol 
ID4617555 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum islandicum DSM 4184 
KingdomArchaea 
Replicon accessionNC_008701 
Strand
Start bp1746646 
End bp1748190 
Gene Length1545 bp 
Protein Length514 aa 
Translation table11 
GC content66% 
IMG OID639785025 
Productcarbohydrate kinase, YjeF related protein 
Protein accessionYP_931424 
Protein GI119873417 
COG category[G] Carbohydrate transport and metabolism
[S] Function unknown 
COG ID[COG0062] Uncharacterized conserved protein
[COG0063] Predicted sugar kinase 
TIGRFAM ID[TIGR00196] yjeF C-terminal region, hydroxyethylthiazole kinase-related
[TIGR00197] yjeF N-terminal region 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.0000050145 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGTTTTTAT CTGTGTGTAT TGTAGACGAT ATGGAGGCTA TCTCCTCTCT TGAGATGTAT 
GTGGTGGATA GGAACGCGGA GTGGCTTGGG GTGCCGCGGC TTGTGTTGAT GGAGAACGCG
GGGGCGGCTG TGGCGAGGAA TGTCGTTGGT AGGTTTCCGG GGGCGAGGAG GGTTTTGGTG
GTGTGTGGGA CGGGGGACAA CGGGGGGGAT GGCTACGTGG CGGCTAGGCA TCTCCACGGG
GCGGGGCTGT GGGTGAGGGT GGTGGGGCTT GGGGAGCCTA GGGAGGAGTT GGCTAGGGCG
AATTTCGAGG CTGTGAGGAG GCTGTGGGGG GTGGAGCTGG CGCTGGCGGC GACTCCCCTG
GAGCTTCTTG CACTTCAGGA CTGGTTCCTC TGGGCGGATG TAATTATAGA CGCAGTGTTG
GGGACTGGGG TGAGGGGGGC GCTTAGGGAG CCGCACGCCA CCGCGGTCGA CCTCATGAAC
GCGGCTCCTG CGCCGAAGGT GGCTGTGGAC GTCCCCAGCG GGCTGGACCC GGACACGGGG
GAGGTGCGGG ACAAGGCGGT TAGGGCCGCC CTCACCGTCA CTTTCCACAA GCCAAAGAGG
GGCCTCCTGG CTGAGGGGGC GCGGAGGTAC GTGGGGGAGC TGGTGGTGGA GCCCATCGGC
ATCCCGCCGG AGGCTGAGGT GGTAGTTGGG CCGGGGGACT TCGCCTATCT AGACTTCTCC
CGGCGGGCCG ACGCGAAGAA GGGAGACCAC GGCCGGGTGC TGGTGGTGGG TGGGTCCCTC
GAGTACTCGG GGGCGCCTAT GTACGTGGCG CTGGCCGCGC TGAGGTCTGG CGTGGATCTG
GCGGTTATCG CGGCGCCTGA GCCGGCGGCG CAGGCGGCTA AGGCCTACAG CCCAGACATA
ATCGCCGTCC CGCTGGAGGG GCCTAGGCTC TCCCTACGCC ACGTGGAGAA GGTGCTGAGG
CTGGCGGAGA AGTTCGACGT GGTGGCCATC GGCCCGGGGC TGGGGCTGGA GGGCGAGACC
CCCGACGCGG TTAAAGAAAT AGCCGCGCGG GTCAAAAAAC CGCTTGTCGT CGACGCAGAC
GCCATAAAAG CCCTCGGGGG GTCGCCGGTG GGGGGGCCCC AGGTGGTGTA CACCCCACAC
GCGGGGGAGT TCAAAGCGCT GACAGGCGTA GAGCCGCCGA GGGGGCTAAG GGAGAGGGCC
GAGGCGGTGA GGGAGTGGGC GGGGAGGATC GGCGCTGTCA TACTACTCAA GGGCAGATAC
GACGTGGCGT CAGACGGGAG GCGGGTCAAG ATAAACACCA CCGGCACCCC CGCCATGACC
GTCGGCGGGA CAGGCGACGT ACTCACAGGC CTCACCGCTG CGTTTATGAC CAAGACACGT
GACCCCCTAG AGGCCGCGGC CGTGGCGGCC TTCGTCAACG GGCTAGCCGG CGAGGAGGCC
GCCGCTCAGC TAGGCTTCCA CATCACCGCC AGCGACCTCA TAGAGAAGAT CCCAAGCGTC
GTCAGGAGAT ATGCGCGAGA AGACATAACC AGCCCCCGGC CATAG
 
Protein sequence
MFLSVCIVDD MEAISSLEMY VVDRNAEWLG VPRLVLMENA GAAVARNVVG RFPGARRVLV 
VCGTGDNGGD GYVAARHLHG AGLWVRVVGL GEPREELARA NFEAVRRLWG VELALAATPL
ELLALQDWFL WADVIIDAVL GTGVRGALRE PHATAVDLMN AAPAPKVAVD VPSGLDPDTG
EVRDKAVRAA LTVTFHKPKR GLLAEGARRY VGELVVEPIG IPPEAEVVVG PGDFAYLDFS
RRADAKKGDH GRVLVVGGSL EYSGAPMYVA LAALRSGVDL AVIAAPEPAA QAAKAYSPDI
IAVPLEGPRL SLRHVEKVLR LAEKFDVVAI GPGLGLEGET PDAVKEIAAR VKKPLVVDAD
AIKALGGSPV GGPQVVYTPH AGEFKALTGV EPPRGLRERA EAVREWAGRI GAVILLKGRY
DVASDGRRVK INTTGTPAMT VGGTGDVLTG LTAAFMTKTR DPLEAAAVAA FVNGLAGEEA
AAQLGFHITA SDLIEKIPSV VRRYAREDIT SPRP