Gene Pisl_1708 details


Gene Information

Locus tag: Pisl_1708
Symbol:
ID: 4616893
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pyrobaculum islandicum DSM 4184
Kingdom: Archaea
Replicon accession: NC_008701
Strand:
Start bp: 1544733
End bp: 1545803
Gene Length: 1071 bp
Protein Length: 356 aa
Translation table: 11
GC content: 40%
IMG OID: 639784790
Product: glycoside hydrolase family protein
Protein accession: YP_931202
Protein GI: 119873195
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2723] Beta-glucosidase/6-phospho-beta-glucosidase/beta-galactosidase
TIGRFAM ID:
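
The gene and protein lengths listed above follow from the start and end coordinates. A minimal sketch of that arithmetic, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein (both are assumptions, though they are consistent with the listed values):

```python
# Values copied from the Start bp and End bp fields of this record.
start_bp = 1544733
end_bp = 1545803

# Assumption: coordinates are 1-based and inclusive,
# so the span covers end - start + 1 bases.
gene_length = end_bp - start_bp + 1

# Assumption: the gene is unspliced and its last codon is a stop codon,
# which does not contribute an amino acid.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1071 356
```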


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.913199
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 37
Fosmid unclonability p-value: 0.0276874
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCAGATAG GCGCGGCCGT ATCTCCATAT CAACACTTCG GGTTTTGTAA ATGCGATATG 
CTTGACGAGC CTGGCGCATA TCACATACTT TTTTATGAGG AAGATTTCGA TATCGCCAAG
GCAGTGGGCC TAGATGTATT TAGAACAGGA ATTGAATGGG CTATAATAGA GCCTAGAGAG
GGCTATTACG ACAAAGAAGC TCTCAAGCTT TTTAACGAAT ATCTATCGTC TATCAAGAGA
CGCGGTATAA AGACTTGGGT TACGTTACAC CATTTTACAA ATCCTAGATG GGTGTGGAAA
TATGGCGGTT GGGAGTCTAA AGACGTGACA AGGAGATTTT TGTCATATGT AGATTATGTT
GCAAGAGAGC TTGGAGGTCT AATCGACGTA GCTTTAATAT TTAACGAGCC AAGTATGTAC
ACATTTCTCG CATACATTAG AGGCGACTTG CCACCGTATG GTTTCATGTC GCTTAAACAT
ATGAGAAGGG CACTATCAAA CATAAATGAG ACTATTCTCA TGGCTAGAGA CATATTAAAA
AACTATGGCG TAGTAAAATC TTTTACACAT TCATTTACAA AGTTTGAGTC TAAAAATGCT
ATATTTAAAC CGATTATCTA TTTTATAAAT AGGTTAAACT CAAAATACTT AGCAATGTTT
AAAGAAATGG ATTATACATC TATAAATTTT TATGTCGTAG GTAGATATGA AGATTTTTCA
ATGCGCTTCC TATACAGACC TAAGAGTTTG TTAGAAATAA AACCGCCCAC GCCTCTCGCA
GTGACAGAGT TTGGAATAGC CACAAGAGAT GAAGAGCTTA GGTATAGATA CCTCTGCTCT
ATGGCACACG TATTTAAAGA AGTAAAGCCT ATTGTTGCAA TTTGGTGGAG TTTTTTGCAT
GGCTATGAAT GGGGACTAGG ATATCAGCCT TTTTTCGCGC TTGTTGATAT AAAAGGCACT
AGACGTATAT TAACACGGTT AGCTAAGGTC TTTAGGACGA CTCTGGAGAA TCCCCCCCGT
TGCGAGTTTG TGGAGAGAGA CGCCGGGCTT GAATGGCGTT GGCACCTATA G
 
Protein sequence
MQIGAAVSPY QHFGFCKCDM LDEPGAYHIL FYEEDFDIAK AVGLDVFRTG IEWAIIEPRE 
GYYDKEALKL FNEYLSSIKR RGIKTWVTLH HFTNPRWVWK YGGWESKDVT RRFLSYVDYV
ARELGGLIDV ALIFNEPSMY TFLAYIRGDL PPYGFMSLKH MRRALSNINE TILMARDILK
NYGVVKSFTH SFTKFESKNA IFKPIIYFIN RLNSKYLAMF KEMDYTSINF YVVGRYEDFS
MRFLYRPKSL LEIKPPTPLA VTEFGIATRD EELRYRYLCS MAHVFKEVKP IVAIWWSFLH
GYEWGLGYQP FFALVDIKGT RRILTRLAKV FRTTLENPPR CEFVERDAGL EWRWHL
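
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below is one way to do that in plain Python. The function name `translate_cds` and the constant names are hypothetical. The codon table is the standard genetic code, whose amino acid assignments translation table 11 shares. The set of alternative start codons is deliberately reduced to three common ones, which is a simplification of this sketch and not the full table 11 list.

```python
from itertools import product

# Standard genetic code in the conventional T, C, A, G codon ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}

# Simplification: only three common bacterial/archaeal start codons are handled.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence; spaces and line breaks are ignored."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    codons = [seq[i:i + 3] for i in range(0, len(seq), 3)]
    protein = [CODON_TABLE[c] for c in codons]
    # An alternative start codon such as GTG is read as methionine
    # when it is the first codon of the sequence.
    if codons and codons[0] in START_CODONS:
        protein[0] = "M"
    # Drop a trailing stop codon so the result matches the listed protein.
    if protein and protein[-1] == "*":
        protein.pop()
    return "".join(protein)


# First line of the gene sequence above (60 bases, 20 codons).
first_line = ("GTGCAGATAG GCGCGGCCGT ATCTCCATAT "
              "CAACACTTCG GGTTTTGTAA ATGCGATATG")
print(translate_cds(first_line))  # MQIGAAVSPYQHFGFCKCDM
```

The printed fragment matches the first 20 residues of the protein sequence, including the methionine read from the GTG start codon. Passing the full gene sequence to the same function allows the whole 356-residue protein to be compared in the same way.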