Gene Pisl_1457 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPisl_1457 
Symbol 
ID4618204 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum islandicum DSM 4184 
KingdomArchaea 
Replicon accessionNC_008701 
Strand
Start bp1328272 
End bp1329342 
Gene Length1071 bp 
Protein Length356 aa 
Translation table11 
GC content64% 
IMG OID639784540 
Productglycosyl transferase family protein 
Protein accessionYP_930956 
Protein GI119872949 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1215] Glycosyltransferases, probably involved in cell wall biogenesis 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones49 
Fosmid unclonability p-value0.7433 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCTAGAGC TAGCTACTGC ACTGGCGGCG TTACATTTCG GCGCGCCAGC CCTCTATCTA 
CTCTATCTAC GTGCCGCCCC AAAGAAGCCG CTCCAGCCGG CGGCTATATA CCCCAAGGTG
GCGGTCGTCG TGCCAACGTA CAACGAAGCC CCAAACATAG AGGCTAAGCT CGAGGATATA
TACAGCCAGA GCTACCCCAG AGATAGGATG TCTATATACG TCGTCGACTC GGCCTCCACC
GACGGCACAG CCGAGGCGGC GGAGCGGTGG GCCGCCGGCA GAAAAGACGC CAAGGTTGTA
GTGCTGAGGG AGCCCGAGAG ACGGGGGAAG GCCCACGCCT TAAACACAGC CCTTGCCCAC
CTCGCCGACG AGGAAGTTGT CGTGGTTACA GACGCCGACT CCCGCTGGCT AGACCGAGAC
ACGCTGAGGA AGGCCGTGGC CTACCTCGCC GCCGCCGACG CGGTCTCCTG CCTAAAGAAG
CCGGCGGGGG GAGGCCCCAC GGAGGAGGCC TACCGCACGT GGTACAACCG GCTGAGACTC
GCCGAGAGCT TGGTCCACTC CACCCCGGTC TTCCACGGCG AACTCGCCGC CTTTAGACGG
GAGGCCATCG CCGGGGGATT CCCGGAAGAC GTCGGCGCAG ACGACAGCTA CGCCGCCATT
AGGATAGCCA TAGAGGGGGG CCGCGCCGTC ACGCCACCAG ACGTGTGGTG CATAGAGGCG
GTGCCCCAGA GGGGCTACGC CAGGTGGCGC CTAAGGCGGG CACAACACTT GATACAAACC
TTCGCGCGGG CGCTTCCAAA AGTTGCCAAG GCCCCGCCGC CCTACAGAGC AATCCTCGCC
GCCGAGGCCT ACCTACACCT GTTTAACCCA TGGCTCCTCC CAGCCGCCGC CGCCCTAGCC
GCCGCCTCCG GACCCCCCGG CCTGGCCCTC CTCGCCGCAG GCGCCGCCGC GTTGCTATAC
AAACCCTACA GAGCCTGGGT GGCGGGCCAG ATATACCTAA TGGCAGCCGC CCTGAGAAAC
ATATGGAACA AGGAACTCAT ATGGCAAAAA CAAGAAAAGC CGCTGCCGTA A
 
Protein sequence
MLELATALAA LHFGAPALYL LYLRAAPKKP LQPAAIYPKV AVVVPTYNEA PNIEAKLEDI 
YSQSYPRDRM SIYVVDSAST DGTAEAAERW AAGRKDAKVV VLREPERRGK AHALNTALAH
LADEEVVVVT DADSRWLDRD TLRKAVAYLA AADAVSCLKK PAGGGPTEEA YRTWYNRLRL
AESLVHSTPV FHGELAAFRR EAIAGGFPED VGADDSYAAI RIAIEGGRAV TPPDVWCIEA
VPQRGYARWR LRRAQHLIQT FARALPKVAK APPPYRAILA AEAYLHLFNP WLLPAAAALA
AASGPPGLAL LAAGAAALLY KPYRAWVAGQ IYLMAAALRN IWNKELIWQK QEKPLP