Gene Pisl_1143 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPisl_1143 
Symbol 
ID4617720 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum islandicum DSM 4184 
KingdomArchaea 
Replicon accessionNC_008701 
Strand
Start bp1033508 
End bp1034983 
Gene Length1476 bp 
Protein Length491 aa 
Translation table11 
GC content44% 
IMG OID639784239 
Productcarboxypeptidase Taq 
Protein accessionYP_930657 
Protein GI119872650 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2317] Zn-dependent carboxypeptidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.880926 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.0000000000114944 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGATTAGAT CAGACACCGT TAAGCAGATA CTAGAGCATT ACAGAGTCAT ATGGGCTCTT 
AGCCATGCCC AGGGACTAAT GGGCTGGGAT TCAGAGACTT ATATGCCTGA GGAGGGCATC
AAGGGTCGTG CCGTCGCTAG GGCGGAGATT GCCCAGTTGA TACAGAAATT TATGCTTGAC
GAGAAGTTTG TAAAGTTGGT AGAAAAGGCG GAAGAGGAAA AAGATTTAAC AGATGTAGAG
AGAGGAATTA TAAGAGTCTT AAAGAGAGAT TTAAAATTCT ACCAGAGGGT GCCGCCGGAG
GTTGTAAAAG AGTTTGTCAA AGTTACGTCA GAGGCTTTTA TAGTTTGGAA AAATGCGAAA
GAAAAGGCGA AATTTGAAAT TTTTGCGCCG TATTTAGAAA AAATCGTCGA CCTTTCTAGA
GTAATAGCTG ACAAATTGGG CTATGAACAA CATCCATATG ACGCATTGCT AGATTTATAC
GAAGAGGGGC TTACGTCGCG AGATGTAGAA TCTATCTTTT CAACACTAGA GCCGGGTATT
AAGACGTTGC TCTACAAGTT AGAAAGCCGC GGCTGGCCTA AGAAACATCC GTTGGAGGAG
GTTCCCTACG AGAGACAAGC GGTTGAGGCG GCAGTTATAG ATGTTTTAAA TCTACTGGGG
TACCCCAGGG GGAGGTTCAG AGTGGATGTT TCGCCACATC CATTTACGAT AGGTATTACA
TCTCCATACG ACGTAAGAAT AACTGTGAGG TATAGAGGGG TTGATTTTAA AGAGCCGCTT
TTCTCTGCGT TACACGAATA CGGCCACGCG CTGTATGAGT TAAATATTGA CGAGTCTATC
GCCATGACGC CTGTAGGCAC GGGGGTGTCG CTTGGAGTTC ATGAGAGTCA GTCTAGGTTT
ATTGAAAACA TCGTGGGTAG AAGCCGCGAG TTTGTCTACA AAATTTCGCC AATTCTCCGC
AAACATCTCG CCTTTTTGTC AAAATACAGC GACGAGGACT TGTTCTACTA TTTCAACGTA
GTTAGGCCAA GTCTCATACG TACAGAGGCA GACGAGGTTA CCTACAACCT ACATATACTT
CTGCGTTATA AATTAGAGCG TCTCATGATA ACAGGCGAGG TAAAGATTTC TCAACTACCA
GAGTTGTGGA ATAGCGAAAT GGAACGCCTC CTCGGCGTTA GACCTAAAAA CGACGCAGAG
GGTATATTAC AAGATGTACA TTGGTCACAT GGCTCAATCG GCTACTTCCC GACTTACACA
CTGGGAAATG TAATAGCTGC GATGATATAT TACAAACATG GAAATATACG CGGCCTTATC
TCAGAGGGGG ACTTTGCCGC AGTGAAGGAG TACCTCCGCG AGAAAATACA TAGATGGGGC
AGCATCTATC CGCCAAAGGA GCTTCTTATG AAAAACTTCG GCGAGGTATA TAACGCCAGC
TATTTAGTCA AATACCTAGA GGAGAAATAC GTCTAG
 
Protein sequence
MIRSDTVKQI LEHYRVIWAL SHAQGLMGWD SETYMPEEGI KGRAVARAEI AQLIQKFMLD 
EKFVKLVEKA EEEKDLTDVE RGIIRVLKRD LKFYQRVPPE VVKEFVKVTS EAFIVWKNAK
EKAKFEIFAP YLEKIVDLSR VIADKLGYEQ HPYDALLDLY EEGLTSRDVE SIFSTLEPGI
KTLLYKLESR GWPKKHPLEE VPYERQAVEA AVIDVLNLLG YPRGRFRVDV SPHPFTIGIT
SPYDVRITVR YRGVDFKEPL FSALHEYGHA LYELNIDESI AMTPVGTGVS LGVHESQSRF
IENIVGRSRE FVYKISPILR KHLAFLSKYS DEDLFYYFNV VRPSLIRTEA DEVTYNLHIL
LRYKLERLMI TGEVKISQLP ELWNSEMERL LGVRPKNDAE GILQDVHWSH GSIGYFPTYT
LGNVIAAMIY YKHGNIRGLI SEGDFAAVKE YLREKIHRWG SIYPPKELLM KNFGEVYNAS
YLVKYLEEKY V