Gene Pisl_1965 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPisl_1965 
Symbol 
ID4617743 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum islandicum DSM 4184 
KingdomArchaea 
Replicon accessionNC_008701 
Strand
Start bp1781636 
End bp1782772 
Gene Length1137 bp 
Protein Length378 aa 
Translation table11 
GC content53% 
IMG OID639785056 
Productamidohydrolase 
Protein accessionYP_931455 
Protein GI119873448 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0044] Dihydroorotase and related cyclic amidohydrolases 
TIGRFAM ID[TIGR00857] dihydroorotase, multifunctional complex type 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0906256 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.000000000345538 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGTGTATAA GAGTCGAAGG CAGAGCGTAT ATAGGTGGAC ATTTCGTCCG GATAAGACTG 
GGGAGGGAGG GGTGTAAGAC TGTACAGTTT TCAAACGACT ACATAATACT GCCCGGGATG
GTCGATATAC ACGTCCACTT CCGCGACTGG GAGTTGGCAC ATAAGGAGAC TTTAGAAGGC
GGAGCCGCGG CGGCGTTGGC AGGAGGCGTA GTCGCCGTGG GCGATATGCC AAATACAAAA
CCCCATATTA GAACTGCAGA GCTCTACAAG AAGAGGCTAG GGGAGGGGGC GCGGTTGCCC
ATAGTATACA GAGTGCATAT GGGAGTGCCT GTGGACCTTA GAGAGTTAGA TATAGCAAGG
CCTCCTACTG TGAAGGTTTA CCCAGAGGAT GTAGCAGAGT TCGGCTGGGG GCATATAGAG
GCGCTCGCGA GGAGATGTGC CGGCCTTGGA TGTACATTAA TCTTCCACTG TGAGGACCCG
GCTTATTTTA AAGACGGCGA GAGGCCGCCG GAGGCTGAGA TGGCGTGTGT TGAAAAGGCG
AGACGTCTCG CCTGGGATAC AGAGGCCAGA GTCCATCTGA CGCATGTCTC TCTACCTCAG
ACTGTCGATA TAGCCAGGGG CTGGGCCACG GTAGACGTAA CTCCACACCA TCTATTTCTA
GACAGAGAGA ACTGTAAGCT AGGCGGTTTA TGTCTAGTCA ACCCGAGGCT TAGGGAGCCG
GGACTTAGAA AACTACTCCT CGCCCGTCTC GCCGCCGGGC TTGTGGATAT ATACGCCACA
GACCACGCCC CGCATACGCC GGAGGAGAAA AAGTCAGACA GCCCGCCGCC GGGCATATGT
AGCTTAGACA TAGCGCTCAG CTTGTTGCTT TCTCTCTGGA AGAGGGGGGT GTTAACACTA
GACGACGTCG TACGGCTATA TTCCCACAGA CCCGCGCGTT TCCTCAATGT AAACAACGAT
ATTATAGGCG GCGTATTTAC AATTATCAAG CTCGAGGAGT TTACAGTAAG GGGGGAGGAA
TTTGCCGGCA GGTGTAAATA TACGCCGTTT GAGGGGTTTA GAGCATTTGG CGTAGTCGTC
GCCACTGCAG TTGGCGGAAA AATCTTCTTT AGAAATGGCG AAGTGTACGA CGTTTAG
 
Protein sequence
MCIRVEGRAY IGGHFVRIRL GREGCKTVQF SNDYIILPGM VDIHVHFRDW ELAHKETLEG 
GAAAALAGGV VAVGDMPNTK PHIRTAELYK KRLGEGARLP IVYRVHMGVP VDLRELDIAR
PPTVKVYPED VAEFGWGHIE ALARRCAGLG CTLIFHCEDP AYFKDGERPP EAEMACVEKA
RRLAWDTEAR VHLTHVSLPQ TVDIARGWAT VDVTPHHLFL DRENCKLGGL CLVNPRLREP
GLRKLLLARL AAGLVDIYAT DHAPHTPEEK KSDSPPPGIC SLDIALSLLL SLWKRGVLTL
DDVVRLYSHR PARFLNVNND IIGGVFTIIK LEEFTVRGEE FAGRCKYTPF EGFRAFGVVV
ATAVGGKIFF RNGEVYDV