Gene Pisl_2000 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPisl_2000 
Symbol 
ID4616382 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum islandicum DSM 4184 
KingdomArchaea 
Replicon accessionNC_008701 
Strand
Start bp1815767 
End bp1816822 
Gene Length1056 bp 
Protein Length351 aa 
Translation table11 
GC content57% 
IMG OID639785091 
Productamidohydrolase 
Protein accessionYP_931490 
Protein GI119873483 
COG category[F] Nucleotide transport and metabolism
[R] General function prediction only 
COG ID[COG0402] Cytosine deaminase and related metal-dependent hydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones58 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCGTCTCC GCGCGCGCTA CATCCTCACG GGAGAGCTTG AAGTTGTGGA AAACGGCGTC 
GTAGAGGTAA ACGACGAGGG CGTGGTGGCG GGGGTGGGTA AATACACTGG GGGTGTCGCC
GCCGATCTTG GCAACGTAGT GCTTATGCCT CAGCTCGTCA ACGGCCATGT ACATGTTTTA
GACGTCGCCA TGTTAGACAG AGACGATATG TATATCGACG ACTTAGTGGG GTGGCCCCAC
GGCGTGAAAT ACCACGTCGT TAAAAAACTT GTGAAGAAGG GTAAACACAT CCCACTTCTA
GAGAAGGTGG CGAAGAGAAT GAGGAGATAT GGCGTGGGGT GCGCCCTGGT ATACGCAGAA
TATGCGGCGA GAGATGTAGA AGAGGTGTTC CGGCGGTGGG GGGTAGAGAC TGTAGTCTTC
CAAGAGGCCC ATGGCGGCTT TCCAAACTAT CCCAATGTCC AAGTGGCCAC TCCCGTCGAC
CACCCCCCAG AGTACCTCCG GCAACTCAGA GCCAGGTATA AGCTAGTCTC TACCCACGTC
TCTGAGACAA AAGACTGCCA CGAAGCCGGC GATCTAGAGC TCGCGCTAAA GGTGTTAGAT
GCGGACGTTT TAATACACCT TGTATATATC ACGCCCGAGG AGGTCGCGGA GATCCCGCCG
TCAAAGACTG TCGTGGTGAA TCCCAGGGCC AACGCCTATT TCGTTGGGCG GGTGGCGCCG
GTGCCCCAGC TACTACACCT AAAGCCCCTA CTCGGCACAG ACAATGTCTT TATGAACGAA
CCAGACCCCT GGGCCGAGAT GAAGTTTCTC CACGCCTACG CCGCCGCCTC TGGCTGGAGA
CTGGGCGAGA AAGAGATACT CGCAATGGCC ACGGTCTGGG GCTGGGAAAA AATGAGGTGC
ATCCCGCCGA TTGAGCCCGG CCACAGGCTC AGGGCACTCG CCGTGGCGGC GCCATACGCA
GGAGAAAAGG TGTTGAAGTT CTTGGTGAAG AGGGCCGCCC ACACAGACCT AGTGGCATTG
GTGGAGGGCG CCTCTATAGA GCCGCCCCCC TCCTGA
 
Protein sequence
MRLRARYILT GELEVVENGV VEVNDEGVVA GVGKYTGGVA ADLGNVVLMP QLVNGHVHVL 
DVAMLDRDDM YIDDLVGWPH GVKYHVVKKL VKKGKHIPLL EKVAKRMRRY GVGCALVYAE
YAARDVEEVF RRWGVETVVF QEAHGGFPNY PNVQVATPVD HPPEYLRQLR ARYKLVSTHV
SETKDCHEAG DLELALKVLD ADVLIHLVYI TPEEVAEIPP SKTVVVNPRA NAYFVGRVAP
VPQLLHLKPL LGTDNVFMNE PDPWAEMKFL HAYAAASGWR LGEKEILAMA TVWGWEKMRC
IPPIEPGHRL RALAVAAPYA GEKVLKFLVK RAAHTDLVAL VEGASIEPPP S