Gene Pars_1402 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPars_1402 
Symbol 
ID5054224 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum arsenaticum DSM 13514 
KingdomArchaea 
Replicon accessionNC_009376 
Strand
Start bp1265188 
End bp1266663 
Gene Length1476 bp 
Protein Length491 aa 
Translation table11 
GC content43% 
IMG OID640468945 
Producttype I phosphodiesterase/nucleotide pyrophosphatase 
Protein accessionYP_001153614 
Protein GI145591612 
COG category[R] General function prediction only 
COG ID[COG1524] Uncharacterized proteins of the AP superfamily 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value0.556587 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCTACTGG TTGTTGTTTA TTCTATTATA TTCCTCGCTT TGTCTATTGT TGCATACAAC 
ACCTGGCTTC AATACAGAGG AGTGAATTTA CCGTATTCTT CGCCTATTGA GACTGGCGGA
GAAACTCCGG ACTTTATAAT AGTCGTCTTG CTTGATGGTG CTTCTTCACC AGTAGTTCAA
AGCTATATAC GGAGTGCAGA GACTTCTGTG CTTTTAGACA TGGGGCTGTT TTTGCCAAAT
GGGCGTTCGG TTTATCCATC TTATTCAGGC CCCAGCAGGG CGTCAATATT AACAGGAGTC
CCGCCTGCTG TTCATGGAGT TGTTTCTAAT GAGGGGGCTT TTAAAGTAAA AATCACTGGG
CTAATCGACT TAGCTAGGGA GAAAGGGTTT AAGATAATTA ACATCGGCGA CGGTTTGATA
GAGACGATAT TCGGCGTAAA GGCGGTGGCG ATAGACGAAG GCGCCGGCCA GGGGAGTTTG
GCGCTGAAAA AAGCCGTAGA GGTGCTTAGG GCTAATCTTT CCAACGGCTC CAAGGTTTTT
ATCTGGGTTA CGGTAAATGA TGTAGATGTA ATTGGGCATA AGGCAGGTGG TTTTTCAAAG
GAGTACAACG CGACAGTTAA AAATTACCTC ATATTAATTG CCGGCTTTAT TTCAGAGATC
TCAGATGTTT TAAATAGAGG CGTTGTGGTA GTGCTTAGTG ACCACGGTTT TAAAAAAGGT
GGTCACCACG GCGGGGGAGA GGACACAGTT ATGAACACTT TTATGTTCAT AGCAGGTAGA
GGCATTGCCC CAGGGGTGTG TTATGAAGAA TTTTTGCTAA TAGACATCGC TCCGAGTCTG
GGAATACTCA CCAGCATTGG CGTACCCCCC TACTCGATGG GCAAAGCGCT TGCCACATGT
CTAGGCATAA ATCCAACACC TGCAGAGATG AAGAGAAAGG AGGTGTATAA CCTATTAGGA
ACGAGGGAAA CCGTGTCCAT GTTTACTGAT CAATTGTGGA TTAGGTTGTT TATAATCACG
GCGTTGTTTA TACCTCTTGT GCTGGAGGTT AGGAGGATTG GGTTAAAACC TCTTACGCTG
GGCGTCGCGT TTCTTGTTAT CTACATTGTA TATTACGTTT ATAGTGTTAG AGTTTACACA
TTTTCAGATA TTTACTCATT TACAGAGGTA ATGACTAAAA TTATTGTGGC AGTCGTAGTT
GTTTCGTTTT TAACTGGTTT ATTTGCCGCT AGATTTTACC CAACTCGTGG GGAGGTTGCT
AGAGGTTTAA TAGGGGCTTA CCTATTTATC ATCACTGTAG TTTTTATCGG CGTATCTACG
TTCTTAGTAC CATATGGCCC AGTTGTTGTT TTTCCAAATC CTGATTGGGA TTTCGCGGTG
AGGTACTTCG CAATGTTGAT CACAGGAAGC TTTTCAGGAC TTGTCGGCAT GCCCATTGCG
TTAGTTACAG CTATTTTGAT ACAGAAAAAT CGCTAA
 
Protein sequence
MLLVVVYSII FLALSIVAYN TWLQYRGVNL PYSSPIETGG ETPDFIIVVL LDGASSPVVQ 
SYIRSAETSV LLDMGLFLPN GRSVYPSYSG PSRASILTGV PPAVHGVVSN EGAFKVKITG
LIDLAREKGF KIINIGDGLI ETIFGVKAVA IDEGAGQGSL ALKKAVEVLR ANLSNGSKVF
IWVTVNDVDV IGHKAGGFSK EYNATVKNYL ILIAGFISEI SDVLNRGVVV VLSDHGFKKG
GHHGGGEDTV MNTFMFIAGR GIAPGVCYEE FLLIDIAPSL GILTSIGVPP YSMGKALATC
LGINPTPAEM KRKEVYNLLG TRETVSMFTD QLWIRLFIIT ALFIPLVLEV RRIGLKPLTL
GVAFLVIYIV YYVYSVRVYT FSDIYSFTEV MTKIIVAVVV VSFLTGLFAA RFYPTRGEVA
RGLIGAYLFI ITVVFIGVST FLVPYGPVVV FPNPDWDFAV RYFAMLITGS FSGLVGMPIA
LVTAILIQKN R