Gene Pars_0344 details


Gene Information

Locus tag: Pars_0344
Symbol:
ID: 5054404
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pyrobaculum arsenaticum DSM 13514
Kingdom: Archaea
Replicon accession: NC_009376
Strand:
Start bp: 297553
End bp: 298806
Gene Length: 1254 bp
Protein Length: 417 aa
Translation table: 11
GC content: 60%
IMG OID: 640467919
Product: type I phosphodiesterase/nucleotide pyrophosphatase
Protein accession: YP_001152606
Protein GI: 145590604
COG category: [S] Function unknown
COG ID: [COG3379] Uncharacterized conserved protein
TIGRFAM ID:
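
The coordinate and length fields above are related by simple arithmetic, which the following minimal sketch checks. It assumes that the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and used only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a CDS, assuming its last codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(297553, 298806)
    print(length)                  # 1254
    print(protein_length(length))  # 417
```

Both values agree with the Gene Length and Protein Length fields of this record.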


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.828514
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 47
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGACTAAGC TCTTCATATT TGCCCTAGAC GGCGTCCCCT ACGAGGTCTT CGAAGCCATG 
AGAGACGACC TGCCCAACAT CGCAGAGGTT GCCGACCGGG GATCCAGGGG GGTCATGAGG
GCCGTAGACC CGCCCATAAC AGTGCCGGCA TGGGCCTCCA TGTTCACGGG AAAAGACCCC
GGCGAGCTCG GCATATACGG CTTCCGCCAC CTCAACAAAC AGACGAGAAG GAGCTACATC
GTCACGTCCC GCGAACTAAG AGAGCCCTAC ATCTGGGAGA GGCGGGGGCT ACGCTCAGTG
GTCATTGGGC TACCTCCGGG CTACCCGCCG AGGCAGTACG GCGTCTGGAT ATCCGACTTC
ATGACCCCCG AAGGGAGGTC CTGGACCCAC CCCCCAGAGC TGGCAAACAC GATCGGGCGC
TACATCTTCG ATATTGAATA CCGCACAGAC CGCAAAGACG AGGCCTTCAA GGCGCTCGTT
GAGATGACCC AGCTCAGATT CGCCGCCGCC GAGAAGCTAA TGGAATGGGC AAACTGGGAC
GTCTTCGTCC TCCACGAAAT CGGCACAGAC CGCGTCCACC ACCTCTTCCA GAAGTACTGG
GATCCAGAGC ACCCCATGTA CCAGCCGGGG AACCCCCACG AGGACAAGAT ACCGCGGTAC
TACAAGCTCA TAGACCAATT AGCGGGTAAG CTACTCAAGA AAATTCCCAA AGACGCCGAG
ATCTTGATAA TCTCCGATCA CGGGAACCAA GCCCAGAGAG GCGTCCTCGC CGTGAATCAA
ATACTAGCCG AGTGGGGGCT TGTGGAGTAC AAAGCAGAGC CACGCCGAGG AGCCGACATA
GACGAGGTGG TGGACTGGGA ACGCTCCAAG GCCTTCGCCT GGGGCGGCTA CTACGCCCGC
GTCTTCGTCA CGGCTCGCGG CGAAGAGGCA CACGACGTGA AGAAGGAGCT CAAGAAAAAG
CTCAGGACGC TAAAGGCGCC GTGGGGATAT ATACAAAACG CCGTTTACGA GCCCCGGGAG
CTCTACAGAG AGGTGAAAGG CGATGCCCCC GACCTCATGA TCTACTTCGA CTCGGTGCGG
GTCCGGCCGG TGCAGACAGT TGGATACGAA TCGCCGTGGC TGGAGGGCAA CGACCGGGGG
CCCGACGACT CGCTACACAG CTTCAACGGC TTCTACGCCG CGACTTGGGG CGACAGGAGG
AAGAAGGATT TACACGCCCT AGACGTGGCG AGCTTCGTCG AGGCGGCGCT ATGA
 
Protein sequence
MTKLFIFALD GVPYEVFEAM RDDLPNIAEV ADRGSRGVMR AVDPPITVPA WASMFTGKDP 
GELGIYGFRH LNKQTRRSYI VTSRELREPY IWERRGLRSV VIGLPPGYPP RQYGVWISDF
MTPEGRSWTH PPELANTIGR YIFDIEYRTD RKDEAFKALV EMTQLRFAAA EKLMEWANWD
VFVLHEIGTD RVHHLFQKYW DPEHPMYQPG NPHEDKIPRY YKLIDQLAGK LLKKIPKDAE
ILIISDHGNQ AQRGVLAVNQ ILAEWGLVEY KAEPRRGADI DEVVDWERSK AFAWGGYYAR
VFVTARGEEA HDVKKELKKK LRTLKAPWGY IQNAVYEPRE LYREVKGDAP DLMIYFDSVR
VRPVQTVGYE SPWLEGNDRG PDDSLHSFNG FYAATWGDRR KKDLHALDVA SFVEAAL
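
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below is a minimal illustration, not the pipeline used to build this record. It assumes that translation table 11 assigns the same amino acids to codons as the standard code and differs in the set of permitted start codons, so that the initial GTG of this gene is read as M. The `START_CODONS` set is a deliberately small subset chosen for illustration, and the names `CODON_TABLE`, `START_CODONS`, and `translate_cds` are hypothetical.

```python
from itertools import product

# Codon table built in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}

# Assumption: a small subset of the start codons allowed by table 11.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(dna: str) -> str:
    """Translate a CDS, ignoring whitespace and stopping at the first stop codon.

    A recognised start codon in the first position is translated as M.
    Codons containing characters other than A, C, G, T raise KeyError.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS:
            aa = "M"
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First 30 bases of the gene sequence listed above.
    print(translate_cds("GTGACTAAGC TCTTCATATT TGCCCTAGAC"))  # MTKLFIFALD
```

The ten residues printed match the start of the protein sequence listed above. Passing the full gene sequence to the same function would translate all codons up to the terminal TGA.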