Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Pars_0344 |
Symbol | |
ID | 5054404 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | + |
Start bp | 297553 |
End bp | 298806 |
Gene Length | 1254 bp |
Protein Length | 417 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 640467919 |
Product | type I phosphodiesterase/nucleotide pyrophosphatase |
Protein accession | YP_001152606 |
Protein GI | 145590604 |
COG category | [S] Function unknown |
COG ID | [COG3379] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.828514 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 47 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGACTAAGC TCTTCATATT TGCCCTAGAC GGCGTCCCCT ACGAGGTCTT CGAAGCCATG AGAGACGACC TGCCCAACAT CGCAGAGGTT GCCGACCGGG GATCCAGGGG GGTCATGAGG GCCGTAGACC CGCCCATAAC AGTGCCGGCA TGGGCCTCCA TGTTCACGGG AAAAGACCCC GGCGAGCTCG GCATATACGG CTTCCGCCAC CTCAACAAAC AGACGAGAAG GAGCTACATC GTCACGTCCC GCGAACTAAG AGAGCCCTAC ATCTGGGAGA GGCGGGGGCT ACGCTCAGTG GTCATTGGGC TACCTCCGGG CTACCCGCCG AGGCAGTACG GCGTCTGGAT ATCCGACTTC ATGACCCCCG AAGGGAGGTC CTGGACCCAC CCCCCAGAGC TGGCAAACAC GATCGGGCGC TACATCTTCG ATATTGAATA CCGCACAGAC CGCAAAGACG AGGCCTTCAA GGCGCTCGTT GAGATGACCC AGCTCAGATT CGCCGCCGCC GAGAAGCTAA TGGAATGGGC AAACTGGGAC GTCTTCGTCC TCCACGAAAT CGGCACAGAC CGCGTCCACC ACCTCTTCCA GAAGTACTGG GATCCAGAGC ACCCCATGTA CCAGCCGGGG AACCCCCACG AGGACAAGAT ACCGCGGTAC TACAAGCTCA TAGACCAATT AGCGGGTAAG CTACTCAAGA AAATTCCCAA AGACGCCGAG ATCTTGATAA TCTCCGATCA CGGGAACCAA GCCCAGAGAG GCGTCCTCGC CGTGAATCAA ATACTAGCCG AGTGGGGGCT TGTGGAGTAC AAAGCAGAGC CACGCCGAGG AGCCGACATA GACGAGGTGG TGGACTGGGA ACGCTCCAAG GCCTTCGCCT GGGGCGGCTA CTACGCCCGC GTCTTCGTCA CGGCTCGCGG CGAAGAGGCA CACGACGTGA AGAAGGAGCT CAAGAAAAAG CTCAGGACGC TAAAGGCGCC GTGGGGATAT ATACAAAACG CCGTTTACGA GCCCCGGGAG CTCTACAGAG AGGTGAAAGG CGATGCCCCC GACCTCATGA TCTACTTCGA CTCGGTGCGG GTCCGGCCGG TGCAGACAGT TGGATACGAA TCGCCGTGGC TGGAGGGCAA CGACCGGGGG CCCGACGACT CGCTACACAG CTTCAACGGC TTCTACGCCG CGACTTGGGG CGACAGGAGG AAGAAGGATT TACACGCCCT AGACGTGGCG AGCTTCGTCG AGGCGGCGCT ATGA
|
Protein sequence | MTKLFIFALD GVPYEVFEAM RDDLPNIAEV ADRGSRGVMR AVDPPITVPA WASMFTGKDP GELGIYGFRH LNKQTRRSYI VTSRELREPY IWERRGLRSV VIGLPPGYPP RQYGVWISDF MTPEGRSWTH PPELANTIGR YIFDIEYRTD RKDEAFKALV EMTQLRFAAA EKLMEWANWD VFVLHEIGTD RVHHLFQKYW DPEHPMYQPG NPHEDKIPRY YKLIDQLAGK LLKKIPKDAE ILIISDHGNQ AQRGVLAVNQ ILAEWGLVEY KAEPRRGADI DEVVDWERSK AFAWGGYYAR VFVTARGEEA HDVKKELKKK LRTLKAPWGY IQNAVYEPRE LYREVKGDAP DLMIYFDSVR VRPVQTVGYE SPWLEGNDRG PDDSLHSFNG FYAATWGDRR KKDLHALDVA SFVEAAL
|
| |