Gene Information |
Locus tag | Pars_0511 |
Symbol | |
ID | 5055617 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | - |
Start bp | 461843 |
End bp | 463627 |
Gene Length | 1785 bp |
Protein Length | 594 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 640468073 |
Product | hypothetical protein |
Protein accession | YP_001152758 |
Protein GI | 145590756 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0574] Phosphoenolpyruvate synthase/pyruvate phosphate dikinase |
TIGRFAM ID | |
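
The coordinate and length fields above are mutually consistent, which can be checked with a small sketch. It assumes the usual conventions of 1-based, inclusive coordinates and a gene length that includes the stop codon; the function names are illustrative and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids, assuming an unspliced CDS that ends in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # the stop codon encodes no residue


# Values taken from the record above.
length_bp = gene_length(461843, 463627)
print(length_bp)                  # 1785
print(protein_length(length_bp))  # 594
```

Both results match the "Gene Length" and "Protein Length" rows of the record.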
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.493067 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 0.202815 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTATGCCT ATTACAGACT CCCGGCAACT CTATTATTTG ACAGAGATGC CCCGAAGGAG CTAGTTGAGT GGGAGAGGAA GATGTTCTGG TATCAAGATT CGCTACACCA CTACGACGTC ATGTACCCGC TTGACGACAT CCACCCGGCC ACGTGGGAGA TCGCGCTATC TGCTTATAAC TCTAGGATAT TTGTAGTCCC GCCGGCAAAC GGCATACCTC ATAGAATCGT CAACGGCTAT CTCTTCATCT CCTTCGTGCC GGTACTGGAC GAAAAGGAAA TTCAGAAGCG CCTCGAGTAT TTCCAAAAGA GGGCGGGGTA TTACTACCAG AACTGGAACG ACATTTATGA GAAGAATTGG AAGCCAGAAG CCGAGAAGAT AGTCAAAGAA ATGGAATCCC TTGAGTTCAG AGATCTGCCG GAGGTGGAGG ACGAGAAGAT AGTTTTTGAA AGAATGGGCT ACTCGTCTTC CGCTTATCAC CTTGTTGACA ATTGGTTAAG ACTTGTAATG TTGCACCAGA GACTGTGGTG GAAACACTTC GAGATGCTGA ATCTGGGATA TGCCGCATAT CTATTATTCT ACCAGTTCAT GAAGCAGAAA TTCCCCGACA TACCCGACCA ACACATCGCC TTGATGGTGG CGGGCATAGA CGTCATCTTG TTCAAGCCGG ATCTGGAGTT GAGGAGGTTG GCCAAGCTGG CGGTGGAGCT GGGAGTCGCC GATAGGATAT TGTCCTTCGC CACCGCTGAG CAAATGGAGG CGGAGCTCTC GAGGAGCAAT AATCCCAACG AGAAGAAGTG GTTCGAGGAG TGGAACTCAG TTAAGTACCC GTGGTTCTAC TACACGACGG GCATTGGCTT TTACCACCAC GAGCCGAGAT GGATAGACGA TTTGAACATA CCGTTTAACT TCTTAAAAGA CTATATAATA AAAGTAAAGA GGGGGGAGAA ACTTGAGACC GCCACGGAGA GGCTGAGGAG GCAGAGAGAT GAAATAGCGG CTAAATATAG GGAGTTATTA AAAGAGGAAG ATAGAAAGCT GTTTGATCAG TATTTACAAG TGGCCAGGAC CGTGTTCCCA TACGTCGAAG AGCACAACTT CTTCGTGGAG CACTGGGGCC ATACGGTGTT TTACAACAAG GTGAAAGAGG TCGGCAAGAT ACTGTCAAAA CACGGGTTCT TAGAAAGACC CGAAGACATA TTCTATATGA CGTGGTGGGA GGTGTACCTG GCGCTGTTGG ATCTGGTGGC CAGCTGGGCG GTGGTCATGC CGCCTGTGGG GAAGTACTAC TGGCCTAGGG AGATTGCCAA ACGCAAGGAG ATAATATCTA AGCTAAAAGA ATACAAGCCT CCTCCTGCCC TGGGCAGGCC TCCCGAGGTC GTCACGGAGC CCTTCACCGT AATGTTGTGG GGCGTGACGA GGGAGCGCAT AGAGGATTGG CTCGGAACTG CGGCAGGAGC CGGGAAGATC ATCAAGGGCT TCCCGGCATC TCCCGGCGTG GTTGAGGGCA GGGCTGTGGT GGTCACCTCG GTGGAGGAGC TTAACAAGGT GAAGGAAGGC GACATATTGG TGTGTCCAAA CACGTCGCCT GCGTGGGGCC CCGTCTTCGC CAAAGTGAAG GCAGTGGTGT CGGATATCGG CGGATTAATG GCCCACGCGG CGATAGTGGC CAGGGAGTAC GGAGTGCCGG CAGTGGTGGG CACAGGCAAT GCATCGCGGA TCATAAAGGA CGGGCAACGG ATTAGGGTGG ACGGATTCAA AGGCGTTGTG GAAATATTAG AATGA
|
Protein sequence | MYAYYRLPAT LLFDRDAPKE LVEWERKMFW YQDSLHHYDV MYPLDDIHPA TWEIALSAYN SRIFVVPPAN GIPHRIVNGY LFISFVPVLD EKEIQKRLEY FQKRAGYYYQ NWNDIYEKNW KPEAEKIVKE MESLEFRDLP EVEDEKIVFE RMGYSSSAYH LVDNWLRLVM LHQRLWWKHF EMLNLGYAAY LLFYQFMKQK FPDIPDQHIA LMVAGIDVIL FKPDLELRRL AKLAVELGVA DRILSFATAE QMEAELSRSN NPNEKKWFEE WNSVKYPWFY YTTGIGFYHH EPRWIDDLNI PFNFLKDYII KVKRGEKLET ATERLRRQRD EIAAKYRELL KEEDRKLFDQ YLQVARTVFP YVEEHNFFVE HWGHTVFYNK VKEVGKILSK HGFLERPEDI FYMTWWEVYL ALLDLVASWA VVMPPVGKYY WPREIAKRKE IISKLKEYKP PPALGRPPEV VTEPFTVMLW GVTRERIEDW LGTAAGAGKI IKGFPASPGV VEGRAVVVTS VEELNKVKEG DILVCPNTSP AWGPVFAKVK AVVSDIGGLM AHAAIVAREY GVPAVVGTGN ASRIIKDGQR IRVDGFKGVV EILE
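
The gene sequence above is printed in blocks of 10 bases and already reads in coding orientation (it starts with ATG and ends with TGA), even though the gene lies on the minus strand. The following is a minimal sketch, not the database's own pipeline, of how the sequence could be cleaned, its GC fraction computed, and its translation reproduced. It uses the amino acid assignments of NCBI translation table 11, which are the same as those of the standard code. One simplification is an assumption of this sketch: alternative start codons such as GTG or TTG are not converted to M, which does not matter here because the gene starts with ATG.

```python
from itertools import product

# Codon order and amino acid string follow the NCBI genetic code listing
# (bases in the order T, C, A, G; third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(raw: str) -> str:
    """Remove the spaces between 10-base blocks and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence in the record above.
head = clean_sequence("ATGTATGCCT ATTACAGACT CCCGGCAACT")
print(translate(head))  # MYAYYRLPAT
```

Passing the full "Gene sequence" value through `clean_sequence` and then `gc_content` or `translate` gives values that can be compared against the "GC content" and "Protein sequence" rows.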