Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Pars_1014 |
Symbol | |
ID | 5054751 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | - |
Start bp | 905698 |
End bp | 907080 |
Gene Length | 1383 bp |
Protein Length | 460 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 640468572 |
Product | phosphoenolpyruvate carboxylase |
Protein accession | YP_001153246 |
Protein GI | 145591244 |
COG category | [S] Function unknown |
COG ID | [COG1892] Uncharacterized protein conserved in archaea |
TIGRFAM ID | [TIGR02751] phosphoenolpyruvate carboxylase, archaeal type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.189328 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 0.549107 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGATGCATA TACCGCGTTT GATGTGTACA CAGCATCCTG ACACCACCGT TAAGATCACC ACTGCAGAGG AGGTTGACGA GGCCATAGTG GCGTACACCG CGTATGGGTG CGATGAGGTG ATGGTAGACT ACGAGGGAAA GATGACGCCA TATGGACAGC CCAAGGAGAT TGTAATGAAG GCGATCAGGG GCGATGTGCC TCTTGGCGAT GAGTTCTACA TCACTGTGAG GCTTCCTAAT CCCAAACTGG AGGAGTTTGA TAGAGCGATG CTTTCGCTAG AGGCGGCTCT CGTGGCGAAC TACTTCTCTA GGAGGTACGC CGACGCGCAG GCGGTGAGGT GGGTGGTGTT GCCCATGGTG GAGGATTTTG ATACTGTGAT ACTGGTGAGG AGGATGCTGA GGCGCAAGGC CGAGATCTAC AAGTCGGAGA CAGGTGTTGA CGTGGGGGAG GTGGAGGTGA TCCCACTTAT AGAAGACGCC TTTGTGCAGG TGAAGGCTAA GGTGATTGTC GGCGAGGTCT TCAAGAGCGA GGAGGCAAGG GAGGTGAGGG TCTTCTTAGG CAAGTCCGAC TCGGCGGTGA GGCACGGCCA CCTCGCGTCG GCGTTGGCTA TAATCTACGC TATGTCGAAA CTGAAGGAGT TCGAGGCAGA GTCAGGTATA AGGGTGCGGC CCATCTTAGG CATGGGATCA CCGCCTTTTA GAGGCGCCTT AAACAACCCG AGGCTGGCCC ACTTGGAGGT GGTGCAGTAC GCGGGCTACT ACACCGCCAC GATACAGTCA GCAGTTAGGT ACGACACGTC GCTAGACGAG TACGTAAAGG TCAGGGAGTC TATCCTCAAC GCTTGTTGCG GCACGAGGGG CCTGGTGGGG GAGGAGGTGT TGCCGCTCAT ACAGGAAGCC TCCGCTAAGT ACAGGTCACA AGCTATGAAG CACGTGGATA AAATAGCGGA GGTGGCCCGT CTCGTGCCAA GCACTAGGGA TAGGGTTAGC TGGAAGGAGT ACGGCAGATC CCTACTTGAC GGAGACCGAG TAGTGCATAT GCCCCGTGCC ATTGTATACA CGTCGGCGTG GTATGCCATG GGATTCCCGC CGACGCTCAT TGATGCGCCG TTACTCTTGG AGCTGGCTAA GTCGGATAAG CTAGATGCGG TGTTTAAACT CCTTCCTACC TACAAGATGG AGCTGGAATA CGACTACGAG TTTTTCGACC CACAGACAGC GAGGAACTAC CTAAGTGAGG AGCTCGTATA CGCCGCGGTG GAACTCGCCG ACTATCTCGG CGTAGAGGCA AGGCCCACCC CCACTTACAC CGCGTTGTTG AAGATGCCTA GAAGCGAGCC TAACATAATT GCACTGAGCA AATATCGAAA ATTCCTGGGA TAA
|
Protein sequence | MMHIPRLMCT QHPDTTVKIT TAEEVDEAIV AYTAYGCDEV MVDYEGKMTP YGQPKEIVMK AIRGDVPLGD EFYITVRLPN PKLEEFDRAM LSLEAALVAN YFSRRYADAQ AVRWVVLPMV EDFDTVILVR RMLRRKAEIY KSETGVDVGE VEVIPLIEDA FVQVKAKVIV GEVFKSEEAR EVRVFLGKSD SAVRHGHLAS ALAIIYAMSK LKEFEAESGI RVRPILGMGS PPFRGALNNP RLAHLEVVQY AGYYTATIQS AVRYDTSLDE YVKVRESILN ACCGTRGLVG EEVLPLIQEA SAKYRSQAMK HVDKIAEVAR LVPSTRDRVS WKEYGRSLLD GDRVVHMPRA IVYTSAWYAM GFPPTLIDAP LLLELAKSDK LDAVFKLLPT YKMELEYDYE FFDPQTARNY LSEELVYAAV ELADYLGVEA RPTPTYTALL KMPRSEPNII ALSKYRKFLG
|
| |