Gene Information |
Locus tag | Msed_0756 |
Symbol | |
ID | 5103445 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | - |
Start bp | 687908 |
End bp | 689437 |
Gene Length | 1530 bp |
Protein Length | 509 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 640506661 |
Product | phosphoenolpyruvate carboxylase |
Protein accession | YP_001190855 |
Protein GI | 146303539 |
COG category | [S] Function unknown |
COG ID | [COG1892] Uncharacterized protein conserved in archaea |
TIGRFAM ID | [TIGR02751] phosphoenolpyruvate carboxylase, archaeal type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.0335931 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGACCCA TACCTAGAAC CATGTCTACC CAGCATCCAG ATAACGCAAC CGTCCCAGAA TGGGCTAAAG GCGATGTTAT TGAGGGAGAG GCAGAAGTAA TAGAGGCATA TTACGCGTTC TCCAGGCTCA ACGTGCACGA GGTCATGTGG GACGCTGAGG GCAAGGATGT AGACACCCAC GTGGTTAGGA AGTTGTTTTC CTCATTTGAC GAGTACTTTA AAAACAACAT CCTGGGTGAG GACATTTTCC TCACGTATAG GTTACCTAAC CCCAAAATAG AGGGGGCAGA GAGGAAGGTT TTCGCAGAGA CCATGGAGAG CATACCCATT ACCTTTGACG TTGCCGAGAG ATTTTACGGC AGGAAAGTAG TCCCGGTGTT TGAGGTTATT CTTCCCTTCA CCACTAACGC AAGTGACATC ATATCTGTGG CCAGGTATTA CGAACGTGCA GTGGCCATGG AAGAAAACAT CGAACTTCAG GACGGAGTTT ACGTGAGGGA TCTAGTGGGG GAGATCTATC CCAAGAGGAT AGAGGTTATA CCTCTCATAG AGGATAAGGA CTCACTCCTT AACACAAGGA ACATTATTGA GGGGTACTAT CGTGCAATTA AGCCGTCATA CATGAGGCTG TTCATTGCCA GATCGGATCC TGCCATGAAT TACGGCATGC TGACAGCGGT ACTTCTGGCA AAGTACGCCT TAAGCGAGGC TGGGAAGCTT GCTGAGGAGT TGGGAATTCC AATCTTTCCC ATCATAGGAG TAGGTTCTTT ACCCTTTAGA GGCCACCTAA GCCCCGAGAA CTATCAGAGA GTAATGGAAG AGTACGAGGG GGTTTACACG TTTACCATTC AGTCTGCCTT CAAGTATGAC TACAGCGAGG AACAGGTGAA GGGGGCAATT TCCCACATAA ACAGGGAGGA GGTCAAGGAA CCAAGGATCC TAGGTGAGGA GGAAAAGAAG GTTACCAGGG ACATCATAGA AACCTACACA CTATCCTATC AACCCGTGAT AGAGAGCTTG GCCAACCTGA TCAACACGGT AGCCCTTCAT CTGCCTAGGA GAAGGGCAAG GAAACTCCAC ATTAGTCTGT TTGGATACGC AAGGAGTACA GGAAAGGTGA TGTTGCCAAG AGCGATCACC TTTGTGGGCT CCCTGTACAG CGTTGGCCTC CCCCCAGAGG TCATTGGCAT CTCTTCACTC GGCAAACTAA ACGAGATGCA GTGGAATATA CTGGAGGAGA ACTACAAGTT CCTAAAGAAT GACCTGCAGA AGGCCTCAGA GTTCATTAAC CCTGAAGGGC TCTCAACCCT TGTGTCGTAC GGGTATCTAG ACGCGGAAAT CTCAAAGAAG CTAGAGGAGG ACATCAAATA CCTGGAGAGC ATGGGGGTAA AGATAGGACC TAGAAGTTAT GAGACCAAGA AGCACGCTCT ACTATCACAA CTTCTTATGC TATCACTCAA GGAGAAGAAA TATAACGAAG TCAAACAATA TGCGAGGGAA ATGGCAGTCA TCAGGAAGTC AATTGGGTAA
|
Protein sequence | MRPIPRTMST QHPDNATVPE WAKGDVIEGE AEVIEAYYAF SRLNVHEVMW DAEGKDVDTH VVRKLFSSFD EYFKNNILGE DIFLTYRLPN PKIEGAERKV FAETMESIPI TFDVAERFYG RKVVPVFEVI LPFTTNASDI ISVARYYERA VAMEENIELQ DGVYVRDLVG EIYPKRIEVI PLIEDKDSLL NTRNIIEGYY RAIKPSYMRL FIARSDPAMN YGMLTAVLLA KYALSEAGKL AEELGIPIFP IIGVGSLPFR GHLSPENYQR VMEEYEGVYT FTIQSAFKYD YSEEQVKGAI SHINREEVKE PRILGEEEKK VTRDIIETYT LSYQPVIESL ANLINTVALH LPRRRARKLH ISLFGYARST GKVMLPRAIT FVGSLYSVGL PPEVIGISSL GKLNEMQWNI LEENYKFLKN DLQKASEFIN PEGLSTLVSY GYLDAEISKK LEEDIKYLES MGVKIGPRSY ETKKHALLSQ LLMLSLKEKK YNEVKQYARE MAVIRKSIG
|
| |
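The coordinate, length, and sequence fields above can be cross-checked with a few lines of arithmetic. The following is a minimal sketch, not part of the original record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that Gene Length includes the stop codon; both assumptions are consistent with the listed values (1530 bp, 509 aa, and a gene sequence ending in TAA). The function names are hypothetical helpers introduced only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming the length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def clean_sequence(raw: str) -> str:
    """Remove the spaces that split the sequence into 10-letter blocks."""
    return "".join(raw.split()).upper()


def gc_percent(raw: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Values copied from the record (Start bp, End bp).
length = gene_length(687908, 689437)
print(length)                  # 1530
print(protein_length(length))  # 509
```

Passing the full Gene sequence field to gc_percent gives a value to compare with the listed GC content; the spaces between blocks are removed by clean_sequence before counting.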