Gene Msed_0756 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsed_0756 
Symbol 
ID5103445 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMetallosphaera sedula DSM 5348 
KingdomArchaea 
Replicon accessionNC_009440 
Strand
Start bp687908 
End bp689437 
Gene Length1530 bp 
Protein Length509 aa 
Translation table11 
GC content48% 
IMG OID640506661 
Productphosphoenolpyruvate carboxylase 
Protein accessionYP_001190855 
Protein GI146303539 
COG category[S] Function unknown 
COG ID[COG1892] Uncharacterized protein conserved in archaea 
TIGRFAM ID[TIGR02751] phosphoenolpyruvate carboxylase, archaeal type 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0335931 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGACCCA TACCTAGAAC CATGTCTACC CAGCATCCAG ATAACGCAAC CGTCCCAGAA 
TGGGCTAAAG GCGATGTTAT TGAGGGAGAG GCAGAAGTAA TAGAGGCATA TTACGCGTTC
TCCAGGCTCA ACGTGCACGA GGTCATGTGG GACGCTGAGG GCAAGGATGT AGACACCCAC
GTGGTTAGGA AGTTGTTTTC CTCATTTGAC GAGTACTTTA AAAACAACAT CCTGGGTGAG
GACATTTTCC TCACGTATAG GTTACCTAAC CCCAAAATAG AGGGGGCAGA GAGGAAGGTT
TTCGCAGAGA CCATGGAGAG CATACCCATT ACCTTTGACG TTGCCGAGAG ATTTTACGGC
AGGAAAGTAG TCCCGGTGTT TGAGGTTATT CTTCCCTTCA CCACTAACGC AAGTGACATC
ATATCTGTGG CCAGGTATTA CGAACGTGCA GTGGCCATGG AAGAAAACAT CGAACTTCAG
GACGGAGTTT ACGTGAGGGA TCTAGTGGGG GAGATCTATC CCAAGAGGAT AGAGGTTATA
CCTCTCATAG AGGATAAGGA CTCACTCCTT AACACAAGGA ACATTATTGA GGGGTACTAT
CGTGCAATTA AGCCGTCATA CATGAGGCTG TTCATTGCCA GATCGGATCC TGCCATGAAT
TACGGCATGC TGACAGCGGT ACTTCTGGCA AAGTACGCCT TAAGCGAGGC TGGGAAGCTT
GCTGAGGAGT TGGGAATTCC AATCTTTCCC ATCATAGGAG TAGGTTCTTT ACCCTTTAGA
GGCCACCTAA GCCCCGAGAA CTATCAGAGA GTAATGGAAG AGTACGAGGG GGTTTACACG
TTTACCATTC AGTCTGCCTT CAAGTATGAC TACAGCGAGG AACAGGTGAA GGGGGCAATT
TCCCACATAA ACAGGGAGGA GGTCAAGGAA CCAAGGATCC TAGGTGAGGA GGAAAAGAAG
GTTACCAGGG ACATCATAGA AACCTACACA CTATCCTATC AACCCGTGAT AGAGAGCTTG
GCCAACCTGA TCAACACGGT AGCCCTTCAT CTGCCTAGGA GAAGGGCAAG GAAACTCCAC
ATTAGTCTGT TTGGATACGC AAGGAGTACA GGAAAGGTGA TGTTGCCAAG AGCGATCACC
TTTGTGGGCT CCCTGTACAG CGTTGGCCTC CCCCCAGAGG TCATTGGCAT CTCTTCACTC
GGCAAACTAA ACGAGATGCA GTGGAATATA CTGGAGGAGA ACTACAAGTT CCTAAAGAAT
GACCTGCAGA AGGCCTCAGA GTTCATTAAC CCTGAAGGGC TCTCAACCCT TGTGTCGTAC
GGGTATCTAG ACGCGGAAAT CTCAAAGAAG CTAGAGGAGG ACATCAAATA CCTGGAGAGC
ATGGGGGTAA AGATAGGACC TAGAAGTTAT GAGACCAAGA AGCACGCTCT ACTATCACAA
CTTCTTATGC TATCACTCAA GGAGAAGAAA TATAACGAAG TCAAACAATA TGCGAGGGAA
ATGGCAGTCA TCAGGAAGTC AATTGGGTAA
 
Protein sequence
MRPIPRTMST QHPDNATVPE WAKGDVIEGE AEVIEAYYAF SRLNVHEVMW DAEGKDVDTH 
VVRKLFSSFD EYFKNNILGE DIFLTYRLPN PKIEGAERKV FAETMESIPI TFDVAERFYG
RKVVPVFEVI LPFTTNASDI ISVARYYERA VAMEENIELQ DGVYVRDLVG EIYPKRIEVI
PLIEDKDSLL NTRNIIEGYY RAIKPSYMRL FIARSDPAMN YGMLTAVLLA KYALSEAGKL
AEELGIPIFP IIGVGSLPFR GHLSPENYQR VMEEYEGVYT FTIQSAFKYD YSEEQVKGAI
SHINREEVKE PRILGEEEKK VTRDIIETYT LSYQPVIESL ANLINTVALH LPRRRARKLH
ISLFGYARST GKVMLPRAIT FVGSLYSVGL PPEVIGISSL GKLNEMQWNI LEENYKFLKN
DLQKASEFIN PEGLSTLVSY GYLDAEISKK LEEDIKYLES MGVKIGPRSY ETKKHALLSQ
LLMLSLKEKK YNEVKQYARE MAVIRKSIG