Gene Mboo_0034 details


Gene Information

Locus tag: Mboo_0034
Symbol:
ID: 5411334
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Methanoregula boonei 6A8
Kingdom: Archaea
Replicon accession: NC_009712
Strand:
Start bp: 28486
End bp: 30906
Gene Length: 2421 bp
Protein Length: 806 aa
Translation table: 11
GC content: 57%
IMG OID: 640867248
Product: phosphoenolpyruvate synthase
Protein accession: YP_001403201
Protein GI: 154149583
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0574] Phosphoenolpyruvate synthase/pyruvate phosphate dikinase
TIGRFAM ID: [TIGR01418] phosphoenolpyruvate synthase
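
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming that the start and end positions are 1-based and inclusive and that the gene length counts the terminal stop codon while the protein length does not. Both are assumptions about this record's conventions, not something the record states.

```python
# Values copied from the record above.
start_bp = 28486
end_bp = 30906
gene_length_bp = 2421
protein_length_aa = 806

# Assumption: coordinates are 1-based and inclusive.
span_bp = end_bp - start_bp + 1

# Assumption: the gene length includes one terminal stop codon,
# which is not counted in the protein length.
total_codons = gene_length_bp // 3
coding_codons = total_codons - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("length is a multiple of 3:", gene_length_bp % 3 == 0)
print("codons match protein length:", coding_codons == protein_length_aa)
```

Under these assumptions all three checks agree with the listed values.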


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 34
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCGGCG CAAAATACAT TCGCTGGTTT GAGGAGATCA GGAACGAGGA TGTTGCCAGT 
GTCGGGGGCA AAAATGCCTC GCTTGGAGAG ATGTACCAGG ACCTGACAAA GGTGGGAGTA
AAGATCCCCA ACGGGTTTGC CATCACCGCC GACGCATACT GGCGTGTCGT GAAAACAGGG
AATGTCCTTG GCCAGCTCAA AACCACTCTC GCCGATCTCG ATAAGACCAG CGTCACCAAT
CTCGAAGGCC GGGGGAAACA GGCACGAGAT CTGATCCTTG GTGCAGGCAT CCCGGACGAT
ATCTGGAACG AGATAAGCGC GGCCTACGAT ACACTCTGTG AGGAATACGG GCCCGATACC
GATGTTGCAG TCAGGAGCTC CGCAACAGCC GAAGATCTTC CCACCGCCTC GTTTGCCGGC
CAGCAGGAGA CCTATCTTAA TATCCGGGGA TATCCTGCGC TCAGGGAGGC CTGCAGCAAA
TGTTTTGCCT CCCTCTTTAC CGACCGGGCA ATATCCTACC GGATCGACCA GAACTTCGAT
CATTTCAGGG TCGCGCTCTC GATCGGGGTC ATGAAGATGG TCCGCTCCGA CCTGGCGTCA
AGCGGGGTGA TTTTCACTCT CGATACCGAG ACCGGGTTCC GGGAGGTTGT CTTTATTACC
GGTTCGTACG GCCTTGGCGA GAACATCGTG CAGGGCGCTG TCAACCCGGA CGAGTTCTAC
GTCTTCAAAC CCACGGTGCG GACCGGGCAC CGGGCCATTA TCCGGAAAAA CCTTGGCGAC
AAGAAGATCA AGATGATCTA CGGGCAGGGC ACCTCCAAGG TCCTGACACG GAATGTGGAA
GTCCCCGAAT CAGATGGACG GCGGTTCTGT ATCACGGATG ACGAGATCCT CGAACTGGCA
CAGTTCGCGA TAAAGATTGA GGATCACTAC TCAGGGAAAG CACGCCAGAC CGTTCCGATG
GATATCGAAT GGGCAAAGGA TGGGATCACC GGGGAGCTCT TCATTGTGCA GGCCCGGCCC
GAGACCGTCC ACTCCCAGGA GGCAATGGAT ATCCTGGTGA CCTATCACCT GGAGAAGCGG
GGGCCGGTCC TTGCGACGGG AAAGGGTGTG GGTGAAAAGA TCGCGACCGG CAGGGTGCGG
GTGATCGCCG ATGTGAGCCG GCTTGGCGAG TTCCGGCCCG GGGAGATCCT TGTTGCCGAT
ACAACAAACC CCGACTGGGA ACCGGTCATG AAGACCGCTG CGGCTATCAT CACCAACCGT
GGAGGGCGGA CCTGTCATGC TGCGATCGTC AGCCGTGAGC TTGGCGTTGC AGCGGTGGTC
GGGACAAACG ATGCTACGGA GAAGATCAAA ACCGGTCAGA ACGTAACGGT CAACTGTTCG
GAGGGGGACG AGGGGGTGGT ATACGACGGG ATCCTGCAGT TCCATGTCGA AAAGGTATCC
CTAAAGGATC TCAGGCGCCC GAAAACAAAG ATTATGATGA ACCTTGGGGA GCCTGACCAG
GCCTTTGCCC TTTCGATGAT CCCAAACGAC GGCATCGGGC TTGCCCGGAT GGAGTTTGTC
ATCAACAATT ATATCAAAAT CCACCCGATG GCACTCGTCC ACCCGGAAAA AGTCACCGAT
GAGACCGTAA AGGCAAAGAT CGCCGGCCTC ACGTACGGGT ACGCAAGCCC GGAAGATTAT
TTTGTCGGAA AACTGTCGCA GGGCATCGGG ACGATTGCGG CCGCTTTTTA TCCCAAACCG
GTGGTGGTCC GGATGAGCGA TTTCAAGACC AACGAGTATG CCGATCTTCT CGGGGGGAAA
TCTTTCGAGC CCGAAGAGTC AAACCCCATG CTCGGCTTCC GCGGGGCCTC CCGGTACTAT
GATGAGAAGT ACCGCGAAGG TTTTGCCCTT GAATGCCGTG CGATGAAACA GGTACTCGAA
GAGATGGGTC TCACTAACCT GGTCATCATG ATCCCATTCT GCCGGACGGT CGATGAGGGG
GAAAAGGTGC TTGCCGAGAT GGCAAAGAAC GGGCTTGTGC GGGGTAAGAA CGGGCTGTCG
GTCTATGTGA TGTGCGAGAT CCCCAACAAC ATCCTCCTTA TCGATGAATT CAGCCGGCTG
TTCGATGGTT TCTCAATCGG TTCAAACGAC CTGACTCAGC TGACCCTCGG GGTTGACCGC
GACTCGGTGG TGGTGGCCCA CGATTTCGAT GAACGGGATC CCGGTGTGAT GAAATTTATA
TCCCTGGCGG TACAGGGTGC GAAACGGAAC GGACGCCATT CCGGCCTGTG CGGGCAGGCG
CCCAGCGACT ACCCGGAGTT CGCCGAGTTC CTGGTCAGGG AAGGCATCGA ATCCATATCC
CTCAACCCGG ACTCGGTGAT GAAGATCACC CAGAAAGTGG TTGCACTGGA GGAGCGTCTG
GCGGGCAGGA AAACAACGTA A
 
Protein sequence
MSGAKYIRWF EEIRNEDVAS VGGKNASLGE MYQDLTKVGV KIPNGFAITA DAYWRVVKTG 
NVLGQLKTTL ADLDKTSVTN LEGRGKQARD LILGAGIPDD IWNEISAAYD TLCEEYGPDT
DVAVRSSATA EDLPTASFAG QQETYLNIRG YPALREACSK CFASLFTDRA ISYRIDQNFD
HFRVALSIGV MKMVRSDLAS SGVIFTLDTE TGFREVVFIT GSYGLGENIV QGAVNPDEFY
VFKPTVRTGH RAIIRKNLGD KKIKMIYGQG TSKVLTRNVE VPESDGRRFC ITDDEILELA
QFAIKIEDHY SGKARQTVPM DIEWAKDGIT GELFIVQARP ETVHSQEAMD ILVTYHLEKR
GPVLATGKGV GEKIATGRVR VIADVSRLGE FRPGEILVAD TTNPDWEPVM KTAAAIITNR
GGRTCHAAIV SRELGVAAVV GTNDATEKIK TGQNVTVNCS EGDEGVVYDG ILQFHVEKVS
LKDLRRPKTK IMMNLGEPDQ AFALSMIPND GIGLARMEFV INNYIKIHPM ALVHPEKVTD
ETVKAKIAGL TYGYASPEDY FVGKLSQGIG TIAAAFYPKP VVVRMSDFKT NEYADLLGGK
SFEPEESNPM LGFRGASRYY DEKYREGFAL ECRAMKQVLE EMGLTNLVIM IPFCRTVDEG
EKVLAEMAKN GLVRGKNGLS VYVMCEIPNN ILLIDEFSRL FDGFSIGSND LTQLTLGVDR
DSVVVAHDFD ERDPGVMKFI SLAVQGAKRN GRHSGLCGQA PSDYPEFAEF LVREGIESIS
LNPDSVMKIT QKVVALEERL AGRKTT