Gene Mboo_1456 details

Gene Information

Locus tag: Mboo_1456
Symbol:
ID: 5411401
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Methanoregula boonei 6A8
Kingdom: Archaea
Replicon accession: NC_009712
Strand:
Start bp: 1493738
End bp: 1495399
Gene Length: 1662 bp
Protein Length: 553 aa
Translation table: 11
GC content: 55%
IMG OID: 640868691
Product: Ppx/GppA phosphatase
Protein accession: YP_001404617
Protein GI: 154150999
COG category: [F] Nucleotide transport and metabolism; [P] Inorganic ion transport and metabolism
COG ID: [COG0248] Exopolyphosphatase
TIGRFAM ID:
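
The start, end, gene length, and protein length values in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes the coordinates are 1-based and inclusive and that the unspliced CDS ends in a single stop codon that is not counted in the protein length.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the stop codon, which encodes no residue.
    return gene_length_bp // 3 - 1


# Values taken from the record above.
gene_len = span_length(1493738, 1495399)
protein_len = expected_protein_length(gene_len)
print(gene_len, protein_len)  # 1662 553
```

Both results agree with the Gene Length (1662 bp) and Protein Length (553 aa) fields listed in the record.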


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 29
Fosmid unclonability p-value: 0.240113
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGGGAA AAAAGAAGAT CGGCGCAACC GGGCGAATCG TGGCATTCAT TGACATTGGA 
ACAAACTCCG TACGGATGCT TGTGGTCCGG TTCAACCCCA ACCATTCCTA CAGTGTTCTT
TCCCGGCAGA AGCAGCAGGC TCGCCTGGGG GAGGGAGAGT TTGACGATGA CGTGATCACC
CCCGAGGCCA TAGACCGGGC CTGCATGGTC TGCCGCAAGT TCGTGGAGCT CGCAAAAACA
TTTGGGGCCG AGGAGTTCGT AGCCGTGGCA ACGTCAGCGG CACGGGAGGC TACCAACCAG
AACATTCTTC TCGAACGGTT CCGAGAGGAG GTGCAGATCG ATGTCCGGGT GATCTCCGGC
CTTGAGGAGG CACGGCTTAT CTACCTTGGC GTTGCAAGCG GGATGCACCT GGGTGAGAAG
CAGGCATTTT TTATCGATAT CGGGGGCGGA AGTACCGAGA TCTCGCTCGG CGGCCAACAG
CAGTTCACCC TTTTAGAAAG TTTCAGGCTC GGGGCGATCC GCCTCACCGG TATGTTCCTT
GCAGATAACC CCCCCGGGCC CGTGCGCCCG GACCAGTACC GGGTCATCCA GCAGTACGTG
AGAAATGCGA TTATCCACAC GGTCCAGAAG ATCAAAAACC AGAAGATCGA TCTGGCCGTG
GGAAGCTCGG GGAGCATCAT GAACCTTGCC GATATTGCGG CAAAGGCGCT TCATCCCGAG
GGCAGCCTTC CGGCAGGCGT TCTCACCTGC CGGGATCTTA AGAAGATAGT AGAGATCCTC
TGCTCCCTTC CCCTTGAAGA ACGCCGGAAG GTGCCCGGGA TCAATCCCGA AAGGGCAGAT
ATCATCATTG CCGGGGCGGT GATTCTTGAG ACATTTATGA AAGAACTTGC CCTGGATTCA
ATCACCACCA CCGGCCGCGG CCTCCAGGAC GGCCTTTTGA TGGACTATCT CTCGCGCATG
GAAGACTTCC CGCTTTTTGG TACGCTCTCC CCCCGGGAAC GAAGCGTGCT CCAGCTGGGC
CGGTCCTGCG GGATAAACGA GGCCCATGCA CGGAACGTGA CAAGGCTGGC GCTTGAGATC
TTTGATTCGG CAAAAGACGT AAAACTTCAC GAGTACGGAG ACAATGAACG TGAACTTCTG
GAATACGCGG CTTTCCTTCA TGACATCGGA TCGTTTATCT CGTTTACCAA CCATCATGCC
CATTCCTATT ATATCATAAA GAATTCCGAG CTGCTCGGCT TTGACCAAAA AGAGATTGAC
ATGATGGCAA ACATTGCCCG GTTCCACCGC AAAAAGAAAC CTCGCAAAAA AAGGCTTGAT
CTCCCGGATT TTGCCGACCA CGAGCAGGGG ACAGTTCTTG TCCTTGCCAT GTTTGTCCGG
CTGGCCGAGA GCCTGGATAG AAGTCATGCG GGGCTCGTGC AGCACGCTGA ATTTGTCCGT
GCAGAAAAAA ACGAGATTGT ACTGGATATT ATCGCAGAAA CGGACTGCCA GCTTGAGATC
TGGGGTGCAG AAGCGGAGCA GCGGGCGTTT GAGAAAGTCT TTGGCCGGCA CCTGAAAATT
GAAGTAATCG CACCAAGCTC CGTGAACAGG GAGGATACTA ATGATGAAGA TACCCTGCCT
GTTCCTCTTG CGGCCGCAAA AAAGAACGGT CCAGCCCGGT AA
 
Protein sequence
MKGKKKIGAT GRIVAFIDIG TNSVRMLVVR FNPNHSYSVL SRQKQQARLG EGEFDDDVIT 
PEAIDRACMV CRKFVELAKT FGAEEFVAVA TSAAREATNQ NILLERFREE VQIDVRVISG
LEEARLIYLG VASGMHLGEK QAFFIDIGGG STEISLGGQQ QFTLLESFRL GAIRLTGMFL
ADNPPGPVRP DQYRVIQQYV RNAIIHTVQK IKNQKIDLAV GSSGSIMNLA DIAAKALHPE
GSLPAGVLTC RDLKKIVEIL CSLPLEERRK VPGINPERAD IIIAGAVILE TFMKELALDS
ITTTGRGLQD GLLMDYLSRM EDFPLFGTLS PRERSVLQLG RSCGINEAHA RNVTRLALEI
FDSAKDVKLH EYGDNERELL EYAAFLHDIG SFISFTNHHA HSYYIIKNSE LLGFDQKEID
MMANIARFHR KKKPRKKRLD LPDFADHEQG TVLVLAMFVR LAESLDRSHA GLVQHAEFVR
AEKNEIVLDI IAETDCQLEI WGAEAEQRAF EKVFGRHLKI EVIAPSSVNR EDTNDEDTLP
VPLAAAKKNG PAR
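
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any downstream use. The following is a minimal sketch, not part of the original record: `clean_sequence` and `gc_fraction` are hypothetical helper names, and only the first line of the gene sequence is used as sample input. Run over the full gene sequence, `gc_fraction` could be compared against the GC content field reported in the record.

```python
def clean_sequence(text: str) -> str:
    """Join a block-formatted sequence by removing all whitespace."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(1 for base in seq if base in "GC") / len(seq)


# First line of the gene sequence, copied from the record above.
first_line = "ATGAAGGGAA AAAAGAAGAT CGGCGCAACC GGGCGAATCG TGGCATTCAT TGACATTGGA"
seq = clean_sequence(first_line)
print(len(seq))          # number of bases in this line after joining
print(gc_fraction(seq))  # GC fraction of this line only, not the whole gene
```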