Gene Mboo_1026 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMboo_1026 
Symbol 
ID5411489 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Methanoregula boonei 6A8 
KingdomArchaea 
Replicon accessionNC_009712 
Strand
Start bp1005433 
End bp1006701 
Gene Length1269 bp 
Protein Length422 aa 
Translation table11 
GC content53% 
IMG OID640868252 
Producthypothetical protein 
Protein accessionYP_001404187 
Protein GI154150569 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1361] S-layer domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.441563 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value0.903414 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCCAGA CCATACACCC ACGGGCCACT CCCGATAACG GGAAACTTTC CCCAACGGAA 
AGACCGGCCG GCCAGATGCA GGAAAGAGTG AGACAAAAAA AATTCCCCGG CTCCGTGAGC
GGGATATTTT TGCTCCTTGC CGTTACGCTG GGGATCTTTG CGATTCCGGT GTCTGCCTAC
CCTACGCCTT ATTCTCTCTC GGCAGATTCC AGCGTATATG TCTCTAATGT TACGTATTAC
CCGGGCGCTT TTTTTTCCGG TGACAGCGGT ACGGTCACGT ACCAGGTGAT CAACGGCAAT
ACCAACACGA GCATGGTGGT AAATCATGCA TCGTTCAGCG ATACCGATAT CCGGCTGACA
AGCGGTACCT ATGATTCCTC GCAAAACATC GGTCCTCTCC AGACAGAGCC GTTTACGTTT
TCGATTACCA CCAATGCGAG TGACGGCAAC TACTACCCCA CCTTCTCCCT TTCGTTCCAG
GATGGCGAGT CCATGCACTA CCAAGGACTG GTCAAGGTGG ATAACCGCCC GCTGGTCATG
ACCATCCAGG ACCAGCCGGA TGCCTATACC CAGGGAAAGA AGAACACGAT CAGCGTGCAG
ATCGCAAACC CCCGGTCCGA CGATGTACAC AATGTGATCT TCACGGTTTC CGGTGATGGC
GCTACACTTA CGCCATCGCA GACCTATATT GGGGACCTCC CGTCAGGAGC CATGACGCTG
GTCAATTTCA CGGTTACACC AAATGCACCC ACCACCCTGA ACCTGGTGGT CGGTTACGAC
AATGGCGATA ACGCACACAG CATCGATTCG ACCCTTCCAA TCCAGTTCAC CACAGACAAG
CAGCAGGCCG ACCCGGTGAT GAGCAACATC GTTATTACCG CCAATGGCAC GGTCTACACG
GTCAACGGTG ATTCAACCAA TGCCGGACTT TTAAATGCAA ATGGTGTAAC GATCACCGCT
CTTTCCCCGG CAGTTCCGGA AGATCCCTAC CAGAATTACG TGATCGGGAC ACTCAAACCT
GACGATTTCG GCAGTTTCGA ACTTACCTTC TCCGTCCCTG AGGGAACAAA GAGCATTCCC
CTCAAGCAGT CCTTCAAGGA TAGTGACGGC AACGTGATCA CTTCAACCCA GGATATTGAC
CTGACAACTG CCCAGCAGGC TTCGCAAAGC AATGCCGGTC CGGGAATGCT CCCGGTGCTT
GTCGTTGTTG CCATTGTCGT GATCGGTGCG GGCGGCTACC TGTATATGAA AAAGAATCGG
AAACAGTGA
 
Protein sequence
MIQTIHPRAT PDNGKLSPTE RPAGQMQERV RQKKFPGSVS GIFLLLAVTL GIFAIPVSAY 
PTPYSLSADS SVYVSNVTYY PGAFFSGDSG TVTYQVINGN TNTSMVVNHA SFSDTDIRLT
SGTYDSSQNI GPLQTEPFTF SITTNASDGN YYPTFSLSFQ DGESMHYQGL VKVDNRPLVM
TIQDQPDAYT QGKKNTISVQ IANPRSDDVH NVIFTVSGDG ATLTPSQTYI GDLPSGAMTL
VNFTVTPNAP TTLNLVVGYD NGDNAHSIDS TLPIQFTTDK QQADPVMSNI VITANGTVYT
VNGDSTNAGL LNANGVTITA LSPAVPEDPY QNYVIGTLKP DDFGSFELTF SVPEGTKSIP
LKQSFKDSDG NVITSTQDID LTTAQQASQS NAGPGMLPVL VVVAIVVIGA GGYLYMKKNR
KQ