Gene Mboo_1197 details

Gene Information

Locus tag: Mboo_1197
Symbol:
ID: 5411345
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Methanoregula boonei 6A8
Kingdom: Archaea
Replicon accession: NC_009712
Strand:
Start bp: 1211003
End bp: 1212097
Gene Length: 1095 bp
Protein Length: 364 aa
Translation table: 11
GC content: 54%
IMG OID: 640868423
Product: PEGA domain-containing protein
Protein accession: YP_001404358
Protein GI: 154150740
COG category:
COG ID:
TIGRFAM ID:
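
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive coordinates and that the CDS ends in a stop codon that is not counted in the protein length. Both are assumptions about the database's conventions rather than something the record states, and the variable names are illustrative.

```python
# Values copied from the record above.
start_bp = 1211003
end_bp = 1212097

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS includes one terminal stop codon that is not translated.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")     # compare with "Gene Length"
print(protein_length, "aa")  # compare with "Protein Length"
```

Under these assumptions the computed values agree with the reported 1095 bp and 364 aa.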


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.258799
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value: 0.00873035
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAGCTCTA AAAATTTCCC GGAATTGTTT TCCTGGCTTC TTCTTGCAAT AATTCTGCTA 
TTGTGCATCG GACCGGCACA GGCCGGGACC GTTTCGATCA CGTACCGGGG AAGCGGTGGA
TATTATGTTG GTGACAGTGT GATACTGGAC GGGATGAACA CGGTAGGCAA CACCACCGTG
ATAACCATCA CCGGCCCGGG CCTGCCGGCT GCCGGTGTAC CTCCCTATAA CCTGACCGGC
GACGCAGGAA CCGGGAATAC CGCGGTTACC GATCCGTCCG GGACATGGTC ATACGACTGG
GATTCGTCAC GGGCACTGGG AGCCTCCAGT CTTAACCCCG GACGGTATAC ATTTACGGTC
TACGACAACA GCAATTCTCA AATTAACTCC TCGGTCTCTG TCTTCCTGAA GCAACCGGAA
TTTTATGCCT CCATATCTCC CAACCCGGCT GTTCTCAATG ATTATGTGCA GGTAACCGGA
AAGGTGGAAT CTGCGGCAGA TACCATCGGG ATTGATGTGA TAGATGCATC CGGGAATAAG
GTGCATACCT TTTCCTCGCC GGTCAGTAAC GGGGGGTATT TCCAGTATGG ATTCCATGTG
GATATGCCCC CGGGCGTGTA CACGGTTTAC ATCAGCAGCC CTTCACTGTC CAACAGCCTG
ACAAGCACCC TGACCGTGGT AGAATCCAAT GCAAACCTGA CGGCGGTTGC ACCGGTTATT
AGCACTCAGG TTACTTCGCC TCCTGCTTCG ACCGGGACAC CTGTTGCTCC TCAGGCCACG
GCCACAATCC CGCCGGGATC GGGGACACTG GTGATATCAT CAGTACCGGC CGGCGCTTCA
GTCTATCTTG ATTCAGCAAA TGTCGGAATT TCGCCGGTGA CACTGAATGG CGTTGCACCC
GGTACGCACC TTGTGGAGAT CAAGTCTCCG GGTTACCTTA CCGTGTCCAT GGATGTCGTT
GTCACAAGTG ACAAGCCTGT TGAGGTCTCA CCCCAGCTGG TAAGGGCACC CTTTGGACTT
GGGCTTTCTC CCCTTGCAGC GCTCGGCGGT TGCCTTGGTG CAGCAGCTTT GTTTATCGTT
TCACGGAAGA AATAA
 
Protein sequence
MSSKNFPELF SWLLLAIILL LCIGPAQAGT VSITYRGSGG YYVGDSVILD GMNTVGNTTV 
ITITGPGLPA AGVPPYNLTG DAGTGNTAVT DPSGTWSYDW DSSRALGASS LNPGRYTFTV
YDNSNSQINS SVSVFLKQPE FYASISPNPA VLNDYVQVTG KVESAADTIG IDVIDASGNK
VHTFSSPVSN GGYFQYGFHV DMPPGVYTVY ISSPSLSNSL TSTLTVVESN ANLTAVAPVI
STQVTSPPAS TGTPVAPQAT ATIPPGSGTL VISSVPAGAS VYLDSANVGI SPVTLNGVAP
GTHLVEIKSP GYLTVSMDVV VTSDKPVEVS PQLVRAPFGL GLSPLAALGG CLGAAALFIV
SRKK
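
As a sanity check that the gene and protein sequences correspond, the two ends of the gene sequence can be translated and compared with the ends of the protein sequence. This is a minimal sketch, not the database's own pipeline. The helper names (`CODON_TABLE`, `translate`) are hypothetical, and the codon table is built from the standard amino-acid assignments, which translation table 11 shares (table 11 differs in which codons may act as start codons). Only the first and last lines of the gene sequence are used, copied from the record with the spacing left in.

```python
from itertools import product

# Standard amino-acid assignments in NCBI order (bases T, C, A, G);
# "*" marks a stop codon. Table 11 uses the same assignments.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, ignoring whitespace."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


# First and last lines of the gene sequence, as given in the record.
first_line = "ATGAGCTCTA AAAATTTCCC GGAATTGTTT TCCTGGCTTC TTCTTGCAAT AATTCTGCTA"
last_line = "TCACGGAAGA AATAA"

n_terminus = translate(first_line)
c_terminus = translate(last_line)

print(n_terminus)  # compare with the start of the protein sequence
print(c_terminus)  # compare with the end of the protein sequence, plus "*"
```

The first 20 residues come out as MSSKNFPELFSWLLLAIILL and the last codons as SRKK followed by a stop, matching the start and end of the protein sequence listed above.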