Gene Mboo_1330 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMboo_1330 
Symbol 
ID5411597 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Methanoregula boonei 6A8 
KingdomArchaea 
Replicon accessionNC_009712 
Strand
Start bp1355151 
End bp1357046 
Gene Length1896 bp 
Protein Length631 aa 
Translation table11 
GC content57% 
IMG OID640868562 
Producthypothetical protein 
Protein accessionYP_001404491 
Protein GI154150873 
COG category[K] Transcription 
COG ID[COG1293] Predicted RNA-binding protein homologous to eukaryotic snRNP 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value0.507535 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGACAG CACAGGGGAT GAGCGGGATC GATGTACGGG CGGTCACGTA CGAGCTTGCC 
GGAAAGCAGC CGCTCTGGAT CGACAAGGTG TATCAGTTTG ATTCCCGCAC GCTTGGTATC
CGCTTAAACG GGGAGGCGCA TGCGAAGTAC CTGCTGCTCA TCGAGGCCGG GCGTCGGGCA
CACCTGGTAA AAAATGCTCC CGAGCCGCCC AAGAATCCCC CGCAGTTTGC CATGTTCCTG
CGCAAGTACC TGACGGGAGG TAAGGTGCTT GCGATCCGCC AGCACGGTCT GGAACGGATT
CTCATTTTCG ATATCGGGAA GGGTGCGCTT ACCTACCGGC TCATCATTGA GCTCTTTGAC
GAGGGAAATG TGATCCTTGC CGATGAAGCA TACCGGATCA TCAAACCGCT TCGCCACCAC
CGGTTCAAGG ACCGGGATAT TGTTCCCGAT GCGGTTTACG CAACGAGTGG CACTGACCCG
ACCGGCTCCC GGGAGAACCT TGCGGCAGTT CTCGCCGGAG ATGAGCGCGA CCTGGTACGG
GCGCTTGCGA TTGCCTGCAT GCTGGGCGGT ACGTATGCCG AATGGGTCTG CAGGACTGCC
GGCGCGGATA AGGCCATGCC TGCCGCACAG GCAGATCCAG CCCTGCTCTT TGATGCGGTG
ACCTCACTCT TTGACCGGGT GGAGCACCAT GTGCACCCGG TGATCTCCAA ATCAAGCTGC
GAGCCGGTGG TGCTTGCTGA AAATGCCCCG CAGGATGAGA ACCAGTTTGC CGGGTTCTCT
GACGCGCTCG AAGTCTTTTA CCCGATGACT AAAGCCGAGA AGGTAAAGGT GGCAGCGAGG
CCCAAACTAT CAGAAGGGGA GCGGATCCGG AAGTACCAGG AGGCCGCGAT CAAAAAGTTC
GACGAGAAGG TCGCAAAGGC CGAGGAGGTT GTGGCAGCGA TCTACGAGAA CTATCCGTTT
ATCTCTCAGG TAATCACCTC CCTTGCCGCC GCAAGTAAAC GCCTTTCCTG GCAGGAAATT
GAACATCATC TTAAAGATAC CTCATCGACC GATGCAAAAC GGATCACCGC CTTTTTCCCC
GGTGAGGCCG CAGTCGAGGT TGATATCGGC AAAAAGGTAA AGATCTTTGT CCACGAGACT
GTTGAGCAGA ACGCCGGGCA CTACTACGAC CAGATCAAGA AGTTCAAAAA GAAAAAAGAA
GGCGCCCTCC TTGCGATGAA AACGGTAAAA CCCCGGAAGA AGGTGATCCG CCACGATATC
GTCCCGATGA AAAAACTCTG GTACCACCGG TTCCGGTGGT TTATCACCAG TGACGGCGTG
GTAGTGCTGG GCGGCCGGGA CGCCGGGCAG AACGAGGAAC TGGTCAAGAA GTACATGACC
GGCGGCGATC TCTTTGTCCA TGCGGACGTA CATGGGGCAA GTGTGGTGAT CGTCAAAGGT
AAAACGGAAA AAATGGATGA GGTGGCGCAG TTTGCCGCAT CGTACTCGGG TGCCTGGAGG
AGTGGGCATT TTACGGCAGA TGTATTCAGC GCACAGCCAA CTCAGGTCAG CAAGACCCCG
CAGGCCGGCG AATTTGTTGC CCGGGGATCG TTTATCGTCC GGGGGGAGCG CACCTATTAC
CGCGATGTCC CTCTGTCTGT CGGGATAGGC CTTGTGCTGG AGCCATATGC GGCCGTGATC
GGGGGGCCAC CGGCCGTGAT CCGCTCGCGG ACAAAGACCT TTGCCGAGTT AAAGCCCGGC
CGGTTCGAGC CAAACGATGT GGCAAAGAAG GTGCTTCGCC AGCTGCGCGA GAAGATAACA
CCTGAAGAGG AGAAACTGCT CAAAGGGATC CTCAATACCG AATCTGTTGC CGCATTTGTC
CCGCCGGGTG GATCCGATAT AGCAGGGACG CCATGA
 
Protein sequence
MATAQGMSGI DVRAVTYELA GKQPLWIDKV YQFDSRTLGI RLNGEAHAKY LLLIEAGRRA 
HLVKNAPEPP KNPPQFAMFL RKYLTGGKVL AIRQHGLERI LIFDIGKGAL TYRLIIELFD
EGNVILADEA YRIIKPLRHH RFKDRDIVPD AVYATSGTDP TGSRENLAAV LAGDERDLVR
ALAIACMLGG TYAEWVCRTA GADKAMPAAQ ADPALLFDAV TSLFDRVEHH VHPVISKSSC
EPVVLAENAP QDENQFAGFS DALEVFYPMT KAEKVKVAAR PKLSEGERIR KYQEAAIKKF
DEKVAKAEEV VAAIYENYPF ISQVITSLAA ASKRLSWQEI EHHLKDTSST DAKRITAFFP
GEAAVEVDIG KKVKIFVHET VEQNAGHYYD QIKKFKKKKE GALLAMKTVK PRKKVIRHDI
VPMKKLWYHR FRWFITSDGV VVLGGRDAGQ NEELVKKYMT GGDLFVHADV HGASVVIVKG
KTEKMDEVAQ FAASYSGAWR SGHFTADVFS AQPTQVSKTP QAGEFVARGS FIVRGERTYY
RDVPLSVGIG LVLEPYAAVI GGPPAVIRSR TKTFAELKPG RFEPNDVAKK VLRQLREKIT
PEEEKLLKGI LNTESVAAFV PPGGSDIAGT P