Gene Mboo_0045 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMboo_0045 
Symbol 
ID5410939 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Methanoregula boonei 6A8 
KingdomArchaea 
Replicon accessionNC_009712 
Strand
Start bp39752 
End bp41386 
Gene Length1635 bp 
Protein Length544 aa 
Translation table11 
GC content60% 
IMG OID640867259 
Producthypothetical protein 
Protein accessionYP_001403212 
Protein GI154149594 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.380843 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones48 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTATCCCG ATCAGGCAAC ATTCCGGAGG ATGGTAGAGG AAGAGCCTGA TCTCTCGGGG 
GCTGCAAAAG AGACCCTCCT CACCCTGTAC GATGGTCCCC TGCACCTCCG GCAGATCCTT
GAGCGGGTAA ATTTACCGGA GGCCGCGGGG GAACAGGAGG GCCACAAAGA CCGGGCCATC
ACCGAATCGG CCCTCAGGAA AAGGCTGGAA CTCCTGATCG GGCGGGGGAT CCTTGCCCGG
GCGGGAAGCG AGCGGACTAA TCCCTACTAT TACATCCGCC GCCCGTGGAT CTTCAACAAG
TACATCCTTA TCAAGTGCCG GGACAAGCCC CACGAGGGCC TTCTTGACCT GACTGTTCTG
CTCCACGAAC TCAGCCGTAT GGGTGATGGG GAGAACGCCG CCCTTCCCCA CCCGAGATTT
ATCTCCGCGG TAGGGGAACG GACAGAGCGG AGCCACCAGA TCGAAGCCGC GTACTCGGCA
TTCCAGAAGA TCCTCGGGAA CAGCAATGCG ATCGGCGACT ACCTCGAAGG GATCTACGAT
GACATCTATG CAGGAAAGAT CCCGGAGAGC GACATTGACA GCATGGTGGC CCGGAACTTC
CTGCGGTTTG TAGCCACCGG GCCGGTCGAA GAGCGGGAGG CCCGGTTTTT TCTGTGGTAC
GCGGATTTTT TCACGATACT CAACCAGTAC CAGGAGGCAT ACGAGGCATT TGTCCGGGGT
GTGGCCCTGG CAGAACAGCA GGGCCTGGCT CTTCCGGCCC TTTTGTCGGA ATCCCGGATC
ACCAAAGGCC GTATCCTGCT CCACCTCAAT GACCTTACCG GGGCAAAAGA GGCATATCTT
GAACTTCTCC GGATCCGGGA TGCCGATCCC CTCCTCAGGG CAAAAGGGCT CCTTGGCGCC
GGCGAGGTCG AGCTGCTCTG CGGGGATTAT GCACCCACGT CCTCACCGGC CCGGTTTGTC
CAGGCGCTCG AGCTCTGCGG AAAAGCGGAC CCGGCCAGGA CCAACCCGGA TGTCGCCGAG
CTGCGGGCCG ATATCCTGCG GAGGACCGGG ACGGCATGCC GCCTTTCCGG GAAGTCTGAC
GAGGCCTCCG GGTACTACGA CACGGCAGAA GAGATCTATC GCAACGGGAT GATGCGGGGC
CTGGTCATGC TCCTGCCCGA ACGGGCCGAG CTGTTCCGGG CCCGGGCGTT CCTTTCCGGC
CCGGCGGAGG CAGAAAAAGC GTGCGCACAA GCGGCGGCAG CATACGATGA GGCAAAGACC
GTGGCCCAGA GGGTGCGGAG TGTCAACTGG TTTGCCCACT GCCTTATCGG GGAATGCGAA
TGTGCACGCG TGGCATTCCA AAAGTGCAAA AAACCGTTCC CCCGGGACCT TGACACAAAA
TTCCAGAATG CGTTTGAGAT CTATTGCCAG ATCTCCTCGC ACTGGGGCAT TGTCCAGACA
TTCCTCTCAG AAGCGCTCCT CTTCCATGCA GCGCCGGACT CCTTCCCGGA CAGGTACGCG
TCCACAGCTG ACAAGCTCGA ACAGGCCGAG CGGTTCAGCC GCGAGCTCGG GCTCAGGAAC
GAACTGGCAA TCATACAGCG GATAAAATCC GGCTGCGGGC AGGAGTCCGA GCTCCACCCG
CTTATATTCC TCTGA
 
Protein sequence
MYPDQATFRR MVEEEPDLSG AAKETLLTLY DGPLHLRQIL ERVNLPEAAG EQEGHKDRAI 
TESALRKRLE LLIGRGILAR AGSERTNPYY YIRRPWIFNK YILIKCRDKP HEGLLDLTVL
LHELSRMGDG ENAALPHPRF ISAVGERTER SHQIEAAYSA FQKILGNSNA IGDYLEGIYD
DIYAGKIPES DIDSMVARNF LRFVATGPVE EREARFFLWY ADFFTILNQY QEAYEAFVRG
VALAEQQGLA LPALLSESRI TKGRILLHLN DLTGAKEAYL ELLRIRDADP LLRAKGLLGA
GEVELLCGDY APTSSPARFV QALELCGKAD PARTNPDVAE LRADILRRTG TACRLSGKSD
EASGYYDTAE EIYRNGMMRG LVMLLPERAE LFRARAFLSG PAEAEKACAQ AAAAYDEAKT
VAQRVRSVNW FAHCLIGECE CARVAFQKCK KPFPRDLDTK FQNAFEIYCQ ISSHWGIVQT
FLSEALLFHA APDSFPDRYA STADKLEQAE RFSRELGLRN ELAIIQRIKS GCGQESELHP
LIFL