Gene Mboo_1040 details

Gene Information

Locus tag: Mboo_1040
Symbol:
ID: 5412255
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Methanoregula boonei 6A8
Kingdom: Archaea
Replicon accession: NC_009712
Strand:
Start bp: 1023148
End bp: 1024404
Gene Length: 1257 bp
Protein Length: 418 aa
Translation table: 11
GC content: 58%
IMG OID: 640868266
Product: hypothetical protein
Protein accession: YP_001404201
Protein GI: 154150583
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0407] Uroporphyrinogen-III decarboxylase
TIGRFAM ID:
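
The start, end, gene length, and protein length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the CDS includes its stop codon; neither is stated in the record, although the gene sequence does end in TAG. The function names are hypothetical, chosen for illustration only.

```python
# Coordinates copied from the record above.
START_BP = 1023148
END_BP = 1024404


def gene_length(start: int, end: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residue count, assuming the CDS ends with one stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1  # drop the stop codon


n_bp = gene_length(START_BP, END_BP)
print(n_bp)                  # 1257
print(protein_length(n_bp))  # 418
```

Both values agree with the Gene Length and Protein Length fields listed in the record.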


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.582207
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 30
Fosmid unclonability p-value: 0.497226
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGACGGCG CACAGATTTA CCGGGAGAAG CTTGCCCGGC TTGAGACCGT TATTGCGGGT 
AAAGAGCCGG ACCGGGTGCC GGTCACCGCG ATGGTGGATC TCTTCCACGG GCGGTACGCG
GGGTACACGG CGCAGGAGAT ATTCTTCGAT TACGGGAAGA ACCACGATGC AGCGATGAAG
ACTGCCAAAG ACTTTGACTT TGATTCGCTG CTGGTCTTAA ACGGCCTTGA GGGGATGAAC
ATGGTACTCA CGTTCATGAA GAACAACCCG CCGCTTGCAT CAGGGGCGCG GTTCATGACC
GGCCCGTTCC ACCAGATCTT AAAAGACGTG TACACGAAAT GGCCCGGTGT GGAACTCGAC
GCTTCCTCCC ACCCGCAGTT CGTGGGAAAG GAGATCATGA AACCGGAGGA GTATGGCCAG
CTCATCGCCG ACCCGTCCGG CTTTTTGAAC CGGGTTGCGC TGCCGCGGAT GTGCCCGGCA
CTTGCTGATT TGGGATCTCC TGAGGCAAAC GCTGCGATGC TTGCCTACGG CGCGGAACTC
TCAAAATCCG GGGCGGCGCA GATGGCGGTT ATCGGGCAGC TCGGACAGAT GGGCATCCCG
ACGTTCCCGA CCTCGTGGAG TTATGCCCCG CTCGACTTCG TGAGCGACTT TTTGCGGGAC
ATAAAAAATG TCGTGCTCGA CATCTACCGC AAGCCCGATC TCGTGAAGCA GTCCGCGGAT
GCGCTCGTGG AGCCCTTGAT CGAATCGGCC CGGCTGAGCG GTGCCGTCCC GCCCGAGGTC
AAAAAGGCCC TTGGGACAAA CGTGGTCGAG TGCTTCTTCC CGCTGCACTT AACCGAGTAC
CTCAATCCGA AGCAGTACAA CGAGTTCTAC TGGCCGTCGT TAAAGAAGGT GCTTCTCGAA
GTGATCAACA TGGGCCAGAC GCCGTACATC CTCTTTGAGG GCCGGCACGA TGCGCACCTG
GAAACCCTCC TCGATCTCCC GAAAGGAAAG ATCGTTGCGG TCTTTGACAA GACCGACCCG
AGGAAAGTCC GGGAGGTGCT CGATGACCAT GTGGTTCTCG TATCTGGCCC GCCTAACTCG
CTTCTCATCG GAGGCACACC ACAGAAGGTG GACGATTACA TGAAGTCAAT GCTTGACGAC
TGCAAGCAGG GCGGCATGAT GATCTACCCG GGTGCGGACG GTGGCATATC TGGTGAAGCC
CGGCCGGAGA ATGTCAGGGC TGTGTTAGAG GCCGTGAAGA AATACGGGAC GTATTAG
 
Protein sequence
MDGAQIYREK LARLETVIAG KEPDRVPVTA MVDLFHGRYA GYTAQEIFFD YGKNHDAAMK 
TAKDFDFDSL LVLNGLEGMN MVLTFMKNNP PLASGARFMT GPFHQILKDV YTKWPGVELD
ASSHPQFVGK EIMKPEEYGQ LIADPSGFLN RVALPRMCPA LADLGSPEAN AAMLAYGAEL
SKSGAAQMAV IGQLGQMGIP TFPTSWSYAP LDFVSDFLRD IKNVVLDIYR KPDLVKQSAD
ALVEPLIESA RLSGAVPPEV KKALGTNVVE CFFPLHLTEY LNPKQYNEFY WPSLKKVLLE
VINMGQTPYI LFEGRHDAHL ETLLDLPKGK IVAVFDKTDP RKVREVLDDH VVLVSGPPNS
LLIGGTPQKV DDYMKSMLDD CKQGGMMIYP GADGGISGEA RPENVRAVLE AVKKYGTY
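
The relationship between the gene sequence and the protein sequence can be spot-checked by translating a fragment. This is a minimal sketch and not the database's own pipeline: it builds the codon table from the conventional TCAG ordering, whose codon-to-amino-acid assignments translation table 11 shares with the standard code (table 11 differs only in its permitted start codons, which this sketch ignores). The `translate` helper is hypothetical. It strips the whitespace that separates the 10-nucleotide blocks and drops one trailing stop codon.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string; spaces and newlines are ignored."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))
    # Remove a single terminal stop codon, if present.
    return protein[:-1] if protein.endswith("*") else protein


# First 30 nucleotides of the gene sequence, as printed in the record.
print(translate("ATGGACGGCG CACAGATTTA CCGGGAGAAG"))  # MDGAQIYREK

# Final six codons of the gene sequence, including the TAG stop.
print(translate("AAATACGGGACGTATTAG"))  # KYGTY
```

The two outputs match the first ten and last five residues of the protein sequence shown above.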