Gene Mboo_0580 details


Gene Information

Locus tag: Mboo_0580
Symbol:
ID: 5411509
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Methanoregula boonei 6A8
Kingdom: Archaea
Replicon accession: NC_009712
Strand:
Start bp: 549105
End bp: 550274
Gene Length: 1170 bp
Protein Length: 389 aa
Translation table: 11
GC content: 60%
IMG OID: 640867799
Product: radical SAM domain-containing protein
Protein accession: YP_001403741
Protein GI: 154150123
COG category: [R] General function prediction only
COG ID: [COG0641] Arylsulfatase regulator (Fe-S oxidoreductase)
TIGRFAM ID:
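
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino acid count for an unspliced CDS that ends in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


length = gene_length(549105, 550274)
print(length)                  # 1170
print(protein_length(length))  # 389
```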


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.185035
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value: 0.0438607
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAAACC CGTTCCACGT GATGATCATC CCGACGCTGG GCTGCCCCTC GAAGTGCAAG 
TACTGCTGGA GTTCCGAGGA GGGATCGCCA ATCATGAGCG TGGAGACGGT TCATGAGATC
GTGGTATGGC TCCAGGATAT CCCCCGGGAT CGGGTAACCT TTACCTTCCA CGGGGGGGAG
CCGCTCCTTG CCGGGGCAGA TTTTTACCGC CAGGCACTGC CTCTCCTCGC CGACGGCCTT
GCCCAGATGC ACCCGGAGTT TGCGATGCAG ACCAATCTCT GGCGCATGAC CCCGGAGATC
GCTGATGCCC TCGCAGCTTA CCGGGTGCCA ATCGGGTCGA GCATCGACGG GCCCGAGGCG
ATCACCGATT CCCAGCGGGG GGACGGGTAC TACCAGAAGA CGATGAAGGG GTACGGGATC
GCAAAGGCCC ACGGTCTCGA TGTGCGGTTC ATCTGCACGT TCACGAACAA GTCGGTGAAG
AATAAGGAAG AGATCTTCCA ATTCTTCCTC GACAAGGGCT TCACCCTGAA GCTGCACCCG
GCGCTTCCCT CGATCCGGTC GGAGAACCCG AAGGAGTGGG CGCTGGAACC TTCCGAGTAC
GGCGAACTCC TCGTGTACCT CCTCGACCAG GCGCTCGACC ACATGGGTGA GATTGAGGTA
ATGAACATTA ACGATCTCTG CCGGTGCGTT TTTGTCCGCC GGGGAAGCGT CTGCACGTAC
GTGGACTGCA TGGGGAGCAC GTTTGCCATC GGACCCGACG GAAGCATCTA CCCCTGCTAC
CGGTTTGTCG GGATGCCGGA GTACGTGATG GGAAATGTTG CAGACCGCCC GTCTCTGGAA
ACCCTGATGG AATCTCCTGC GGGAAAACTG ATGCTTGCGT TCAAGGACTA CGTTGATACG
GCCTGCGCAG AATGCAAACA TATCAAGTAC TGCCGGGGCG GGTGCCCGTA CAACGCGATG
GCCCCGACAC CGGGCAGGAT CGAGGGCGTG GACCCGCACT GCACGGCGTA TACGCGGATA
TTCGACGAGA TCGGCGACCG GCTCAACAGG GAGATGTTTG CCGCACCACC CATGGAGATG
GGCGGATTCG GGATGCCATC CGTGCAGAAG CCGGCGAAAA AGGAAAAACC CGGCGTGATG
ACGCTGATGC GCAGGAACGC AATGCGGTAA
 
Protein sequence
MKNPFHVMII PTLGCPSKCK YCWSSEEGSP IMSVETVHEI VVWLQDIPRD RVTFTFHGGE 
PLLAGADFYR QALPLLADGL AQMHPEFAMQ TNLWRMTPEI ADALAAYRVP IGSSIDGPEA
ITDSQRGDGY YQKTMKGYGI AKAHGLDVRF ICTFTNKSVK NKEEIFQFFL DKGFTLKLHP
ALPSIRSENP KEWALEPSEY GELLVYLLDQ ALDHMGEIEV MNINDLCRCV FVRRGSVCTY
VDCMGSTFAI GPDGSIYPCY RFVGMPEYVM GNVADRPSLE TLMESPAGKL MLAFKDYVDT
ACAECKHIKY CRGGCPYNAM APTPGRIEGV DPHCTAYTRI FDEIGDRLNR EMFAAPPMEM
GGFGMPSVQK PAKKEKPGVM TLMRRNAMR
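
The gene and protein sequences can be cross-checked by translating the nucleotide sequence. This is a minimal sketch, not the tool the database used. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two differ only in permitted start codons), and it assumes the sequence is given 5' to 3' on the coding strand. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
from itertools import product

# Codon-to-amino-acid mapping in NCBI order (bases T, C, A, G).
# Table 11 shares these assignments with the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for 10-base blocks."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First and last lines of the gene sequence listed above.
first = "ATGAAAAACC CGTTCCACGT GATGATCATC CCGACGCTGG GCTGCCCCTC GAAGTGCAAG"
last = "ACGCTGATGC GCAGGAACGC AATGCGGTAA"
print(translate(first))  # MKNPFHVMIIPTLGCPSKCK
print(translate(last))   # TLMRRNAMR
```

Both fragments match the start and end of the listed protein sequence. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the reported GC content.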