Gene Mboo_2236 details


Gene Information

Locus tag: Mboo_2236
Symbol:
ID: 5410364
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Methanoregula boonei 6A8
Kingdom: Archaea
Replicon accession: NC_009712
Strand:
Start bp: 2305047
End bp: 2306264
Gene Length: 1218 bp
Protein Length: 405 aa
Translation table: 11
GC content: 63%
IMG OID: 640869488
Product: phosphoesterase domain-containing protein
Protein accession: YP_001405393
Protein GI: 154151775
COG category: [L] Replication, recombination and repair
COG ID: [COG0608] Single-stranded DNA-specific exonuclease
TIGRFAM ID:
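
The coordinates and lengths listed above are mutually consistent, which a short sketch can confirm. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the stop codon; the page states neither, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 2305047
end_bp = 2306264

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1218 405
```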


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 40
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCGCTTG AAGAATCCGC TAAAGTCGTC GCAGAGCAGA TCCGGCGCCA GCAGTTCGTA 
GAGGTATACG CCCACCACGA CGCTGATGGC ATTGCAGCAG GGGCTATCCT CTGCCACGCG
ATGCTCCGGG CCGGGATCCG GTTCAGGCTC CGGATCTGTG CGGATATCTC AGCGGCGGAG
CTCTCGCACG ATGCGGTCTC GCTCCTGTGC GACCTGGGAT CCGGTAAAGA AGACCTGCCC
CCGGAGACGA TCGTGGTTGA TCACCACATC CCGCTCTTTG GCGGGCAGTT CCACGCAAAC
CCGCGCCTGG AAAAGATCGA TGGCGACCGG GAGCTCTCTG CGGCCGGCAT GGCCTACATC
GTTGCTCAGG AAATGGGGGA CAACCGGGAC CTTGCCGGTC TCGTAATCCC CGGCATCATC
GGCGATGGCC AGGCATTTGC CGGCAGGAAC CTCGCGATCT TCAACGAAGG TGTGGCAAAC
GGGATTATTG TCCCGGACAA GGGGATAACC CTTGCCGGCC GTGACATGGC GGAACGGTGG
CTGCTTGCCA CCCGTCCTTA TCTCCCCGGG ATCAGCGGCA GCGATTCAGC CGTTGCTGAC
CTGCTCGCTT CCACTCAGGA AGACAACGGG TTGAAACCGG ATGTTCTGAT CAGCCTTGCG
GTGCTCACTT CTGTCCCGGA GGCGTCGGCA GAGGGTCTTA TGCTGCTCTA CGGCGACACC
ATGCACCTCC AGCGCGAGGT TATCGAGGAC GCCCACGCGC TTACCGCTGT CATCGATGCC
TGCGGCAAAT CCGGCTGCGG CGACCTTGCG GTGGCGCTCT GCCTGCGTTC ATCTGTGGAG
ATCACAGAGG CTTGGGAGGT GACCCGTAAG CACCACCTCG CGGTGATTGC AGCGCTTGGC
GAGGTCCGTC CGGTACAGGA GGGTTGTGCA GTCTACGAAT GCGGCAATGC CACCCTTTCA
AGCGACGTTG CCGATGTGCT TGCCCGGGAC CGGGTGCAGC AGGCACCGGT GCTCGTGTAC
GCACGAACAG GGGAGGAGTG CCGGATATCG ACCCGCCTAC CCCATGGCAC CACTGCCAAT
CTCGGGCTGC TTGTCCGGGA ACTCGCCGCG GCTTGCGGTG GGAACGGCGG GGGCCACCAC
AGCCGGGCAG GGGCCACGAT CCCCTGCAAG CGACTCGATG CCTTCGTGAA GGGATGGCAG
GAGGGGCTTG CTGCATGA
 
Protein sequence
MSLEESAKVV AEQIRRQQFV EVYAHHDADG IAAGAILCHA MLRAGIRFRL RICADISAAE 
LSHDAVSLLC DLGSGKEDLP PETIVVDHHI PLFGGQFHAN PRLEKIDGDR ELSAAGMAYI
VAQEMGDNRD LAGLVIPGII GDGQAFAGRN LAIFNEGVAN GIIVPDKGIT LAGRDMAERW
LLATRPYLPG ISGSDSAVAD LLASTQEDNG LKPDVLISLA VLTSVPEASA EGLMLLYGDT
MHLQREVIED AHALTAVIDA CGKSGCGDLA VALCLRSSVE ITEAWEVTRK HHLAVIAALG
EVRPVQEGCA VYECGNATLS SDVADVLARD RVQQAPVLVY ARTGEECRIS TRLPHGTTAN
LGLLVRELAA ACGGNGGGHH SRAGATIPCK RLDAFVKGWQ EGLAA
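
The gene and protein sequences are printed in blocks of ten characters, so the spaces and line breaks must be removed before any computation. The following is a minimal sketch, not the database's own method, of how the GC content and the translation could be recomputed from the gene sequence. The function names `clean`, `gc_fraction`, and `translate` are hypothetical. The codon table is built in the usual T, C, A, G order; as far as the amino acid assignments go, translation table 11 matches the standard code and differs only in which start codons are permitted, which this sketch does not model.

```python
from itertools import product

# Codon table in NCBI ordering (bases T, C, A, G; first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks used on the page."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence above.
first_blocks = "ATGTCGCTTG AAGAATCCGC TAAAGTCGTC"
print(translate(first_blocks))  # MSLEESAKVV
```

Passing the full gene sequence to `translate` should reproduce the protein sequence listed above, and `gc_fraction` on the same input can be compared against the reported GC content.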