Gene Mboo_2106 details


Gene Information

Locus tag: Mboo_2106
Symbol:
ID: 5410640
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Methanoregula boonei 6A8
Kingdom: Archaea
Replicon accession: NC_009712
Strand:
Start bp: 2178388
End bp: 2179389
Gene Length: 1002 bp
Protein Length: 333 aa
Translation table: 11
GC content: 59%
IMG OID: 640869351
Product: flap endonuclease-1
Protein accession: YP_001405263
Protein GI: 154151645
COG category: [L] Replication, recombination and repair
COG ID: [COG0258] 5'-3' exonuclease (including N-terminal domain of PolI)
TIGRFAM ID: [TIGR03674] flap structure-specific endonuclease
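
The coordinate and length fields above are related by simple arithmetic, sketched here in Python. The variable names are illustrative and not part of the record. The sketch assumes the start and end coordinates are 1-based and inclusive, and that the gene length counts the stop codon while the protein length does not.

```python
# Values copied from the record above.
start_bp = 2178388
end_bp = 2179389

# Assumption: coordinates are 1-based and inclusive, so add 1.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted
# in the protein length.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, protein_length)  # prints: 1002 333
```

Both results agree with the Gene Length and Protein Length fields listed in the record.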


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 49
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGTGTAG CACTACGGGA TATCCTTGCC GATTACAAGA CCCCGGTCAC CTGGGAGGGA 
CTCTCCGGGG TGGCGGCGGT TGATGCAAAC AACACGCTTT ACCAATTCTT AACCATCATC
CGACAGCCGG ACGGAACGCC GCTGATGGAC GCCAAAGGCC GGGTCACCTC CCATCTCTCG
GGGATACTTT TTCGGATGGT CAACTTCCTT GAAAAAGGGA TAAAGCCGGT CTTTGTCTTT
GACGGGAAAC CGCCCGAGCT CAAGCAGGAA ACGAACGCGG AGAGAAAGAA ACTCCGTGAC
GAGGCGGGGG AGAAGTACAA AGAGGCAGTT GAGCGGGGCG ATGAGGAGGA GGCATACAGG
CAGGCCCGGT CAGCGACCCG GGTGGATGAA ACCATTATTG CAACCTCAAA GGAGCTCCTC
GATCTCCTGG GAATTCCGTA CGTGCAGGCT CCTTCAGAAG GCGAGGCGCA GGCGGCATTC
ATGGTGCAGC GGGGCGATGC ACGCTTTGCA GTCTCGCAGG ACTACGATAC CCTGCTCTTT
GGCGCACCGC TCCTGATGCG CAACCTTACC GTGAGCGGGA AGCGCAAGAT CCGGGGCCGA
GCCGTAACTG TCAATCCCGA ACGCCTCGTG CTTTCCGAAG TGCTCTCCGG CCTCTCCCTG
ACCCGGGAGC AGCTTGTGGA AGTCGGCATC CTGGTCGGAA CCGATTTTAA CCCGGGTGCG
GCCGGTGTGG GGGCAAAGAC CGCACTCAAG ATTGTAAAGA GCGGGGGGTT TGCCCAAAAA
CTCGCCGAGA AGTGCCCGGG CTTTGACCCG GCGCCAGTGG CCGACTTTTT CCTGAAGCCG
CCGGTGACAA CGGAGTACGA GCTTGCGTGG GGCCACCCGT GCGTGGAGGG GATCAAAAAG
ATGCTTTGCG ACGGGTACGA CTTTGCCCCG GAACGGGTTG ATGCGGCACT CGAACGCTAC
TCGGCAAAGG CAGGTCAAAA GACGCTGGAA AGCTTTTTCT AA
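
A GC content figure like the one in the record can be recomputed from a sequence laid out in blocks of ten bases, as above. The following is a minimal sketch; `gc_content` is a hypothetical helper name, and the exact rounding used by the source database is not stated in the record.

```python
def gc_content(seq: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence.

    Whitespace (the 10-base block spacing and line breaks) is ignored.
    """
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return gc / len(bases)


# First two 10-base blocks of the gene sequence, used only as a small demo.
example = "ATGGGTGTAG CACTACGGGA"
print(f"{gc_content(example):.0%}")
```

Passing the full gene sequence as one string gives a value that can be compared with the GC content field of the record.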
 
Protein sequence
MGVALRDILA DYKTPVTWEG LSGVAAVDAN NTLYQFLTII RQPDGTPLMD AKGRVTSHLS 
GILFRMVNFL EKGIKPVFVF DGKPPELKQE TNAERKKLRD EAGEKYKEAV ERGDEEEAYR
QARSATRVDE TIIATSKELL DLLGIPYVQA PSEGEAQAAF MVQRGDARFA VSQDYDTLLF
GAPLLMRNLT VSGKRKIRGR AVTVNPERLV LSEVLSGLSL TREQLVEVGI LVGTDFNPGA
AGVGAKTALK IVKSGGFAQK LAEKCPGFDP APVADFFLKP PVTTEYELAW GHPCVEGIKK
MLCDGYDFAP ERVDAALERY SAKAGQKTLE SFF
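
The protein sequence can be checked against the gene sequence by translating codon by codon. The sketch below is one way to do this without external libraries. It assumes the codon-to-amino-acid assignments of translation table 11 match the standard genetic code (table 11 differs in its permitted start codons), and `translate` is a hypothetical helper name.

```python
from itertools import product

# Codon table in the conventional TCAG order; '*' marks a stop codon.
# Assumption: the standard codon-to-amino-acid mapping applies here.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    bases = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(bases) - len(bases) % 3, 3):
        aa = CODON_TABLE[bases[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence above.
print(translate("ATGGGTGTAG CACTACGGGA TATCCTTGCC"))  # prints: MGVALRDILA
```

The output matches the first ten residues of the protein sequence. Translating the full gene sequence the same way allows a residue-by-residue comparison with the listed protein.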