Gene Mboo_1906 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMboo_1906 
Symbol 
ID5410863 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Methanoregula boonei 6A8 
KingdomArchaea 
Replicon accessionNC_009712 
Strand
Start bp1968368 
End bp1970617 
Gene Length2250 bp 
Protein Length749 aa 
Translation table11 
GC content54% 
IMG OID640869144 
Producttype II secretion system protein E 
Protein accessionYP_001405064 
Protein GI154151446 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG0630] Type IV secretory pathway, VirB11 components, and related ATPases involved in archaeal flagella biosynthesis 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.124978 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value0.483962 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATGCCG GTACCCCTCC TCAAGAGGTA GTTCCGGAAT CCCTCCCCAC CCCGGATGAG 
CCCGTACAAT TCCCGAAAGA AGCAACCGGT AAACCTGAAG AGCTGGATAC TTTCGATGAG
CTTCTCGAAG ATGTCCCCCC GGCGCAAGAA GGTGTAGAAC CCCCGGTACT AAGCGGGGAC
GATGTTCGCG CGGCACTGGG CAAGATCATG GGACAACCGG AGATGGTCCC CCCGTCGGCG
CACTCCCCGG AGCCTCTTAC AGCCTCCCCG GAATCAGAGT TACCCCCGGA ACAAGTTGTC
GAAATTACTG AAGAGGAAGA TCCCCGCGAG AATCGCACGT CCCGGTTTTT CAAGAAAAAG
AAGGCCGAAA AAGTGGGAGC CCCGACAGAT CAGATCGGGA AATCCGCGGT GACCGAATCT
GATGAGCGGA TTATTTCCGC TGACCAGATA TCCGAATTTT CCGGTCTTAT TCTCCCCAAA
GGTGCGACAT TCCATGTTGA TGAGATCAAT CTTCATGGCC GGATGCAGGC GTTTGAGTCC
ACCGGAACCG GTAGCCTTCC CCCGGAACTT GCCGAGATCT GGAAAAGGGA GTTTTCCACA
TCCGGTTTCA AGGACCTTGA ATCAGAACTG ACCCTTGCAG ACAAGGGGAC GGATAAGGGG
ACAGAAAAGA AATCCTGGTT TAGTGCACTC TTCAGCGCGA TCCGGTCGGA GAACGTGGAA
TACGATCCCA AGATCCATGG ACCTCTCGTC GATCTCACCT TTGAGCCAAA ACCCGGTATC
GAAGTGATGG ATACGTACCC CGTCAACGAT CCCTATGCAT ATGTCCGGGT CATCTATGAC
CATTCCACCC ATGAATATAC CTACCAGGTT CTCGAACCTG TACTCTCGGA GCCAGAGAAA
GACCTGTTAA AGGAGTTAAA GGAACGCCTT TTTGAGATCC TTGACATCAA CACCAAGGAC
ATTTCCAAAG AGGAGGCCCG CAGCCAGCTC CGTCGTTCGG TTGATGAGAT TCTGGCGGAT
TACGGAATCA AGCTCAACCC GGTGGAACGT GAGAAGATCC TCTATAACAT GCACAAGGAC
TTTCTGGGTG ATGGCATGAT CGATGCGATC ATGCATGACA AGTATATCGA GGATATCTCC
TGCGACGGGG TCAACACCCC TCTCTTTGCC TTCCATGCCA ATTACGAGTC AATGAAGACC
ACCCTGATGT ACCACAATGC CGAGGAGCTC GACTCCTTTG TCACCAAGCT CGCACAAAGG
GCAGGCAAGT ACATCTCCAT TGCCGAGCCC ATTCTCGATG CGACCATGCA GGATGGATCC
CGTATCCAGA TGACACTGGG GCAGGAGGTC ACTGCGCACG GCTCGACATT CACTATCCGT
AAGTTCAAGG ACGAGCCAAT TACCCCTACC GATCTCATTG AATGGCACAC CTTTGCCCCG
CTCTCGATCG CTTACATATG GCTCTGCGTC GAGAATGGCA AGTCCGCTAT CTTTGCCGGT
GGGACCGCGT CCGGGAAGAC TACTGCCCTC AATGCAATCT CGCTTTTCAT TCCCCCGATG
GCAAAGATTG TCTCTCTGGA GGATACCCGT GAAGTCAAGC TTCCTCACCC GAACTGGATC
CCGAGTGTCA CGCGTGACTC ATTTGACACC GCCGGCAGAG GAGAGATCAA CATGTATGAG
TTGCTCCGTG CGGCTATGCG TCAGCGGCCC GAGTACATCA TCGTCGGTGA GGTTAGAGGT
AAGGAATGCC AGACACTTTT CCAGGCAATG AGCACGGGCC ACGTTACCTA CTCCACAACT
CACGCCGACT CTGTTGCGAG CGTCGTGCAC CGTATCGAGA ACCCGCCCAT GGACGTACCA
CGGAATATGC TCTCCGCGCT CGATTTCATC TGTGTCCAGG TACAGGCCCG GATGGGCGGG
AAACGGATCC GCCGGAACAA GCAGATTGTT GAGATCCTCG ATATCGATCC GCGGACAAAC
GAACTGATCA CAAACGAGGT CTTCCGCTGG CGATCGGCAA CCGATGAGAT CACCTACTCG
GGGAAATCTT ACATCCTTGA GGACATCATG GAGGCGCGGG GCTGGAACGA GAACCGTATG
CGGGAAGAAC TCAAACGCAG GCAGGAGGTG CTGGAGTGGA TGCGTATCAA GAAGATCCGG
CACTACAAGG ATGTAGCAAA GATCCTTATC TCGTACTTCC GTGAGCCCGA AGTGGTTGTA
GAGCGCGTAA GGAAGGATCT ATATGAATAG
 
Protein sequence
MNAGTPPQEV VPESLPTPDE PVQFPKEATG KPEELDTFDE LLEDVPPAQE GVEPPVLSGD 
DVRAALGKIM GQPEMVPPSA HSPEPLTASP ESELPPEQVV EITEEEDPRE NRTSRFFKKK
KAEKVGAPTD QIGKSAVTES DERIISADQI SEFSGLILPK GATFHVDEIN LHGRMQAFES
TGTGSLPPEL AEIWKREFST SGFKDLESEL TLADKGTDKG TEKKSWFSAL FSAIRSENVE
YDPKIHGPLV DLTFEPKPGI EVMDTYPVND PYAYVRVIYD HSTHEYTYQV LEPVLSEPEK
DLLKELKERL FEILDINTKD ISKEEARSQL RRSVDEILAD YGIKLNPVER EKILYNMHKD
FLGDGMIDAI MHDKYIEDIS CDGVNTPLFA FHANYESMKT TLMYHNAEEL DSFVTKLAQR
AGKYISIAEP ILDATMQDGS RIQMTLGQEV TAHGSTFTIR KFKDEPITPT DLIEWHTFAP
LSIAYIWLCV ENGKSAIFAG GTASGKTTAL NAISLFIPPM AKIVSLEDTR EVKLPHPNWI
PSVTRDSFDT AGRGEINMYE LLRAAMRQRP EYIIVGEVRG KECQTLFQAM STGHVTYSTT
HADSVASVVH RIENPPMDVP RNMLSALDFI CVQVQARMGG KRIRRNKQIV EILDIDPRTN
ELITNEVFRW RSATDEITYS GKSYILEDIM EARGWNENRM REELKRRQEV LEWMRIKKIR
HYKDVAKILI SYFREPEVVV ERVRKDLYE