Gene Mboo_2355 details

Gene Information

Locus tag: Mboo_2355
Symbol:
ID: 5411884
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Methanoregula boonei 6A8
Kingdom: Archaea
Replicon accession: NC_009712
Strand:
Start bp: 2419988
End bp: 2421874
Gene Length: 1887 bp
Protein Length: 628 aa
Translation table: 11
GC content: 56%
IMG OID: 640869611
Product: type II secretion system protein E
Protein accession: YP_001405512
Protein GI: 154151894
COG category: [N] Cell motility; [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG0630] Type IV secretory pathway, VirB11 components, and related ATPases involved in archaeal flagella biosynthesis
TIGRFAM ID:
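
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length; the function names are illustrative and are not part of the database.

```python
# Values copied from the Gene Information fields above.
START_BP = 2419988
END_BP = 2421874


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS ending in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1887
print(protein_length(length_bp))  # 628
```

Both results match the reported Gene Length (1887 bp) and Protein Length (628 aa).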


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.692167
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 31
Fosmid unclonability p-value: 0.583317
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGCGCAC AGATAAAGAT ATCTGTCATT CTGCGACGAG TCGGTGCCGG AATCGCCGCA 
AACGCCCTCC CGTATCGCTG CCCGTTTGGC CGACGTTTCC TTTGGAAACA CTATTTATAT
GAGTGTCTGC CGAACTATAA ACAGAAATAC GGGGTACCAC GCGAGGATGG ACGCATGAAG
GTTTCAGATC TTTTTACCAG ACTCCGGGAA TACCGGTCCC GGGCCGGGAT GGCCGTGAAG
GGTGCAGGTA CCCCGCAACC TTTGGCGGGA GATACCGGCA CCGCCCTGCA GTCATCCGGC
ATCCGCGAGG CGAGCTACCG GCATTATTTC CGGTTCCTTA AAGGCCGTGA TAAACTGCCA
GAGGAGGAGT ACGATCCTGC ACGGCACGGC CCGCTTGTGA AGGCCGAGAT CCCCGCCGGC
TACGATCTTC TCGACCAGTA CTGGATTGAA GAGGGCCTGA CCCTTGTGTA TATCGCTCTC
AACCGGAAGA CCAACCAGAC CGAGTACCTC CTCTTCGAAC CGCCGCTCTC GGAGTTTGAG
TACGAGCTGC TCGAACGCCT CCATGAAGAC CTCCTTGATG TCCTGATCCT GACCTCAGAT
GAGGTAAAAA CAGATCGGAA GAAAATCCTC CTCTCAAAGG TGCATGGCCT CCTCGACGAC
TATGGCCTTG TCCTTGATGA GTCCGCACAT TTCAAGATCG AATACTACCT GATCCGGAAT
TTCATCGGCT GGTCGCGTAT CGATCCGCTC ATGAAAGATC CCAACCTCGA GGATATTTCA
TGCGATGGCA GCCGGATCCC GATCTTCCTG TACCACCGGA AACACCGGAA TATCAAGACC
AACATCGCGT TCGAGGCCAC GGTCTTGAAT TCGCTTGCAA TCACCCTTGC CCAGCGTTCG
GGAAAACACA TCTCTACCGG CTCCCCCCTG CTGGATGCCA CCCTTCCGGA CGGTTCGCGG
CTCCAGCTGA CGCTCGGGAC CGAGGTAACT ACCCGGGGAA CCTCCTTTAC CATACGCAAG
TTCCGCGAAG ACCCGTTTTC TCCCATTGAG CTTATGGAGT ACGGGACATT CTCCAGCGAC
GAGCTGGTGT ACTTCTGGCT TGCCATTGAA AACGGTATGA GCCTGCTCTT TATCGGGGGA
ACTGCATCAG GGAAGACCAC ATCGCTCAAT GCAGTCTCAC TGTTTATCCC CCCCATTGCC
AAGGTGGTCA GCATCGAAGA TACGCGGGAG ATCACTCTCT ATCATGACAA CTGGATCGCT
AGCGTCACCC GCGAGGCGCT CACTGAGGGC GGCAATGCCA TCAGCATGTT CGATCTTCTC
CGGGCTGCAA TGCGGCAGCG GCCGGAGTAC ATTCTTGTCG GGGAAGTCAG GGGTCCTGAG
GCACAGACGC TTTTCCAGGC AATGAATACC GGCCACACCA CGTTCTCCAC CATGCATGCA
GGAAGCATCG ATGCGGCCAT CCACCGTCTG GAGAGCGCGC CGCTCAACGT GCCGCGTAAC
ATGGTCCAGG CATTGAACGT CATTTGTGTT CAGGCCCTCA TCTACCGCGG TACAGAAAGG
GTGCGGCGGG TCCAGGAGGT TGTCGAGATT GCCGGAATCG ATCCTGCTAC CGGGAACCTC
CGGGTCAACA ATGTCTTTCA GTACGATCCA GTCCATGACC GGACCATCTA TACGGGTCGA
TCCCAGATTT ACAGCATGAT CGCCACAAAA CGAGGATGGA CACGGGAAGA ACTCGATTAT
GAGATCACCG TAAGGAAAAG CCTCCTCGAT GCCATGCATG CGCAGGGGAT CCGCGACTAC
ATATCAGTTG CCTCACTCTT CCATAATTAT AATATCAACC GTGCGGACGT ACTCGCCCAC
AACGACGATC TCAGACAGGT ACTCTGA
 
Protein sequence
MGAQIKISVI LRRVGAGIAA NALPYRCPFG RRFLWKHYLY ECLPNYKQKY GVPREDGRMK 
VSDLFTRLRE YRSRAGMAVK GAGTPQPLAG DTGTALQSSG IREASYRHYF RFLKGRDKLP
EEEYDPARHG PLVKAEIPAG YDLLDQYWIE EGLTLVYIAL NRKTNQTEYL LFEPPLSEFE
YELLERLHED LLDVLILTSD EVKTDRKKIL LSKVHGLLDD YGLVLDESAH FKIEYYLIRN
FIGWSRIDPL MKDPNLEDIS CDGSRIPIFL YHRKHRNIKT NIAFEATVLN SLAITLAQRS
GKHISTGSPL LDATLPDGSR LQLTLGTEVT TRGTSFTIRK FREDPFSPIE LMEYGTFSSD
ELVYFWLAIE NGMSLLFIGG TASGKTTSLN AVSLFIPPIA KVVSIEDTRE ITLYHDNWIA
SVTREALTEG GNAISMFDLL RAAMRQRPEY ILVGEVRGPE AQTLFQAMNT GHTTFSTMHA
GSIDAAIHRL ESAPLNVPRN MVQALNVICV QALIYRGTER VRRVQEVVEI AGIDPATGNL
RVNNVFQYDP VHDRTIYTGR SQIYSMIATK RGWTREELDY EITVRKSLLD AMHAQGIRDY
ISVASLFHNY NINRADVLAH NDDLRQVL
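
The sequences above are printed in blocks of ten characters, so the spaces and line breaks need to be stripped before any analysis. The following is a minimal sketch of how the gene sequence could be cleaned, translated, and checked against the protein sequence. It assumes an unambiguous A/C/G/T sequence and uses the standard codon assignments, which translation table 11 shares for elongation (table 11 differs in which start codons it allows). The helper names `clean_sequence`, `translate`, and `gc_percent` are hypothetical.

```python
# Standard codon assignments, listed in T, C, A, G order for each position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(text: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the result."""
    return "".join(text.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean_sequence(dna)
    residues = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]  # KeyError on ambiguous bases
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the cleaned sequence."""
    dna = clean_sequence(dna)
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 30 bases and the final line of the gene sequence shown above.
first_bases = "ATGGGCGCAC AGATAAAGAT ATCTGTCATT"
last_bases = "AACGACGATC TCAGACAGGT ACTCTGA"
print(translate(first_bases))  # MGAQIKISVI
print(translate(last_bases))   # NDDLRQVL
```

The two outputs match the first ten and last eight residues of the protein sequence. The final line can be translated on its own because it begins at base 1861, which is in frame. Passing the full gene sequence to `gc_percent` should give a value comparable to the reported GC content.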