Gene Mboo_1333 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMboo_1333 
Symbol 
ID5411600 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Methanoregula boonei 6A8 
KingdomArchaea 
Replicon accessionNC_009712 
Strand
Start bp1359290 
End bp1360510 
Gene Length1221 bp 
Protein Length406 aa 
Translation table11 
GC content55% 
IMG OID640868565 
Producthypothetical protein 
Protein accessionYP_001404494 
Protein GI154150876 
COG category[T] Signal transduction mechanisms 
COG ID[COG3292] Predicted periplasmic ligand-binding sensor domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.554888 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value0.392861 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGCCGC CCATGAGATC TGAATATTTC CTTCTTATCC TCCTCATCCT TTGCGGTCTG 
GTTCCGGTTG TTTCAGGAGC CGGGGCGGTG GGTACCGGTG CTGCCGGGCA GGAATATTCT
GCAAATATCA CCCTGTTTCG CCCTGCCTCT GATTCTGTCC CGTCCACACA GGTAAATGCT
ATTATCAACG GCCTGCAGGG AGAAGTCCTG CTGGGAACCC CTCTGGGCCT TTCGAGCTAC
GATGGTACGT GGAGCACCCG GCATATCGAT CGGAGCAATC TTTCTGAAGG TCTTTTAGAT
AATTTTGTTA CTGCACTCGC TTATGACAGT TCCGGCCATC TCTGGATAGG CTACGCTGGT
GGGATCCAGA TCTACGATGG CCGTACTTAC CAATTGATCA CTGACCAGCA GCTCTTAAAG
AGCCTCCAAA TACGGGCTCT GCAGCGCTGG AATGATGGGA TGTGGATCGC TACCGGGAAC
TCTGGCCTGA GCCGGTACTA TAACGAAACA TGGACCTGGT ATGCCCCGTA CTCTCCCGGG
GGCCCAGGGT TTTACGAGGC TGACAGTATG ACCCTTGACT CGGCCGCAGA CACCCTTCTT
GTGGGAACCC TCAAAGAAGG TCTCTGGGCC GTGAGCGAAG CCAACAACAC AATCGTATTT
ACCCGGATCC AGAACCGGGA CGATCCCTAT GGCCTGCTTG GCCACGTCAG GAAAGATCCG
CTGGGCGGGG CGTACTTCTT TAATGAAACA GATGTGGCTC ACTACAGTCA GGCCGGGGGA
TTTACCCCGG TTCTCTCATC TGGTGATTTC TACGGGGGGC CGTATATCAT TGACGATGTG
GCAGCCGGCC CGGGAGGGGC GGTTTATGTG GCAACGGAAA ACGGCATCTA TGTATGGCAG
AACGGGGCAG TCACCCGACA TCTGGGAACG TTTGAAGGAT TTAGTAGTGC CGCGCACAAC
GTCAAGATGG TTTTTGTCGA TGCCCGGGGC CGGCTCTGGT TCTCCACCAT GGATGTTGTA
GGATATTATA CCGGGGATAT CTCAACCGCT CCCCCGATAT CGGTTGAAAC CATGACCCCG
ACGCCAACAC CGGTCCTGCC CACCAGCATC CCGACCACGG GACCGGTGGT AACGGCGACC
CCTGCGCCCA GTCCCTCGTT TATCGACAAT ATCAGGACTT TTCTTGGCGG GATCTTCGGC
TTTTTACCTC ATTCCCGCTA A
 
Protein sequence
MAPPMRSEYF LLILLILCGL VPVVSGAGAV GTGAAGQEYS ANITLFRPAS DSVPSTQVNA 
IINGLQGEVL LGTPLGLSSY DGTWSTRHID RSNLSEGLLD NFVTALAYDS SGHLWIGYAG
GIQIYDGRTY QLITDQQLLK SLQIRALQRW NDGMWIATGN SGLSRYYNET WTWYAPYSPG
GPGFYEADSM TLDSAADTLL VGTLKEGLWA VSEANNTIVF TRIQNRDDPY GLLGHVRKDP
LGGAYFFNET DVAHYSQAGG FTPVLSSGDF YGGPYIIDDV AAGPGGAVYV ATENGIYVWQ
NGAVTRHLGT FEGFSSAAHN VKMVFVDARG RLWFSTMDVV GYYTGDISTA PPISVETMTP
TPTPVLPTSI PTTGPVVTAT PAPSPSFIDN IRTFLGGIFG FLPHSR