Gene Mboo_1495 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMboo_1495 
Symbol 
ID5410415 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Methanoregula boonei 6A8 
KingdomArchaea 
Replicon accessionNC_009712 
Strand
Start bp1550485 
End bp1551525 
Gene Length1041 bp 
Protein Length346 aa 
Translation table11 
GC content57% 
IMG OID640868730 
Productstage II sporulation E family protein 
Protein accessionYP_001404656 
Protein GI154151038 
COG category[K] Transcription
[T] Signal transduction mechanisms 
COG ID[COG2208] Serine phosphatase RsbU, regulator of sigma subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.520612 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value0.134288 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGATGA ACGGGATGAA GTGCATCCAT ATTGGGGAGA TGAGCGGTAT TGTCGAAGCC 
CGGCAGACAG CGATGAATTT CTGCAAGACA AAGGGATTCA GTGAGGTCCT TTCCGGTAAC
GTTTCCCTCA TCGTGACGGA ACTCGCTACA AACCTCGTCA AACATGCCGG TTCGGGAGAA
CTTCTTATTC GCTCCTGTAC CGAAGGAGAT GTTTCCGGAA TCGAGTGCCT CGCGCTTGAC
CGGGGCATGG GGATCACAGA TATCGGCGGA AGCCTGCGGG GCAAATATCC TGCCACTGCT
GGTAAGGAGA CCGGCCTTAT GGCAGTCCGC CGTCTCTCTT CCCTGTTTGA TCTCTATTCC
ATCCCGGAGA AGGGGACGGC CGTCCTCTCC CGGATCGAAA CCAGATCTCC TGAACCAGTG
TCCGATCCAC CGCACCCCCT GTGCTCAACC CGACCCGAGG TCGGCGTTGT TTGTCTTCCA
CTCGTGCCGG ATGAACCCTG CGGCGACGGA TGGGAGGTGA TCCGGTCGGA GAACAGAACG
GTAATCCTTG TTATCGACGG ACTGGGTCAT GGACCGGAAG CCTCGAAAGT TCAGGTCGAA
GCCATCAGGG TCTTCCGGAA GAACCCGGCG GGTGAACCGG TGGAAATACT CAGGAATCTC
CACGCTGCAC TCAGGAACAC CCGAGGGGGT GTCGCGGCGA TTGCCGCTAT CGATGAGGAG
CGAGGCAGGG TCACGTATAC AGGCGCCGGC AATATTTCCG GCCGGATCAT CACCGGTCAT
GTTACCAATA AAATGGTTTC TCTGAGCGGA ACTGTAGGTG ACCAGATCAG GCAATTCAGT
GAGTTCTCGT ATCCCTGGGT AACGGAGGCT CTCCTGATTA TGTACTCCGA TGGGCTCACC
ATGCAGTGGG ATCTTGAGGA TTACCCGGGA CTCGAGCGAA AGCACCCGGC ACTCATTGCC
GGTGTTCTGT ACCGGGATCA TACGCGGGGA ACGGATGATG TGACGGTCCT TGCCGTAAAA
CTCGTCGGGG AGACTTCATG A
 
Protein sequence
MAMNGMKCIH IGEMSGIVEA RQTAMNFCKT KGFSEVLSGN VSLIVTELAT NLVKHAGSGE 
LLIRSCTEGD VSGIECLALD RGMGITDIGG SLRGKYPATA GKETGLMAVR RLSSLFDLYS
IPEKGTAVLS RIETRSPEPV SDPPHPLCST RPEVGVVCLP LVPDEPCGDG WEVIRSENRT
VILVIDGLGH GPEASKVQVE AIRVFRKNPA GEPVEILRNL HAALRNTRGG VAAIAAIDEE
RGRVTYTGAG NISGRIITGH VTNKMVSLSG TVGDQIRQFS EFSYPWVTEA LLIMYSDGLT
MQWDLEDYPG LERKHPALIA GVLYRDHTRG TDDVTVLAVK LVGETS