Gene Mboo_1052 details

Gene Information

Locus tag: Mboo_1052
Symbol:
ID: 5410186
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Methanoregula boonei 6A8
Kingdom: Archaea
Replicon accession: NC_009712
Strand:
Start bp: 1036386
End bp: 1037876
Gene Length: 1491 bp
Protein Length: 496 aa
Translation table: 11
GC content: 53%
IMG OID: 640868278
Product: hypothetical protein
Protein accession: YP_001404213
Protein GI: 154150595
COG category: [S] Function unknown
COG ID: [COG2326] Uncharacterized conserved protein
TIGRFAM ID:
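
The coordinate and length fields above can be cross-checked with simple arithmetic. The Python sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 1036386
END_BP = 1037876
GENE_LENGTH_BP = 1491
PROTEIN_LENGTH_AA = 496


def span_length(start: int, end: int) -> int:
    """Length of a coordinate interval, assuming 1-based inclusive positions."""
    return end - start + 1


def coding_codons(length_bp: int) -> int:
    """Amino-acid codons in a CDS, assuming one trailing stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


print(span_length(START_BP, END_BP))   # 1491
print(coding_codons(GENE_LENGTH_BP))   # 496
```

Under these assumptions both derived values agree with the Gene Length and Protein Length fields of the record.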


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.0280365
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 28
Fosmid unclonability p-value: 0.329166
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTGAAAA AGTGCGACCT GACCGAAAAG ATCGACCAGA AGACCTTTGA TGAGAAAACA 
GGGCCGCTGC AGGCGCATCT TGGGCAGCTG CAACGGCAGC TCCATGCCGA GAAAATCCCG
GTAATTATCC TCATCGAAGG CTGGAATGCC GCGGGAATTT CACTCACGGT CAAGGAGCTT
ATCCGTTCTT TTGACCCCCG GGGGATCGAG CTTTCAACTA TCGGATCCCC TACTGATGCA
GAGCAGCACC GTCCGCTGTT ATGGAGGTTC TGGAACCGCA TCCCTCCCAA AGGCGGGATT
GCCATCTTTG ACCGGAGCTG GTACAGCCGG GCGCTCGCGG AGACCACCCG GCACCCGGCC
TGGGAAAAAC GCGTTGACAC GGCTATCCAG TCCATCAACA GGTTTGAGCG TCAGCTTGCC
GATGACAACA CGATCATCGT CAAGATTTTC CTTCATATCA GTAAAAAAGA GCAGGAAAAA
CGTTTCATTG ACCGTGAAAC AAACCCCCTT ACCTCGTGGA TGGTAACCCC TGAGGACTGG
GATTTCCACG GCGAGTATGA GACGTATTTC CCGCTTATCG ATTCTTTCCT TAGAAGGACC
AGTACTCCCG GTGCACCCTG GACGGTCATT GGTGCCGATG ATACCCGGTA CGCCATTCTT
ACCGCTTACA AGACGGTCAT AGCGGCCCTT GAAAAGAGAA AAACGGGCAG GGACGGTATA
TCCGGGAAGG AGCATGCCAC GTACACCCCC GATCTAGTCC CGGCGAAACG ACACGAGGAA
AAACCGGCCC GGCTCGATAA GGAGACCTAC GAAAAACGAC TCGCGGCCGC ACAGCTGATG
CTCGGCGATA TGCAGTCCAT CCTTTTCAAG CGCAATATCC CGCTTATTAT TATCTTTGAG
GGAAGGGATG CCGCGGGGAA AGGGGGCACC ATAATGCGCC TCACTCGTGA CTTAAACCCG
AGAGGCTACC GGGTCACACC CGTTGGCGGC CCAAACGAAT TTGAAAAAGA TCATCACTAT
CTCTGGAGGT TTATACGGAA ATACCCCCGC CAGGGCCATA CGACGATCTA TGACCGGAGC
TGGTACGGCA GGGTCCTTGT GGAACGGGTG GAAGGTTTTT GCACCCGGGC TGAATGGAAA
CGGGCCTATT CCGAGATAAA CGAAGTAGAG GAAGAATTCC GCTCGTGGGG CGGAGGAATT
TTGAAATTCT GGCTGGAGGT CAGCCCCGAA GAGCAGCTCA AACGGTTCCA GGGGCGGGAA
AAAGACCCGT CCAAGCAATG GAAGATCACT GAAGAAGACT GGCGGAACCG GGAGAAATGG
GACCAGTACA GCGAAGCCAT TGATGAGATG TTTGAAAAAA CAAGTACGAA AAACGCACCA
TGGATCGTAA TAAACTCGGA TGATAAATGG AATGCACGGA TAAAAACCGT TGAGACTGTC
TGCGAGTACG CCGAGCGCCT GCTTCAGGTA AGATACCAGC ACTACAGTTA A
 
Protein sequence
MLKKCDLTEK IDQKTFDEKT GPLQAHLGQL QRQLHAEKIP VIILIEGWNA AGISLTVKEL 
IRSFDPRGIE LSTIGSPTDA EQHRPLLWRF WNRIPPKGGI AIFDRSWYSR ALAETTRHPA
WEKRVDTAIQ SINRFERQLA DDNTIIVKIF LHISKKEQEK RFIDRETNPL TSWMVTPEDW
DFHGEYETYF PLIDSFLRRT STPGAPWTVI GADDTRYAIL TAYKTVIAAL EKRKTGRDGI
SGKEHATYTP DLVPAKRHEE KPARLDKETY EKRLAAAQLM LGDMQSILFK RNIPLIIIFE
GRDAAGKGGT IMRLTRDLNP RGYRVTPVGG PNEFEKDHHY LWRFIRKYPR QGHTTIYDRS
WYGRVLVERV EGFCTRAEWK RAYSEINEVE EEFRSWGGGI LKFWLEVSPE EQLKRFQGRE
KDPSKQWKIT EEDWRNREKW DQYSEAIDEM FEKTSTKNAP WIVINSDDKW NARIKTVETV
CEYAERLLQV RYQHYS
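
The gene and protein sequences can be compared by translating the nucleotide sequence codon by codon. The Python sketch below is a minimal illustration, not the database's own method. It assumes the standard codon-to-amino-acid assignments, which translation table 11 shares with table 1 (the two differ only in permitted start codons), and it translates only the first 30 bases of the gene sequence. The `translate` helper is a hypothetical name introduced for this example.

```python
from itertools import product

# Standard genetic code in NCBI order (bases T, C, A, G; first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence above.
first_blocks = "ATGCTGAAAA AGTGCGACCT GACCGAAAAG"
print(translate(first_blocks))  # MLKKCDLTEK
```

The result matches the first 10-residue block of the protein sequence. Passing the full gene sequence to the same function would allow the whole record to be checked in the same way.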