Gene Mboo_2159 details


Gene Information

Locus tag: Mboo_2159
Symbol:
ID: 5410135
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Methanoregula boonei 6A8
Kingdom: Archaea
Replicon accession: NC_009712
Strand:
Start bp: 2230466
End bp: 2231599
Gene Length: 1134 bp
Protein Length: 377 aa
Translation table: 11
GC content: 54%
IMG OID: 640869404
Product: glycosyltransferase family 28 protein
Protein accession: YP_001405316
Protein GI: 154151698
COG category: [C] Energy production and conversion; [G] Carbohydrate transport and metabolism
COG ID: [COG1819] Glycosyl transferases, related to UDP-glucuronosyltransferase
TIGRFAM ID: [TIGR00661] conserved hypothetical protein
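
The start, end, gene length and protein length fields in this record are related by simple arithmetic. The sketch below checks that relationship. It assumes the coordinates are 1-based and inclusive and that the gene length includes the stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumption)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length_bp(2230466, 2231599)
    print(length, protein_length_aa(length))
```

With the values listed for Mboo_2159, the computed lengths agree with the Gene Length and Protein Length fields (1134 bp and 377 aa).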


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.188732
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 39
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGGATTT TGTTTGTTGT ATGCGGGGAG GGTCTGGGCC ACACATCCCG GTGTATCCAT 
CTCGGCCATT ACCTGGAGCA GCAGGGTCAT TCGGTCAGTT TCCTGGCATA CGGGAAATCC
TACGACTTTT TCCGGGACCA TGGATGTACC CGGGTTTACC GTGGGGAACG CGAGGTCTGC
CTCGAAGGTG AGAATGGCTT TTTCTCCTTA AAAAAGACCC TTTGGTGCTC GCGCTGGATC
GTCATAAACA TGGTCCGGTC CGGGCTGCGC GTCAGACGTT TGATCCGGGA ACAGCAGATC
GACTGCGTTG TTTGTGACAC GATGTACGCA GGTGTCCTTG CTGCACGGTT TTGCCGGGTA
CCGGTGATCT TTATTACCAA CCAGAACCGG TTCAGCGGCC CGGGAGGGGC GAAGAACCCG
GTCTGGAGCG TGCTCAACTT CCTGATCCGA CGCTACCTTA AACTCGCCGA TGCCGTTATC
ATTCCCGACT ACCCTCCACC GGATTCCGTG AGCGAGTACA ATCTCCTCAT TCCGGAGAAA
GAGAAACCGC ACTATCATTT CACCGGCCCG TTTCTGGAGA TCGATCTCAA CCGGTACCAA
TTCTCGCAGG AGACGATCTT TACCAGTTTC GGGGGAGAGC CCTACAAACT CCCCCTGTAC
CGGTTGCTGC GGACGATCGC GGACAAACGA AAAGACCTGA TGTTCGATGT TTTCTATACA
GGTGCAACCC TCCCGGGATC CTCGGATAAT TTCCTTTCCC ATGGGTATGT GCCTAATATC
TATGAACATC TCGCCGAGGC CCGGATTGCC ATCGTGCATG GCGGATTGAC CACCCTCCAC
GAAGCGTTGC TCTTCAATAA GCCCGTCCTC ATCATCATGG ACCCGGGCCA TCCTGAACAG
CAGAACAACG CACAAAAAAT CGTTGACCTG GGAGCAGGGA CGGTAGTGGA TGGCAGGACA
GTCACTCTTG AAATTCTCGA ACAAAAGATT GCAGAGACTC TCTCCCTTCC CCTCCGGTCT
GGCGGTCGCG ATCTGGCTGC GGTAAATGGC AGAAAAAATG CCGCAGCAGT TATTGAGGCG
TCTTTCGGTG CTGCTGCAAA CAGGAACATT AAGGTTTCAT CCTCCAAACC CTAA
 
Protein sequence
MRILFVVCGE GLGHTSRCIH LGHYLEQQGH SVSFLAYGKS YDFFRDHGCT RVYRGEREVC 
LEGENGFFSL KKTLWCSRWI VINMVRSGLR VRRLIREQQI DCVVCDTMYA GVLAARFCRV
PVIFITNQNR FSGPGGAKNP VWSVLNFLIR RYLKLADAVI IPDYPPPDSV SEYNLLIPEK
EKPHYHFTGP FLEIDLNRYQ FSQETIFTSF GGEPYKLPLY RLLRTIADKR KDLMFDVFYT
GATLPGSSDN FLSHGYVPNI YEHLAEARIA IVHGGLTTLH EALLFNKPVL IIMDPGHPEQ
QNNAQKIVDL GAGTVVDGRT VTLEILEQKI AETLSLPLRS GGRDLAAVNG RKNAAAVIEA
SFGAAANRNI KVSSSKP
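
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any length or composition check. The following is a minimal sketch of that clean-up step using only the standard library; `join_blocks` and `gc_percent` are hypothetical helper names, and the GC calculation assumes an unambiguous A/C/G/T sequence.

```python
def join_blocks(text: str) -> str:
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


if __name__ == "__main__":
    # Last printed line of the gene sequence, used here only as sample input.
    last_line = "TCTTTCGGTG CTGCTGCAAA CAGGAACATT AAGGTTTCAT CCTCCAAACC CTAA"
    seq = join_blocks(last_line)
    print(len(seq), seq[-3:], round(gc_percent(seq), 1))
```

Applied to the full gene sequence, the joined length can be compared with the Gene Length field, and the rounded GC percentage with the GC content field.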