Gene Mboo_2042 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMboo_2042 
Symbol 
ID5411169 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Methanoregula boonei 6A8 
KingdomArchaea 
Replicon accessionNC_009712 
Strand
Start bp2117972 
End bp2119084 
Gene Length1113 bp 
Protein Length370 aa 
Translation table11 
GC content58% 
IMG OID640869284 
Productthiamine biosynthesis/tRNA modification protein ThiI 
Protein accessionYP_001405199 
Protein GI154151581 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0301] Thiamine biosynthesis ATP pyrophosphatase 
TIGRFAM ID[TIGR00342] thiazole biosynthesis/tRNA modification protein ThiI 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value0.389674 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGAAAA AATGGCTGGT CAGGTACTCG GAGATCTTCT TAAAATCGGA TCCGGTGCGC 
CGGCACTGGG AGCGCGTTTT GATGAACAAC ATCCGGCAGC TGATGCCGGA CGTCAGGATC
AAGAACGAAC GCGGCCGGAT CTGGCTGACC GGTGATGCAG ACCCGGTAAA ACTGCGGCAC
ATCTTTGGGA TTGTCTCGTT CTCCGAGGTT GAGCATGTTC CCCGCGAGGT TACCCTCGAA
GAGGCCCTTA TCGAGTATGG CCGGGCCCAT GGACTGTCCC TGGCAAAGAC CTTTGCACTC
CGGATAAAAC GGGTGGGAAA ACACGATTTC TCCTCAAACG ACAAGGCCAT CGAACTGGGT
GACCAGGTAA GAAAGGCCTT CCCGCATCTC AAGGTAAACC TCGCCACTCC CGATGTGGAG
ATCCATGTCG AGATCCGGCA GGATGAGTGC TACCTGTACG ATACCGTGAT CAAGGGGGCG
GGCGGTCTTC CCCTCGGGGT AGAGGGAACG CTTGTTGCCC TTGTCTCGGG CGGGATCGAT
TCTCCTGTTG CAACGTACAT GATGATGAAG CGGGGCTGTA AGATCGTCCC CATCTATGTA
GCACTCGAGA CCTTCCTTGA CGAGACCGTG CTTGCCCGGG CCGAGCGGGT GGTAGAGATC
CTGCGGCAGT ACCAGCCGGA CCTGAAGCTC CGGGTGATCC ATGATTCGTA CCTGGCAGCT
GCAAAAGAGG AACTGATCCG GAACCACCAG GAGAAGTATA CCTGTCTCTT CTGCAAACGA
CGTATGTACC GGATTGCGCA GGCCGTAGCC CAGGAAGTGG GGGCAAAAGG TATCGTGAAC
GGGGAGTCGC TCGGGCAGGT AGCCAGCCAG ACTCTCGACA ACCTTGTTGT CCTCTCCGAT
GTGGCGGAGA TCCCGGTGTA CCGTCCGCTT ATCGGGTTTG ACAAGGCAGA TGCCATTGCG
CTTGCCCGCG AGATCGGAAC CTTTGAAGAG TCCACCAGTA AGGCATCCGG CTGCAAGGCG
GTACCCAACG GGCCGTCCAC CAGGGCCCAG CTTGACGAGA TCCTTGCGAT CGAAAGTGCG
CTGGAAGCAA CAAAGATCCC GCTGCCGGTG TAA
 
Protein sequence
MTKKWLVRYS EIFLKSDPVR RHWERVLMNN IRQLMPDVRI KNERGRIWLT GDADPVKLRH 
IFGIVSFSEV EHVPREVTLE EALIEYGRAH GLSLAKTFAL RIKRVGKHDF SSNDKAIELG
DQVRKAFPHL KVNLATPDVE IHVEIRQDEC YLYDTVIKGA GGLPLGVEGT LVALVSGGID
SPVATYMMMK RGCKIVPIYV ALETFLDETV LARAERVVEI LRQYQPDLKL RVIHDSYLAA
AKEELIRNHQ EKYTCLFCKR RMYRIAQAVA QEVGAKGIVN GESLGQVASQ TLDNLVVLSD
VAEIPVYRPL IGFDKADAIA LAREIGTFEE STSKASGCKA VPNGPSTRAQ LDEILAIESA
LEATKIPLPV