Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mlab_0123 |
Symbol | |
ID | 4796104 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methanocorpusculum labreanum Z |
Kingdom | Archaea |
Replicon accession | NC_008942 |
Strand | - |
Start bp | 111901 |
End bp | 112986 |
Gene Length | 1086 bp |
Protein Length | 361 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 640098770 |
Product | hypothetical protein |
Protein accession | YP_001029567 |
Protein GI | 124484951 |
COG category | [H] Coenzyme transport and metabolism [R] General function prediction only |
COG ID | [COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes |
TIGRFAM ID | [TIGR00423] radical SAM domain protein, CofH subfamily [TIGR03551] 7,8-didemethyl-8-hydroxy-5-deazariboflavin synthase, CofH subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 38 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 42 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACGTCA CCACCATTCT GAATGATGTC CTTGGAGGCT CACGGCTCAC GGAAGATGAA GCTGCATTCC TGTTCTCGGT GCAAAACCGG GATATATGGA AGGTTGCCGA GGCCGCCGAT ATCATTCGCG AACGAAAAAA CGGCGATGTC GTAACATATG TCCGAAACAT GAACATCCAC ATGACCAACA TCTGCAAAAA CTGTTGCGGG TTATGTGCAT TCGGACGAAA GAGTACCGAT CCCGAGGCTT TCTGCTTCAC GGATGAAGAG TTCCGGGAGC ATACCAAAGA TGCGGGCAGA AAAAAGGTCA CCGAGGTCTC GTATCTTTCC GGAATTCACC CGGAATTCAC CATCGAAAGT TACGAAAAGA TGATCCGGAC ATTCCACGAA GAGATCCCCG GCATCCACGT TCACGGATGC AGCCCGGATG AGATTTTGTT TGCAGCGAAC CAAAGCGACA TCACAACAAA AGAAGCACTC ATTCGGCTAA AAGATGCGGG ACTCGGTTCA GTCCAGGGAA CGGCGGCGGA GATTCTCGTC GACCGTGTTC GAAACATCAT CTGCAGTAAA AAACTCTCCA CGGCAGAGTG GGTCAGGATC ATCAAAGAAG CCTCGGAAGT TGGATTTCGA GCAACGTCCA CGATCATGTA CGGATCGGTG GAAACGGAAA AGGAACGGGC ACGCCATCTA TCGGTCCTTC GCGACGTCCA GGACGAAACC GGTGTATTTA CCGAACTTGT TCCGCTGGCG TTCCTGCACA AAAATACGCC GCTCGAACGG GCAGGAATCG TAAACCATGA TGCAACAGGA CGCGAAGACA TTCTGCTGAT CGCCATTTCA CGGCTGTTTT TGGACAACTT CGACAACATC CAGGTACCCT GGTCGAAGAT CGGACGAAAG GTCACACAGT TGTCCCTGAT GGCGGGAGGA AATGATGTCG GGGGGACGAT GTTTGTGGAC GCCCTTTCAA AGGATGCCGG CGGCGGAGAT GAATCGGATT ACTTCAGTCC CGAAGACATG AAAATAATGT GTGACGATAT CGGCAGGACA CTTCGTCAGA GAGACACGTT CTATAATCTC ATCTGA
|
Protein sequence | MNVTTILNDV LGGSRLTEDE AAFLFSVQNR DIWKVAEAAD IIRERKNGDV VTYVRNMNIH MTNICKNCCG LCAFGRKSTD PEAFCFTDEE FREHTKDAGR KKVTEVSYLS GIHPEFTIES YEKMIRTFHE EIPGIHVHGC SPDEILFAAN QSDITTKEAL IRLKDAGLGS VQGTAAEILV DRVRNIICSK KLSTAEWVRI IKEASEVGFR ATSTIMYGSV ETEKERARHL SVLRDVQDET GVFTELVPLA FLHKNTPLER AGIVNHDATG REDILLIAIS RLFLDNFDNI QVPWSKIGRK VTQLSLMAGG NDVGGTMFVD ALSKDAGGGD ESDYFSPEDM KIMCDDIGRT LRQRDTFYNL I
|
| |