Gene Information |
Locus tag | Mlab_0540 |
Symbol | |
ID | 4796139 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methanocorpusculum labreanum Z |
Kingdom | Archaea |
Replicon accession | NC_008942 |
Strand | - |
Start bp | 513545 |
End bp | 515473 |
Gene Length | 1929 bp |
Protein Length | 642 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 640099198 |
Product | hypothetical protein |
Protein accession | YP_001029981 |
Protein GI | 124485365 |
COG category | [K] Transcription |
COG ID | [COG1293] Predicted RNA-binding protein homologous to eukaryotic snRNP |
TIGRFAM ID | |
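The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that Gene Length counts the terminal stop codon; neither convention is stated in the record itself. The function names `span_length` and `expected_protein_length` are hypothetical helpers introduced only for illustration.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 513545
end_bp = 515473
gene_length_bp = 1929
protein_length_aa = 642


def span_length(start, end):
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Residue count for a CDS, assuming its last codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks pass for this record under the assumptions stated above.
assert span_length(start_bp, end_bp) == gene_length_bp
assert expected_protein_length(gene_length_bp) == protein_length_aa
```

Under these assumptions the record is internally consistent: the span covers 1929 bp, which is 643 codons, and dropping the stop codon leaves 642 residues.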
| |
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.0523546 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 0.309902 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGACGA TACAAGGGAT GAGCGGAGCG GACGTAAAAG CGATGACAGC GGAGCTTGCC GCGCTTTTGC CGCTCTGGAT CGGGAAAATA TATCAGTACG ACAATGCCTC GCTTGGGTTC CGGCTGAATG GTGAGGAGAA GGCACGCCAT CTTCTGTATG TGGTGAGAGG CATTCGGGCG CATCTCGTAT CCGAACTGCC GCCGGCGCCG AAAAACCCGT CCGGGTTTTC GATGTATCTT AGGAAATACA TCGAGGGGGG CAAGGTCCTC AATATCGAGC AGAAGGCGAT CGAGCGGGTC ATCATAATAA CGATCGGCAA AGGGCCGTCG GAGTATAAGC TGATCATCGA ACTCTTCGAC GAAGGAAACC TTATCCTGAC AGACGAAAAG TTCACGATCA TCAATGCCCT TGCCCAGCGG CGGTTCCGGG ACCGGGATAT CGTCGGCGGC GCAGAGTATG CGATCGAAGC CGTCTGGCCC GAGAGGCTGA CGTTCGAGGA GTTCAAAGAG AAGATCACCG CAGACGAGAA CGATATCGTG CGGGCTCTCG CAACGAAACT GCAGCTCGGC GGCATCCCCT CGGAAGAGAT ATGTCAGCTT GCGGGCGTCT CGAAGTCGAT GCCCTGTAAG TTCGCGACCG ATGTCCAGCT TCGCCCGGTG TATGAGGCGA TGAAAAGCTG GATCGCCAAA CTGACCTATG CACGCGATCC GGTGATCGAT GCGAAGGGAG CGTTCCCGTT CCCCTCACTG GTCCGCGAGC CAAAAGAGCA CTTCGCGACG TTTTCGCAGG CGCTCGAGGC GTTCTATCCA AAACCGGTGG CCGAGAAGGT CATCGAACAG AAGATCAAAC TCTCCAAAGA GGAGCGGATC CGAAAACAGC AGGAAGCAGC CGTCGTCAAC TTCGACAAAA AGATCGCCGA GGCGACCGAG ATCTCGGAGA TCATCTACTC GCATTACGGC GAGGTGCAGG AGACGATCGA CGTTCTTGCG GCCGCAAGCC AGAAGCTTTC CTGGCAGGAT ATCGCGGCCG TGATCAAAAA GAGCGACCTG CCCGCCGCAA AACGGATCAT CTCGGTGGAC CCGAAGAATG CGTCCGTCGT GATCGATCTG CAGGAAAAGC ACAAGGTCAC GATCTTCGTG CACGAAAGTC TGGAGGCAAA CGTCGGCAGA TACTTCGCCG TCGTGAAAAA GTTCCGGGCG AAAAAGGCGG GAGCGCTTCG GGCGATGGAA GCAGGCATCG TGCATGCGGA GAAGAAAAAG GCGGCCGGAC CCGGCCGACT CAAGCCGAAA TGGTATCACC GTTTCCGGTG GATGGAGACC TCCGACGGCG TGCTTGTGAT CGGCGGCCGA AATGCCGACC AGAACGAGGA GCTTGTCAAG AAGTATATGG AAGGCAAAGA CACCTTCCTC CACGCAGATG TCTTCGGAGC GTCCGCGGTG ATCGTGAAAG GCGTAACGGA ACGCATGGAT CAGGCGGTCC AGTTTGCCGC GTCCTACTCG CGGGCATGGG CCGGCGGTGG GGCATCCGTC GATGTGATCG CAGCAAGCCC CAACCAGGTA AGTAAGACGC CCGAGTCGGG AGAGTATGTG GCTCACGGTT CGTTCGTCAT CCGGGGCGAG CGTAAGATCT ACAAGGACGT GCCGCTCGAG ATCGCGATCG GGGTCAGGAC CGAACCGGTC CTTGCGGTGA TCGGCGGAAC GCCTTCCGCG ATCGAACCGC TGGCTTCGCA TTCCGTTCGT CTGGTTCCGG GGACATTCGA AGGAAACGAC GTTGCTAAGA AAGTTCTTCG AAAACTCAAA 
GAGGCGGTGC CTGAAAGCGA GCAGAAAGCG CTCAAAGCGA TCCTGAACAC CGAGGGCGTA GCCGCGTTCG TGCCGCCGGG CGGGTCCGAT CTCAAAGAGC CGGCCCGAGA GGGAGCGGAC CGCGAATGA
|
Protein sequence | MATIQGMSGA DVKAMTAELA ALLPLWIGKI YQYDNASLGF RLNGEEKARH LLYVVRGIRA HLVSELPPAP KNPSGFSMYL RKYIEGGKVL NIEQKAIERV IIITIGKGPS EYKLIIELFD EGNLILTDEK FTIINALAQR RFRDRDIVGG AEYAIEAVWP ERLTFEEFKE KITADENDIV RALATKLQLG GIPSEEICQL AGVSKSMPCK FATDVQLRPV YEAMKSWIAK LTYARDPVID AKGAFPFPSL VREPKEHFAT FSQALEAFYP KPVAEKVIEQ KIKLSKEERI RKQQEAAVVN FDKKIAEATE ISEIIYSHYG EVQETIDVLA AASQKLSWQD IAAVIKKSDL PAAKRIISVD PKNASVVIDL QEKHKVTIFV HESLEANVGR YFAVVKKFRA KKAGALRAME AGIVHAEKKK AAGPGRLKPK WYHRFRWMET SDGVLVIGGR NADQNEELVK KYMEGKDTFL HADVFGASAV IVKGVTERMD QAVQFAASYS RAWAGGGASV DVIAASPNQV SKTPESGEYV AHGSFVIRGE RKIYKDVPLE IAIGVRTEPV LAVIGGTPSA IEPLASHSVR LVPGTFEGND VAKKVLRKLK EAVPESEQKA LKAILNTEGV AAFVPPGGSD LKEPAREGAD RE
|